A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.

Whole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most dam...

Full description

Bibliographic Details
Main Authors: Matilda Rentoft, Daniel Svensson, Andreas Sjödin, Pall I Olason, Olle Sjöström, Carin Nylander, Pia Osterman, Rickard Sjögren, Sergiu Netotea, Carl Wibom, Kristina Cederquist, Andrei Chabes, Johan Trygg, Beatrice S Melin, Erik Johansson
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0213350
id doaj-9b8a06b2b5274ef9a69fa9d3f6ad38d7
record_format Article
spelling doaj-9b8a06b2b5274ef9a69fa9d3f6ad38d72021-03-03T20:47:07ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01143e021335010.1371/journal.pone.0213350A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.Matilda RentoftDaniel SvenssonAndreas SjödinPall I OlasonOlle SjöströmCarin NylanderPia OstermanRickard SjögrenSergiu NetoteaCarl WibomKristina CederquistAndrei ChabesJohan TryggBeatrice S MelinErik JohanssonWhole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most damaging variants, e.g. those found in coding regions, and overlooking the remaining genetic variation. Such a biased approach explains in part why the genetic causes of many families with dominantly inherited diseases, in spite of being included in whole-genome sequencing studies, are left unsolved today. Here we explore the use of a geographically matched control population to minimize the number of candidate disease-causing variants without excluding variants based on assumptions on genomic position or functional predictions. To exemplify the benefit of the geographically matched control population we apply a typical disease variant filtering strategy in a family with an autosomal dominant form of colorectal cancer. With the use of the geographically matched control population we end up with 26 candidate variants genome wide. This is in contrast to the tens of thousands of candidates left when only making use of available public variant datasets. The effect of the local control population is dual, it (1) reduces the total number of candidate variants shared between affected individuals, and more importantly (2) increases the rate by which the number of candidate variants are reduced as additional affected family members are included in the filtering strategy. We demonstrate that the application of a geographically matched control population effectively limits the number of candidate disease-causing variants and may provide the means by which variants suitable for functional studies are identified genome wide.https://doi.org/10.1371/journal.pone.0213350
collection DOAJ
language English
format Article
sources DOAJ
author Matilda Rentoft
Daniel Svensson
Andreas Sjödin
Pall I Olason
Olle Sjöström
Carin Nylander
Pia Osterman
Rickard Sjögren
Sergiu Netotea
Carl Wibom
Kristina Cederquist
Andrei Chabes
Johan Trygg
Beatrice S Melin
Erik Johansson
spellingShingle Matilda Rentoft
Daniel Svensson
Andreas Sjödin
Pall I Olason
Olle Sjöström
Carin Nylander
Pia Osterman
Rickard Sjögren
Sergiu Netotea
Carl Wibom
Kristina Cederquist
Andrei Chabes
Johan Trygg
Beatrice S Melin
Erik Johansson
A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.
PLoS ONE
author_facet Matilda Rentoft
Daniel Svensson
Andreas Sjödin
Pall I Olason
Olle Sjöström
Carin Nylander
Pia Osterman
Rickard Sjögren
Sergiu Netotea
Carl Wibom
Kristina Cederquist
Andrei Chabes
Johan Trygg
Beatrice S Melin
Erik Johansson
author_sort Matilda Rentoft
title A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.
title_short A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.
title_full A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.
title_fullStr A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.
title_full_unstemmed A geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.
title_sort geographically matched control population efficiently limits the number of candidate disease-causing variants in an unbiased whole-genome analysis.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2019-01-01
description Whole-genome sequencing is a promising approach for human autosomal dominant disease studies. However, the vast number of genetic variants observed by this method constitutes a challenge when trying to identify the causal variants. This is often handled by restricting disease studies to the most damaging variants, e.g. those found in coding regions, and overlooking the remaining genetic variation. Such a biased approach explains in part why the genetic causes of many families with dominantly inherited diseases, in spite of being included in whole-genome sequencing studies, are left unsolved today. Here we explore the use of a geographically matched control population to minimize the number of candidate disease-causing variants without excluding variants based on assumptions on genomic position or functional predictions. To exemplify the benefit of the geographically matched control population we apply a typical disease variant filtering strategy in a family with an autosomal dominant form of colorectal cancer. With the use of the geographically matched control population we end up with 26 candidate variants genome wide. This is in contrast to the tens of thousands of candidates left when only making use of available public variant datasets. The effect of the local control population is dual, it (1) reduces the total number of candidate variants shared between affected individuals, and more importantly (2) increases the rate by which the number of candidate variants are reduced as additional affected family members are included in the filtering strategy. We demonstrate that the application of a geographically matched control population effectively limits the number of candidate disease-causing variants and may provide the means by which variants suitable for functional studies are identified genome wide.
url https://doi.org/10.1371/journal.pone.0213350
work_keys_str_mv AT matildarentoft ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT danielsvensson ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT andreassjodin ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT palliolason ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT ollesjostrom ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT carinnylander ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT piaosterman ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT rickardsjogren ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT sergiunetotea ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT carlwibom ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT kristinacederquist ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT andreichabes ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT johantrygg ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT beatricesmelin ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT erikjohansson ageographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT matildarentoft geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT danielsvensson geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT andreassjodin geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT palliolason geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT ollesjostrom geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT carinnylander geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT piaosterman geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT rickardsjogren geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT sergiunetotea geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT carlwibom geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT kristinacederquist geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT andreichabes geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT johantrygg geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT beatricesmelin geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
AT erikjohansson geographicallymatchedcontrolpopulationefficientlylimitsthenumberofcandidatediseasecausingvariantsinanunbiasedwholegenomeanalysis
_version_ 1714820663488806912