Estimated prevalence of mucopolysaccharidoses from population-based exomes and genomes

Abstract Background In this study, the prevalence of different types of mucopolysaccharidoses (MPS) was estimated based on data from the exome aggregation consortium (ExAC) and the genome aggregation database (gnomAD). The population-based allele frequencies were used to identify potential disease-c...

Full description

Bibliographic Details
Main Authors: Pâmella Borges, Gabriela Pasqualim, Roberto Giugliani, Filippo Vairo, Ursula Matte
Format: Article
Language:English
Published: BMC 2020-11-01
Series:Orphanet Journal of Rare Diseases
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13023-020-01608-0
id doaj-1c12f9b1c4f54d9d98be953a87b012c7
record_format Article
spelling doaj-1c12f9b1c4f54d9d98be953a87b012c72020-11-25T04:03:17ZengBMCOrphanet Journal of Rare Diseases1750-11722020-11-011511910.1186/s13023-020-01608-0Estimated prevalence of mucopolysaccharidoses from population-based exomes and genomesPâmella Borges0Gabriela Pasqualim1Roberto Giugliani2Filippo Vairo3Ursula Matte4Cell, Tissue and Gene Laboratory, Clinicas Hospital of Porto AlegreGenetics Laboratory, Biological Sciences Institute, Federal University of Rio Grande (FURG)Graduate Programme in Genetics and Molecular Biology, Federal University of Rio Grande Do Sul (UFRGS)Center for Individualized Medicine, Mayo ClinicCell, Tissue and Gene Laboratory, Clinicas Hospital of Porto AlegreAbstract Background In this study, the prevalence of different types of mucopolysaccharidoses (MPS) was estimated based on data from the exome aggregation consortium (ExAC) and the genome aggregation database (gnomAD). The population-based allele frequencies were used to identify potential disease-causing variants on each gene related to MPS I to IX (except MPS II). Methods We evaluated the canonical transcripts and excluded homozygous, intronic, 3′, and 5′ UTR variants. Frameshift and in-frame insertions and deletions were evaluated using the SIFT Indel tool. Splice variants were evaluated using SpliceAI and Human Splice Finder 3.0 (HSF). Loss-of-function single nucleotide variants in coding regions were classified as potentially pathogenic, while synonymous variants outside the exon–intron boundaries were deemed non-pathogenic. Missense variants were evaluated by five in silico prediction tools, and only those predicted to be damaging by at least three different algorithms were considered disease-causing. Results The combined frequencies of selected variants (ranged from 127 in GNS to 259 in IDUA) were used to calculate prevalence based on Hardy–Weinberg's equilibrium. The maximum estimated prevalence ranged from 0.46 per 100,000 for MPSIIID to 7.1 per 100,000 for MPS I. Overall, the estimated prevalence of all types of MPS was higher than what has been published in the literature. This difference may be due to misdiagnoses and/or underdiagnoses, especially of the attenuated forms of MPS. However, overestimation of the number of disease-causing variants by in silico predictors cannot be ruled out. Even so, the disease prevalences are similar to those reported in diagnosis-based prevalence studies. Conclusion We report on an approach to estimate the prevalence of different types of MPS based on publicly available population-based genomic data, which may help health systems to be better prepared to deal with these conditions and provide support to initiatives on diagnosis and management of MPS.http://link.springer.com/article/10.1186/s13023-020-01608-0Mucopolysaccharidoses (MPS)Estimated prevalenceExome aggregation consortium (ExAC)Genome aggregation database (gnomAD)In silico analysis
collection DOAJ
language English
format Article
sources DOAJ
author Pâmella Borges
Gabriela Pasqualim
Roberto Giugliani
Filippo Vairo
Ursula Matte
spellingShingle Pâmella Borges
Gabriela Pasqualim
Roberto Giugliani
Filippo Vairo
Ursula Matte
Estimated prevalence of mucopolysaccharidoses from population-based exomes and genomes
Orphanet Journal of Rare Diseases
Mucopolysaccharidoses (MPS)
Estimated prevalence
Exome aggregation consortium (ExAC)
Genome aggregation database (gnomAD)
In silico analysis
author_facet Pâmella Borges
Gabriela Pasqualim
Roberto Giugliani
Filippo Vairo
Ursula Matte
author_sort Pâmella Borges
title Estimated prevalence of mucopolysaccharidoses from population-based exomes and genomes
title_short Estimated prevalence of mucopolysaccharidoses from population-based exomes and genomes
title_full Estimated prevalence of mucopolysaccharidoses from population-based exomes and genomes
title_fullStr Estimated prevalence of mucopolysaccharidoses from population-based exomes and genomes
title_full_unstemmed Estimated prevalence of mucopolysaccharidoses from population-based exomes and genomes
title_sort estimated prevalence of mucopolysaccharidoses from population-based exomes and genomes
publisher BMC
series Orphanet Journal of Rare Diseases
issn 1750-1172
publishDate 2020-11-01
description Abstract Background In this study, the prevalence of different types of mucopolysaccharidoses (MPS) was estimated based on data from the exome aggregation consortium (ExAC) and the genome aggregation database (gnomAD). The population-based allele frequencies were used to identify potential disease-causing variants on each gene related to MPS I to IX (except MPS II). Methods We evaluated the canonical transcripts and excluded homozygous, intronic, 3′, and 5′ UTR variants. Frameshift and in-frame insertions and deletions were evaluated using the SIFT Indel tool. Splice variants were evaluated using SpliceAI and Human Splice Finder 3.0 (HSF). Loss-of-function single nucleotide variants in coding regions were classified as potentially pathogenic, while synonymous variants outside the exon–intron boundaries were deemed non-pathogenic. Missense variants were evaluated by five in silico prediction tools, and only those predicted to be damaging by at least three different algorithms were considered disease-causing. Results The combined frequencies of selected variants (ranged from 127 in GNS to 259 in IDUA) were used to calculate prevalence based on Hardy–Weinberg's equilibrium. The maximum estimated prevalence ranged from 0.46 per 100,000 for MPSIIID to 7.1 per 100,000 for MPS I. Overall, the estimated prevalence of all types of MPS was higher than what has been published in the literature. This difference may be due to misdiagnoses and/or underdiagnoses, especially of the attenuated forms of MPS. However, overestimation of the number of disease-causing variants by in silico predictors cannot be ruled out. Even so, the disease prevalences are similar to those reported in diagnosis-based prevalence studies. Conclusion We report on an approach to estimate the prevalence of different types of MPS based on publicly available population-based genomic data, which may help health systems to be better prepared to deal with these conditions and provide support to initiatives on diagnosis and management of MPS.
topic Mucopolysaccharidoses (MPS)
Estimated prevalence
Exome aggregation consortium (ExAC)
Genome aggregation database (gnomAD)
In silico analysis
url http://link.springer.com/article/10.1186/s13023-020-01608-0
work_keys_str_mv AT pamellaborges estimatedprevalenceofmucopolysaccharidosesfrompopulationbasedexomesandgenomes
AT gabrielapasqualim estimatedprevalenceofmucopolysaccharidosesfrompopulationbasedexomesandgenomes
AT robertogiugliani estimatedprevalenceofmucopolysaccharidosesfrompopulationbasedexomesandgenomes
AT filippovairo estimatedprevalenceofmucopolysaccharidosesfrompopulationbasedexomesandgenomes
AT ursulamatte estimatedprevalenceofmucopolysaccharidosesfrompopulationbasedexomesandgenomes
_version_ 1724440853294874624