An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder
Human genetic studies have implicated more than a hundred genes in Autism Spectrum Disorder (ASD). Understanding how variation in implicated genes influence expression of co-occurring conditions and drug response can inform more effective, personalized approaches for treatment of individuals with AS...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-11-01
|
Series: | International Journal of Molecular Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/1422-0067/21/23/9029 |
id |
doaj-75671ff89e8d401da7079693382377e8 |
---|---|
record_format |
Article |
spelling |
doaj-75671ff89e8d401da7079693382377e82020-11-28T00:06:28ZengMDPI AGInternational Journal of Molecular Sciences1661-65961422-00672020-11-01219029902910.3390/ijms21239029An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum DisorderOlivia J. Veatch0Merlin G. Butler1Sarah H. Elsea2Beth A. Malow3James S. Sutcliffe4Jason H. Moore5Department of Psychiatry and Behavioral Sciences, University of Kansas Medical Center, Kansas City, MO 66160, USADepartment of Psychiatry and Behavioral Sciences, University of Kansas Medical Center, Kansas City, MO 66160, USADepartment of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USASleep Disorders Division, Department of Neurology, Vanderbilt University Medical Center, Nashville, TN 37232, USAVanderbilt Genetics Institute, Department of Molecular Physiology & Biophysics, Department of Psychiatry and Behavioral Sciences, Vanderbilt University School of Medicine, Nashville, TN 37232, USADepartment of Biostatistics, Epidemiology, & Informatics, University of Pennsylvania, Philadelphia, PA 19104, USAHuman genetic studies have implicated more than a hundred genes in Autism Spectrum Disorder (ASD). Understanding how variation in implicated genes influence expression of co-occurring conditions and drug response can inform more effective, personalized approaches for treatment of individuals with ASD. Rapidly translating this information into the clinic requires efficient algorithms to sort through the myriad of genes implicated by rare gene-damaging single nucleotide and copy number variants, and common variation detected in genome-wide association studies (GWAS). To pinpoint genes that are more likely to have clinically relevant variants, we developed a functional annotation pipeline. We defined clinical relevance in this project as any ASD associated gene with evidence indicating a patient may have a complex, co-occurring condition that requires direct intervention (e.g., sleep and gastrointestinal disturbances, attention deficit hyperactivity, anxiety, seizures, depression), or is relevant to drug development and/or approaches to maximizing efficacy and minimizing adverse events (i.e., pharmacogenomics). Starting with a list of all candidate genes implicated in all manifestations of ASD (i.e., idiopathic and syndromic), this pipeline uses databases that represent multiple lines of evidence to identify genes: (1) expressed in the human brain, (2) involved in ASD-relevant biological processes and resulting in analogous phenotypes in mice, (3) whose products are targeted by approved pharmaceutical compounds or possessing pharmacogenetic variation and (4) whose products directly interact with those of genes with variants recommended to be tested for by the American College of Medical Genetics (ACMG). Compared with 1000 gene sets, each with a random selection of human protein coding genes, more genes in the ASD set were annotated for each category evaluated (<i>p</i> ≤ 1.99 × 10<sup>−2</sup>). Of the 956 ASD-implicated genes in the full set, 18 were flagged based on evidence in all categories. Fewer genes from randomly drawn sets were annotated in all categories (x = 8.02, sd = 2.56, <i>p</i> = 7.75 × 10<sup>−4</sup>). Notably, none of the prioritized genes are represented among the 59 genes compiled by the ACMG, and 78% had a pathogenic or likely pathogenic variant in ClinVar. Results from this work should rapidly prioritize potentially actionable results from genetic studies and, in turn, inform future work toward clinical decision support for personalized care based on genetic testing.https://www.mdpi.com/1422-0067/21/23/9029bioinformaticshuman geneticspharmacogenomicsautism |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Olivia J. Veatch Merlin G. Butler Sarah H. Elsea Beth A. Malow James S. Sutcliffe Jason H. Moore |
spellingShingle |
Olivia J. Veatch Merlin G. Butler Sarah H. Elsea Beth A. Malow James S. Sutcliffe Jason H. Moore An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder International Journal of Molecular Sciences bioinformatics human genetics pharmacogenomics autism |
author_facet |
Olivia J. Veatch Merlin G. Butler Sarah H. Elsea Beth A. Malow James S. Sutcliffe Jason H. Moore |
author_sort |
Olivia J. Veatch |
title |
An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder |
title_short |
An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder |
title_full |
An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder |
title_fullStr |
An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder |
title_full_unstemmed |
An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder |
title_sort |
automated functional annotation pipeline that rapidly prioritizes clinically relevant genes for autism spectrum disorder |
publisher |
MDPI AG |
series |
International Journal of Molecular Sciences |
issn |
1661-6596 1422-0067 |
publishDate |
2020-11-01 |
description |
Human genetic studies have implicated more than a hundred genes in Autism Spectrum Disorder (ASD). Understanding how variation in implicated genes influence expression of co-occurring conditions and drug response can inform more effective, personalized approaches for treatment of individuals with ASD. Rapidly translating this information into the clinic requires efficient algorithms to sort through the myriad of genes implicated by rare gene-damaging single nucleotide and copy number variants, and common variation detected in genome-wide association studies (GWAS). To pinpoint genes that are more likely to have clinically relevant variants, we developed a functional annotation pipeline. We defined clinical relevance in this project as any ASD associated gene with evidence indicating a patient may have a complex, co-occurring condition that requires direct intervention (e.g., sleep and gastrointestinal disturbances, attention deficit hyperactivity, anxiety, seizures, depression), or is relevant to drug development and/or approaches to maximizing efficacy and minimizing adverse events (i.e., pharmacogenomics). Starting with a list of all candidate genes implicated in all manifestations of ASD (i.e., idiopathic and syndromic), this pipeline uses databases that represent multiple lines of evidence to identify genes: (1) expressed in the human brain, (2) involved in ASD-relevant biological processes and resulting in analogous phenotypes in mice, (3) whose products are targeted by approved pharmaceutical compounds or possessing pharmacogenetic variation and (4) whose products directly interact with those of genes with variants recommended to be tested for by the American College of Medical Genetics (ACMG). Compared with 1000 gene sets, each with a random selection of human protein coding genes, more genes in the ASD set were annotated for each category evaluated (<i>p</i> ≤ 1.99 × 10<sup>−2</sup>). Of the 956 ASD-implicated genes in the full set, 18 were flagged based on evidence in all categories. Fewer genes from randomly drawn sets were annotated in all categories (x = 8.02, sd = 2.56, <i>p</i> = 7.75 × 10<sup>−4</sup>). Notably, none of the prioritized genes are represented among the 59 genes compiled by the ACMG, and 78% had a pathogenic or likely pathogenic variant in ClinVar. Results from this work should rapidly prioritize potentially actionable results from genetic studies and, in turn, inform future work toward clinical decision support for personalized care based on genetic testing. |
topic |
bioinformatics human genetics pharmacogenomics autism |
url |
https://www.mdpi.com/1422-0067/21/23/9029 |
work_keys_str_mv |
AT oliviajveatch anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT merlingbutler anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT sarahhelsea anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT bethamalow anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT jamesssutcliffe anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT jasonhmoore anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT oliviajveatch automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT merlingbutler automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT sarahhelsea automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT bethamalow automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT jamesssutcliffe automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder AT jasonhmoore automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder |
_version_ |
1724413039728394240 |