An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder

Human genetic studies have implicated more than a hundred genes in Autism Spectrum Disorder (ASD). Understanding how variation in implicated genes influence expression of co-occurring conditions and drug response can inform more effective, personalized approaches for treatment of individuals with AS...

Full description

Bibliographic Details
Main Authors: Olivia J. Veatch, Merlin G. Butler, Sarah H. Elsea, Beth A. Malow, James S. Sutcliffe, Jason H. Moore
Format: Article
Language:English
Published: MDPI AG 2020-11-01
Series:International Journal of Molecular Sciences
Subjects:
Online Access:https://www.mdpi.com/1422-0067/21/23/9029
id doaj-75671ff89e8d401da7079693382377e8
record_format Article
spelling doaj-75671ff89e8d401da7079693382377e82020-11-28T00:06:28ZengMDPI AGInternational Journal of Molecular Sciences1661-65961422-00672020-11-01219029902910.3390/ijms21239029An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum DisorderOlivia J. Veatch0Merlin G. Butler1Sarah H. Elsea2Beth A. Malow3James S. Sutcliffe4Jason H. Moore5Department of Psychiatry and Behavioral Sciences, University of Kansas Medical Center, Kansas City, MO 66160, USADepartment of Psychiatry and Behavioral Sciences, University of Kansas Medical Center, Kansas City, MO 66160, USADepartment of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USASleep Disorders Division, Department of Neurology, Vanderbilt University Medical Center, Nashville, TN 37232, USAVanderbilt Genetics Institute, Department of Molecular Physiology & Biophysics, Department of Psychiatry and Behavioral Sciences, Vanderbilt University School of Medicine, Nashville, TN 37232, USADepartment of Biostatistics, Epidemiology, & Informatics, University of Pennsylvania, Philadelphia, PA 19104, USAHuman genetic studies have implicated more than a hundred genes in Autism Spectrum Disorder (ASD). Understanding how variation in implicated genes influence expression of co-occurring conditions and drug response can inform more effective, personalized approaches for treatment of individuals with ASD. Rapidly translating this information into the clinic requires efficient algorithms to sort through the myriad of genes implicated by rare gene-damaging single nucleotide and copy number variants, and common variation detected in genome-wide association studies (GWAS). To pinpoint genes that are more likely to have clinically relevant variants, we developed a functional annotation pipeline. We defined clinical relevance in this project as any ASD associated gene with evidence indicating a patient may have a complex, co-occurring condition that requires direct intervention (e.g., sleep and gastrointestinal disturbances, attention deficit hyperactivity, anxiety, seizures, depression), or is relevant to drug development and/or approaches to maximizing efficacy and minimizing adverse events (i.e., pharmacogenomics). Starting with a list of all candidate genes implicated in all manifestations of ASD (i.e., idiopathic and syndromic), this pipeline uses databases that represent multiple lines of evidence to identify genes: (1) expressed in the human brain, (2) involved in ASD-relevant biological processes and resulting in analogous phenotypes in mice, (3) whose products are targeted by approved pharmaceutical compounds or possessing pharmacogenetic variation and (4) whose products directly interact with those of genes with variants recommended to be tested for by the American College of Medical Genetics (ACMG). Compared with 1000 gene sets, each with a random selection of human protein coding genes, more genes in the ASD set were annotated for each category evaluated (<i>p</i> ≤ 1.99 × 10<sup>−2</sup>). Of the 956 ASD-implicated genes in the full set, 18 were flagged based on evidence in all categories. Fewer genes from randomly drawn sets were annotated in all categories (x = 8.02, sd = 2.56, <i>p</i> = 7.75 × 10<sup>−4</sup>). Notably, none of the prioritized genes are represented among the 59 genes compiled by the ACMG, and 78% had a pathogenic or likely pathogenic variant in ClinVar. Results from this work should rapidly prioritize potentially actionable results from genetic studies and, in turn, inform future work toward clinical decision support for personalized care based on genetic testing.https://www.mdpi.com/1422-0067/21/23/9029bioinformaticshuman geneticspharmacogenomicsautism
collection DOAJ
language English
format Article
sources DOAJ
author Olivia J. Veatch
Merlin G. Butler
Sarah H. Elsea
Beth A. Malow
James S. Sutcliffe
Jason H. Moore
spellingShingle Olivia J. Veatch
Merlin G. Butler
Sarah H. Elsea
Beth A. Malow
James S. Sutcliffe
Jason H. Moore
An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder
International Journal of Molecular Sciences
bioinformatics
human genetics
pharmacogenomics
autism
author_facet Olivia J. Veatch
Merlin G. Butler
Sarah H. Elsea
Beth A. Malow
James S. Sutcliffe
Jason H. Moore
author_sort Olivia J. Veatch
title An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder
title_short An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder
title_full An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder
title_fullStr An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder
title_full_unstemmed An Automated Functional Annotation Pipeline That Rapidly Prioritizes Clinically Relevant Genes for Autism Spectrum Disorder
title_sort automated functional annotation pipeline that rapidly prioritizes clinically relevant genes for autism spectrum disorder
publisher MDPI AG
series International Journal of Molecular Sciences
issn 1661-6596
1422-0067
publishDate 2020-11-01
description Human genetic studies have implicated more than a hundred genes in Autism Spectrum Disorder (ASD). Understanding how variation in implicated genes influence expression of co-occurring conditions and drug response can inform more effective, personalized approaches for treatment of individuals with ASD. Rapidly translating this information into the clinic requires efficient algorithms to sort through the myriad of genes implicated by rare gene-damaging single nucleotide and copy number variants, and common variation detected in genome-wide association studies (GWAS). To pinpoint genes that are more likely to have clinically relevant variants, we developed a functional annotation pipeline. We defined clinical relevance in this project as any ASD associated gene with evidence indicating a patient may have a complex, co-occurring condition that requires direct intervention (e.g., sleep and gastrointestinal disturbances, attention deficit hyperactivity, anxiety, seizures, depression), or is relevant to drug development and/or approaches to maximizing efficacy and minimizing adverse events (i.e., pharmacogenomics). Starting with a list of all candidate genes implicated in all manifestations of ASD (i.e., idiopathic and syndromic), this pipeline uses databases that represent multiple lines of evidence to identify genes: (1) expressed in the human brain, (2) involved in ASD-relevant biological processes and resulting in analogous phenotypes in mice, (3) whose products are targeted by approved pharmaceutical compounds or possessing pharmacogenetic variation and (4) whose products directly interact with those of genes with variants recommended to be tested for by the American College of Medical Genetics (ACMG). Compared with 1000 gene sets, each with a random selection of human protein coding genes, more genes in the ASD set were annotated for each category evaluated (<i>p</i> ≤ 1.99 × 10<sup>−2</sup>). Of the 956 ASD-implicated genes in the full set, 18 were flagged based on evidence in all categories. Fewer genes from randomly drawn sets were annotated in all categories (x = 8.02, sd = 2.56, <i>p</i> = 7.75 × 10<sup>−4</sup>). Notably, none of the prioritized genes are represented among the 59 genes compiled by the ACMG, and 78% had a pathogenic or likely pathogenic variant in ClinVar. Results from this work should rapidly prioritize potentially actionable results from genetic studies and, in turn, inform future work toward clinical decision support for personalized care based on genetic testing.
topic bioinformatics
human genetics
pharmacogenomics
autism
url https://www.mdpi.com/1422-0067/21/23/9029
work_keys_str_mv AT oliviajveatch anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT merlingbutler anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT sarahhelsea anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT bethamalow anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT jamesssutcliffe anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT jasonhmoore anautomatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT oliviajveatch automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT merlingbutler automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT sarahhelsea automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT bethamalow automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT jamesssutcliffe automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
AT jasonhmoore automatedfunctionalannotationpipelinethatrapidlyprioritizesclinicallyrelevantgenesforautismspectrumdisorder
_version_ 1724413039728394240