Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypes

Summary: Although clinical and laboratory data have long been used to guide medical practice, this information is rarely integrated with multi-omic data to identify endotypes. We present Merged Affinity Network Association Clustering (MANAclust), a coding-free, automated pipeline enabling integratio...

Full description

Bibliographic Details
Main Authors: Scott R. Tyler, Yoojin Chun, Victoria M. Ribeiro, Galina Grishina, Alexander Grishin, Gabriel E. Hoffman, Anh N. Do, Supinda Bunyavanich
Format: Article
Language:English
Published: Elsevier 2021-04-01
Series:Cell Reports
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2211124721002898
id doaj-db16dbab761748eb9d8ddf4bb766fea2
record_format Article
spelling doaj-db16dbab761748eb9d8ddf4bb766fea22021-04-16T04:53:25ZengElsevierCell Reports2211-12472021-04-01352108975Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypesScott R. Tyler0Yoojin Chun1Victoria M. Ribeiro2Galina Grishina3Alexander Grishin4Gabriel E. Hoffman5Anh N. Do6Supinda Bunyavanich7Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USADepartment of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USADepartment of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USADivision of Allergy and Immunology, Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USADivision of Allergy and Immunology, Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USADepartment of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USADepartment of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USADepartment of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Division of Allergy and Immunology, Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Corresponding authorSummary: Although clinical and laboratory data have long been used to guide medical practice, this information is rarely integrated with multi-omic data to identify endotypes. We present Merged Affinity Network Association Clustering (MANAclust), a coding-free, automated pipeline enabling integration of categorical and numeric data spanning clinical and multi-omic profiles for unsupervised clustering to identify disease subsets. Using simulations and real-world data from The Cancer Genome Atlas, we demonstrate that MANAclust’s feature selection algorithms are accurate and outperform competitors. We also apply MANAclust to a clinically and multi-omically phenotyped asthma cohort. MANAclust identifies clinically and molecularly distinct clusters, including heterogeneous groups of “healthy controls” and viral and allergy-driven subsets of asthmatic subjects. We also find that subjects with similar clinical presentations have disparate molecular profiles, highlighting the need for additional testing to uncover asthma endotypes. This work facilitates data-driven personalized medicine through integration of clinical parameters with multi-omics. MANAclust is freely available at https://bitbucket.org/scottyler892/manaclust/src/master/.http://www.sciencedirect.com/science/article/pii/S2211124721002898bioinformaticssystems biologymulti-omicsclusteringfeature selectiondata integration
collection DOAJ
language English
format Article
sources DOAJ
author Scott R. Tyler
Yoojin Chun
Victoria M. Ribeiro
Galina Grishina
Alexander Grishin
Gabriel E. Hoffman
Anh N. Do
Supinda Bunyavanich
spellingShingle Scott R. Tyler
Yoojin Chun
Victoria M. Ribeiro
Galina Grishina
Alexander Grishin
Gabriel E. Hoffman
Anh N. Do
Supinda Bunyavanich
Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypes
Cell Reports
bioinformatics
systems biology
multi-omics
clustering
feature selection
data integration
author_facet Scott R. Tyler
Yoojin Chun
Victoria M. Ribeiro
Galina Grishina
Alexander Grishin
Gabriel E. Hoffman
Anh N. Do
Supinda Bunyavanich
author_sort Scott R. Tyler
title Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypes
title_short Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypes
title_full Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypes
title_fullStr Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypes
title_full_unstemmed Merged Affinity Network Association Clustering: Joint multi-omic/clinical clustering to identify disease endotypes
title_sort merged affinity network association clustering: joint multi-omic/clinical clustering to identify disease endotypes
publisher Elsevier
series Cell Reports
issn 2211-1247
publishDate 2021-04-01
description Summary: Although clinical and laboratory data have long been used to guide medical practice, this information is rarely integrated with multi-omic data to identify endotypes. We present Merged Affinity Network Association Clustering (MANAclust), a coding-free, automated pipeline enabling integration of categorical and numeric data spanning clinical and multi-omic profiles for unsupervised clustering to identify disease subsets. Using simulations and real-world data from The Cancer Genome Atlas, we demonstrate that MANAclust’s feature selection algorithms are accurate and outperform competitors. We also apply MANAclust to a clinically and multi-omically phenotyped asthma cohort. MANAclust identifies clinically and molecularly distinct clusters, including heterogeneous groups of “healthy controls” and viral and allergy-driven subsets of asthmatic subjects. We also find that subjects with similar clinical presentations have disparate molecular profiles, highlighting the need for additional testing to uncover asthma endotypes. This work facilitates data-driven personalized medicine through integration of clinical parameters with multi-omics. MANAclust is freely available at https://bitbucket.org/scottyler892/manaclust/src/master/.
topic bioinformatics
systems biology
multi-omics
clustering
feature selection
data integration
url http://www.sciencedirect.com/science/article/pii/S2211124721002898
work_keys_str_mv AT scottrtyler mergedaffinitynetworkassociationclusteringjointmultiomicclinicalclusteringtoidentifydiseaseendotypes
AT yoojinchun mergedaffinitynetworkassociationclusteringjointmultiomicclinicalclusteringtoidentifydiseaseendotypes
AT victoriamribeiro mergedaffinitynetworkassociationclusteringjointmultiomicclinicalclusteringtoidentifydiseaseendotypes
AT galinagrishina mergedaffinitynetworkassociationclusteringjointmultiomicclinicalclusteringtoidentifydiseaseendotypes
AT alexandergrishin mergedaffinitynetworkassociationclusteringjointmultiomicclinicalclusteringtoidentifydiseaseendotypes
AT gabrielehoffman mergedaffinitynetworkassociationclusteringjointmultiomicclinicalclusteringtoidentifydiseaseendotypes
AT anhndo mergedaffinitynetworkassociationclusteringjointmultiomicclinicalclusteringtoidentifydiseaseendotypes
AT supindabunyavanich mergedaffinitynetworkassociationclusteringjointmultiomicclinicalclusteringtoidentifydiseaseendotypes
_version_ 1721525651009175552