Content-based microarray search using differential expression profiles

<p>Abstract</p> <p>Background</p> <p>With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene...

Full description

Bibliographic Details
Main Authors: Thathoo Rahul, Chen Rong, Dudley Joel T, Morgan Alexander A, Engreitz Jesse M, Altman Russ B, Butte Atul J
Format: Article
Language:English
Published: BMC 2010-12-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/603
id doaj-e7fa10d0227b4032b00799b6e083e2dc
record_format Article
spelling doaj-e7fa10d0227b4032b00799b6e083e2dc2020-11-25T00:52:16ZengBMCBMC Bioinformatics1471-21052010-12-0111160310.1186/1471-2105-11-603Content-based microarray search using differential expression profilesThathoo RahulChen RongDudley Joel TMorgan Alexander AEngreitz Jesse MAltman Russ BButte Atul J<p>Abstract</p> <p>Background</p> <p>With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations.</p> <p>Results</p> <p>We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and <it>p</it>-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3.</p> <p>Conclusions</p> <p>Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets.</p> http://www.biomedcentral.com/1471-2105/11/603
collection DOAJ
language English
format Article
sources DOAJ
author Thathoo Rahul
Chen Rong
Dudley Joel T
Morgan Alexander A
Engreitz Jesse M
Altman Russ B
Butte Atul J
spellingShingle Thathoo Rahul
Chen Rong
Dudley Joel T
Morgan Alexander A
Engreitz Jesse M
Altman Russ B
Butte Atul J
Content-based microarray search using differential expression profiles
BMC Bioinformatics
author_facet Thathoo Rahul
Chen Rong
Dudley Joel T
Morgan Alexander A
Engreitz Jesse M
Altman Russ B
Butte Atul J
author_sort Thathoo Rahul
title Content-based microarray search using differential expression profiles
title_short Content-based microarray search using differential expression profiles
title_full Content-based microarray search using differential expression profiles
title_fullStr Content-based microarray search using differential expression profiles
title_full_unstemmed Content-based microarray search using differential expression profiles
title_sort content-based microarray search using differential expression profiles
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2010-12-01
description <p>Abstract</p> <p>Background</p> <p>With the expansion of public repositories such as the Gene Expression Omnibus (GEO), we are rapidly cataloging cellular transcriptional responses to diverse experimental conditions. Methods that query these repositories based on gene expression content, rather than textual annotations, may enable more effective experiment retrieval as well as the discovery of novel associations between drugs, diseases, and other perturbations.</p> <p>Results</p> <p>We develop methods to retrieve gene expression experiments that differentially express the same transcriptional programs as a query experiment. Avoiding thresholds, we generate differential expression profiles that include a score for each gene measured in an experiment. We use existing and novel dimension reduction and correlation measures to rank relevant experiments in an entirely data-driven manner, allowing emergent features of the data to drive the results. A combination of matrix decomposition and <it>p</it>-weighted Pearson correlation proves the most suitable for comparing differential expression profiles. We apply this method to index all GEO DataSets, and demonstrate the utility of our approach by identifying pathways and conditions relevant to transcription factors Nanog and FoxO3.</p> <p>Conclusions</p> <p>Content-based gene expression search generates relevant hypotheses for biological inquiry. Experiments across platforms, tissue types, and protocols inform the analysis of new datasets.</p>
url http://www.biomedcentral.com/1471-2105/11/603
work_keys_str_mv AT thathoorahul contentbasedmicroarraysearchusingdifferentialexpressionprofiles
AT chenrong contentbasedmicroarraysearchusingdifferentialexpressionprofiles
AT dudleyjoelt contentbasedmicroarraysearchusingdifferentialexpressionprofiles
AT morganalexandera contentbasedmicroarraysearchusingdifferentialexpressionprofiles
AT engreitzjessem contentbasedmicroarraysearchusingdifferentialexpressionprofiles
AT altmanrussb contentbasedmicroarraysearchusingdifferentialexpressionprofiles
AT butteatulj contentbasedmicroarraysearchusingdifferentialexpressionprofiles
_version_ 1725243223380590592