<it>GOrilla</it>: a tool for discovery and visualization of enriched GO terms in ranked gene lists

<p>Abstract</p> <p>Background</p> <p>Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently availa...

Full description

Bibliographic Details
Main Authors: Steinfeld Israel, Navon Roy, Eden Eran, Lipson Doron, Yakhini Zohar
Format: Article
Language:English
Published: BMC 2009-02-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/10/48
id doaj-6fbf3642915b41c2a1cd65f3734dbd96
record_format Article
spelling doaj-6fbf3642915b41c2a1cd65f3734dbd962020-11-24T22:21:49ZengBMCBMC Bioinformatics1471-21052009-02-011014810.1186/1471-2105-10-48<it>GOrilla</it>: a tool for discovery and visualization of enriched GO terms in ranked gene listsSteinfeld IsraelNavon RoyEden EranLipson DoronYakhini Zohar<p>Abstract</p> <p>Background</p> <p>Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results.</p> <p>Results</p> <p><it>GOrilla </it>is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression). <it>GOrilla </it>employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the <it>top </it>of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, <it>GOrilla </it>computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousand of genes and thousands of GO terms in order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms.</p> <p>Conclusion</p> <p><it>GOrilla </it>is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. <it>GOrilla</it>'s unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. <it>GOrilla </it>is publicly available at: <url>http://cbl-gorilla.cs.technion.ac.il</url></p> http://www.biomedcentral.com/1471-2105/10/48
collection DOAJ
language English
format Article
sources DOAJ
author Steinfeld Israel
Navon Roy
Eden Eran
Lipson Doron
Yakhini Zohar
spellingShingle Steinfeld Israel
Navon Roy
Eden Eran
Lipson Doron
Yakhini Zohar
<it>GOrilla</it>: a tool for discovery and visualization of enriched GO terms in ranked gene lists
BMC Bioinformatics
author_facet Steinfeld Israel
Navon Roy
Eden Eran
Lipson Doron
Yakhini Zohar
author_sort Steinfeld Israel
title <it>GOrilla</it>: a tool for discovery and visualization of enriched GO terms in ranked gene lists
title_short <it>GOrilla</it>: a tool for discovery and visualization of enriched GO terms in ranked gene lists
title_full <it>GOrilla</it>: a tool for discovery and visualization of enriched GO terms in ranked gene lists
title_fullStr <it>GOrilla</it>: a tool for discovery and visualization of enriched GO terms in ranked gene lists
title_full_unstemmed <it>GOrilla</it>: a tool for discovery and visualization of enriched GO terms in ranked gene lists
title_sort <it>gorilla</it>: a tool for discovery and visualization of enriched go terms in ranked gene lists
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2009-02-01
description <p>Abstract</p> <p>Background</p> <p>Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results.</p> <p>Results</p> <p><it>GOrilla </it>is a web-based application that identifies enriched GO terms in ranked lists of genes, without requiring the user to provide explicit target and background sets. This is particularly useful in many typical cases where genomic data may be naturally represented as a ranked list of genes (e.g. by level of expression or of differential expression). <it>GOrilla </it>employs a flexible threshold statistical approach to discover GO terms that are significantly enriched at the <it>top </it>of a ranked gene list. Building on a complete theoretical characterization of the underlying distribution, called mHG, <it>GOrilla </it>computes an exact p-value for the observed enrichment, taking threshold multiple testing into account without the need for simulations. This enables rigorous statistical analysis of thousand of genes and thousands of GO terms in order of seconds. The output of the enrichment analysis is visualized as a hierarchical structure, providing a clear view of the relations between enriched GO terms.</p> <p>Conclusion</p> <p><it>GOrilla </it>is an efficient GO analysis tool with unique features that make a useful addition to the existing repertoire of GO enrichment tools. <it>GOrilla</it>'s unique features and advantages over other threshold free enrichment tools include rigorous statistics, fast running time and an effective graphical representation. <it>GOrilla </it>is publicly available at: <url>http://cbl-gorilla.cs.technion.ac.il</url></p>
url http://www.biomedcentral.com/1471-2105/10/48
work_keys_str_mv AT steinfeldisrael itgorillaitatoolfordiscoveryandvisualizationofenrichedgotermsinrankedgenelists
AT navonroy itgorillaitatoolfordiscoveryandvisualizationofenrichedgotermsinrankedgenelists
AT edeneran itgorillaitatoolfordiscoveryandvisualizationofenrichedgotermsinrankedgenelists
AT lipsondoron itgorillaitatoolfordiscoveryandvisualizationofenrichedgotermsinrankedgenelists
AT yakhinizohar itgorillaitatoolfordiscoveryandvisualizationofenrichedgotermsinrankedgenelists
_version_ 1725769663043862528