A combinatorial optimization approach for diverse motif finding applications

<p>Abstract</p> <p>Background</p> <p>Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared...

Full description

Bibliographic Details
Main Authors: Singh Mona, Zaslavsky Elena
Format: Article
Language:English
Published: BMC 2006-08-01
Series:Algorithms for Molecular Biology
Online Access:http://www.almob.org/content/1/1/13
id doaj-ab402160149d4399ab4ead6b6d98e376
record_format Article
spelling doaj-ab402160149d4399ab4ead6b6d98e3762020-11-24T23:01:48ZengBMCAlgorithms for Molecular Biology1748-71882006-08-01111310.1186/1748-7188-1-13A combinatorial optimization approach for diverse motif finding applicationsSingh MonaZaslavsky Elena<p>Abstract</p> <p>Background</p> <p>Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied.</p> <p>Results</p> <p>We introduce a versatile combinatorial optimization framework for motif finding that couples graph pruning techniques with a novel integer linear programming formulation. Our approach is flexible and robust enough to model several variants of the motif finding problem, including those incorporating substitution matrices and phylogenetic distances. Additionally, we give an approach for determining statistical significance of uncovered motifs. In testing on numerous DNA and protein datasets, we demonstrate that our approach typically identifies statistically significant motifs corresponding to either known motifs or other motifs of high conservation. Moreover, in most cases, our approach finds provably optimal solutions to the underlying optimization problem.</p> <p>Conclusion</p> <p>Our results demonstrate that a combined graph theoretic and mathematical programming approach can be the basis for effective and powerful techniques for diverse motif finding applications.</p> http://www.almob.org/content/1/1/13
collection DOAJ
language English
format Article
sources DOAJ
author Singh Mona
Zaslavsky Elena
spellingShingle Singh Mona
Zaslavsky Elena
A combinatorial optimization approach for diverse motif finding applications
Algorithms for Molecular Biology
author_facet Singh Mona
Zaslavsky Elena
author_sort Singh Mona
title A combinatorial optimization approach for diverse motif finding applications
title_short A combinatorial optimization approach for diverse motif finding applications
title_full A combinatorial optimization approach for diverse motif finding applications
title_fullStr A combinatorial optimization approach for diverse motif finding applications
title_full_unstemmed A combinatorial optimization approach for diverse motif finding applications
title_sort combinatorial optimization approach for diverse motif finding applications
publisher BMC
series Algorithms for Molecular Biology
issn 1748-7188
publishDate 2006-08-01
description <p>Abstract</p> <p>Background</p> <p>Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied.</p> <p>Results</p> <p>We introduce a versatile combinatorial optimization framework for motif finding that couples graph pruning techniques with a novel integer linear programming formulation. Our approach is flexible and robust enough to model several variants of the motif finding problem, including those incorporating substitution matrices and phylogenetic distances. Additionally, we give an approach for determining statistical significance of uncovered motifs. In testing on numerous DNA and protein datasets, we demonstrate that our approach typically identifies statistically significant motifs corresponding to either known motifs or other motifs of high conservation. Moreover, in most cases, our approach finds provably optimal solutions to the underlying optimization problem.</p> <p>Conclusion</p> <p>Our results demonstrate that a combined graph theoretic and mathematical programming approach can be the basis for effective and powerful techniques for diverse motif finding applications.</p>
url http://www.almob.org/content/1/1/13
work_keys_str_mv AT singhmona acombinatorialoptimizationapproachfordiversemotiffindingapplications
AT zaslavskyelena acombinatorialoptimizationapproachfordiversemotiffindingapplications
AT singhmona combinatorialoptimizationapproachfordiversemotiffindingapplications
AT zaslavskyelena combinatorialoptimizationapproachfordiversemotiffindingapplications
_version_ 1725638738829115392