Search and comparison of (epi)genomic feature patterns in multiple genome browser tracks

Abstract Background Genome browsers are widely used for locating interesting genomic regions, but their interactive use is obviously limited to inspecting short genomic portions. An ideal interaction is to provide patterns of regions on the browser, and then extract other genomic regions over the wh...

Full description

Bibliographic Details
Main Authors: Arnaud Ceol, Piero Montanari, Ilaria Bartolini, Stefano Ceri, Paolo Ciaccia, Marco Patella, Marco Masseroli
Format: Article
Language:English
Published: BMC 2020-10-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-020-03781-2
id doaj-48274b9c751a49a188f1d8c441b3551f
record_format Article
spelling doaj-48274b9c751a49a188f1d8c441b3551f2020-11-25T03:56:18ZengBMCBMC Bioinformatics1471-21052020-10-0121111310.1186/s12859-020-03781-2Search and comparison of (epi)genomic feature patterns in multiple genome browser tracksArnaud Ceol0Piero Montanari1Ilaria Bartolini2Stefano Ceri3Paolo Ciaccia4Marco Patella5Marco Masseroli6Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia (IIT)DISI, Università di BolognaDISI, Università di BolognaDipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di MilanoDISI, Università di BolognaDISI, Università di BolognaDipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di MilanoAbstract Background Genome browsers are widely used for locating interesting genomic regions, but their interactive use is obviously limited to inspecting short genomic portions. An ideal interaction is to provide patterns of regions on the browser, and then extract other genomic regions over the whole genome where such patterns occur, ranked by similarity. Results We developed SimSearch, an optimized pattern-search method and an open source plugin for the Integrated Genome Browser (IGB), to find genomic region sets that are similar to a given region pattern. It provides efficient visual genome-wide analytics computation in large datasets; the plugin supports intuitive user interactions for selecting an interesting pattern on IGB tracks and visualizing the computed occurrences of similar patterns along the entire genome. SimSearch also includes functions for the annotation and enrichment of results, and is enhanced with a Quickload repository including numerous epigenomic feature datasets from ENCODE and Roadmap Epigenomics. The paper also includes some use cases to show multiple genome-wide analyses of biological interest, which can be easily performed by taking advantage of the presented approach. Conclusions The novel SimSearch method provides innovative support for effective genome-wide pattern search and visualization; its relevance and practical usefulness is demonstrated through a number of significant use cases of biological interest. The SimSearch IGB plugin, documentation, and code are freely available at https://deib-geco.github.io/simsearch-app/ and https://github.com/DEIB-GECO/simsearch-app/ .http://link.springer.com/article/10.1186/s12859-020-03781-2Computational genomicsGenome-wide pattern-searchVisual analyticsGenome browser
collection DOAJ
language English
format Article
sources DOAJ
author Arnaud Ceol
Piero Montanari
Ilaria Bartolini
Stefano Ceri
Paolo Ciaccia
Marco Patella
Marco Masseroli
spellingShingle Arnaud Ceol
Piero Montanari
Ilaria Bartolini
Stefano Ceri
Paolo Ciaccia
Marco Patella
Marco Masseroli
Search and comparison of (epi)genomic feature patterns in multiple genome browser tracks
BMC Bioinformatics
Computational genomics
Genome-wide pattern-search
Visual analytics
Genome browser
author_facet Arnaud Ceol
Piero Montanari
Ilaria Bartolini
Stefano Ceri
Paolo Ciaccia
Marco Patella
Marco Masseroli
author_sort Arnaud Ceol
title Search and comparison of (epi)genomic feature patterns in multiple genome browser tracks
title_short Search and comparison of (epi)genomic feature patterns in multiple genome browser tracks
title_full Search and comparison of (epi)genomic feature patterns in multiple genome browser tracks
title_fullStr Search and comparison of (epi)genomic feature patterns in multiple genome browser tracks
title_full_unstemmed Search and comparison of (epi)genomic feature patterns in multiple genome browser tracks
title_sort search and comparison of (epi)genomic feature patterns in multiple genome browser tracks
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2020-10-01
description Abstract Background Genome browsers are widely used for locating interesting genomic regions, but their interactive use is obviously limited to inspecting short genomic portions. An ideal interaction is to provide patterns of regions on the browser, and then extract other genomic regions over the whole genome where such patterns occur, ranked by similarity. Results We developed SimSearch, an optimized pattern-search method and an open source plugin for the Integrated Genome Browser (IGB), to find genomic region sets that are similar to a given region pattern. It provides efficient visual genome-wide analytics computation in large datasets; the plugin supports intuitive user interactions for selecting an interesting pattern on IGB tracks and visualizing the computed occurrences of similar patterns along the entire genome. SimSearch also includes functions for the annotation and enrichment of results, and is enhanced with a Quickload repository including numerous epigenomic feature datasets from ENCODE and Roadmap Epigenomics. The paper also includes some use cases to show multiple genome-wide analyses of biological interest, which can be easily performed by taking advantage of the presented approach. Conclusions The novel SimSearch method provides innovative support for effective genome-wide pattern search and visualization; its relevance and practical usefulness is demonstrated through a number of significant use cases of biological interest. The SimSearch IGB plugin, documentation, and code are freely available at https://deib-geco.github.io/simsearch-app/ and https://github.com/DEIB-GECO/simsearch-app/ .
topic Computational genomics
Genome-wide pattern-search
Visual analytics
Genome browser
url http://link.springer.com/article/10.1186/s12859-020-03781-2
work_keys_str_mv AT arnaudceol searchandcomparisonofepigenomicfeaturepatternsinmultiplegenomebrowsertracks
AT pieromontanari searchandcomparisonofepigenomicfeaturepatternsinmultiplegenomebrowsertracks
AT ilariabartolini searchandcomparisonofepigenomicfeaturepatternsinmultiplegenomebrowsertracks
AT stefanoceri searchandcomparisonofepigenomicfeaturepatternsinmultiplegenomebrowsertracks
AT paolociaccia searchandcomparisonofepigenomicfeaturepatternsinmultiplegenomebrowsertracks
AT marcopatella searchandcomparisonofepigenomicfeaturepatternsinmultiplegenomebrowsertracks
AT marcomasseroli searchandcomparisonofepigenomicfeaturepatternsinmultiplegenomebrowsertracks
_version_ 1724465745786568704