A Peak-Finder Meta Server for ChIP-Seq Analysis

Chromatin immunoprecipitation (ChIP) coupled with ultra high-throughput parallel sequencing (ChIP-seq) is widely used to study transcriptional regulation on a genome wide scale. Numerous computational tools have been developed to identify transcription factor (protein) binding sites from large ChIP-...

Full description

Bibliographic Details
Main Author: Umer, Husen
Format: Others
Language:English
Published: Uppsala universitet, Institutionen för informationsteknologi 2011
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-156436
id ndltd-UPSALLA1-oai-DiVA.org-uu-156436
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-uu-1564362013-01-08T13:50:22ZA Peak-Finder Meta Server for ChIP-Seq AnalysisengUmer, HusenUppsala universitet, Institutionen för informationsteknologi2011Chromatin immunoprecipitation (ChIP) coupled with ultra high-throughput parallel sequencing (ChIP-seq) is widely used to study transcriptional regulation on a genome wide scale. Numerous computational tools have been developed to identify transcription factor (protein) binding sites from large ChIP-seq datasets. The diversity of the datasets and the algorithm dependencies make it hard to get a satisfactory result. Many studies have compared the performance and accuracy of the algorithms using empirical datasets. It is shown that selecting the best algorithm to analyze a ChIP-seq dataset for detecting binding sites of a specific transcription factor depends on the dataset conditions. A systematic solution to compare the results of multiple algorithms to produce the best putative binding sites is still lacking. In this thesis project a new software package was introduced to provide a single interface for several state-of-the-art algorithms. A voting mechanism and a scoring mechanism were implemented to identify a set of the best predicted transcription factor binding sites (peaks) by normalizing and comparing the predicted peaks of the selected algorithms. The methods were applied on some publicly available datasets and the results were validated by comparing them to the results of the selected algorithms and their corresponding binding motifs. The discovered motifs showed a very high similarity to the consensus motifs of the selected transcription factors. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-156436IT ; 11 039application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
description Chromatin immunoprecipitation (ChIP) coupled with ultra high-throughput parallel sequencing (ChIP-seq) is widely used to study transcriptional regulation on a genome wide scale. Numerous computational tools have been developed to identify transcription factor (protein) binding sites from large ChIP-seq datasets. The diversity of the datasets and the algorithm dependencies make it hard to get a satisfactory result. Many studies have compared the performance and accuracy of the algorithms using empirical datasets. It is shown that selecting the best algorithm to analyze a ChIP-seq dataset for detecting binding sites of a specific transcription factor depends on the dataset conditions. A systematic solution to compare the results of multiple algorithms to produce the best putative binding sites is still lacking. In this thesis project a new software package was introduced to provide a single interface for several state-of-the-art algorithms. A voting mechanism and a scoring mechanism were implemented to identify a set of the best predicted transcription factor binding sites (peaks) by normalizing and comparing the predicted peaks of the selected algorithms. The methods were applied on some publicly available datasets and the results were validated by comparing them to the results of the selected algorithms and their corresponding binding motifs. The discovered motifs showed a very high similarity to the consensus motifs of the selected transcription factors.
author Umer, Husen
spellingShingle Umer, Husen
A Peak-Finder Meta Server for ChIP-Seq Analysis
author_facet Umer, Husen
author_sort Umer, Husen
title A Peak-Finder Meta Server for ChIP-Seq Analysis
title_short A Peak-Finder Meta Server for ChIP-Seq Analysis
title_full A Peak-Finder Meta Server for ChIP-Seq Analysis
title_fullStr A Peak-Finder Meta Server for ChIP-Seq Analysis
title_full_unstemmed A Peak-Finder Meta Server for ChIP-Seq Analysis
title_sort peak-finder meta server for chip-seq analysis
publisher Uppsala universitet, Institutionen för informationsteknologi
publishDate 2011
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-156436
work_keys_str_mv AT umerhusen apeakfindermetaserverforchipseqanalysis
AT umerhusen peakfindermetaserverforchipseqanalysis
_version_ 1716530527253561344