XPRIME-EM: Eliciting Expert Prior Information for Motif Exploration Using the Expectation-Maximization Algorithm

Understanding the possible mechanisms of gene transcription regulation is a primary challenge for current molecular biologists. Identifying transcription factor binding sites (TFBSs), also called DNA motifs, is an important step in understanding these mechanisms. Furthermore, many human diseases are...

Full description

Bibliographic Details
Main Author: Zhou, Wei
Format: Others
Published: BYU ScholarsArchive 2012
Subjects:
Online Access:https://scholarsarchive.byu.edu/etd/3589
https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=4588&context=etd
id ndltd-BGMYU2-oai-scholarsarchive.byu.edu-etd-4588
record_format oai_dc
spelling ndltd-BGMYU2-oai-scholarsarchive.byu.edu-etd-45882019-05-16T03:19:12Z XPRIME-EM: Eliciting Expert Prior Information for Motif Exploration Using the Expectation-Maximization Algorithm Zhou, Wei Understanding the possible mechanisms of gene transcription regulation is a primary challenge for current molecular biologists. Identifying transcription factor binding sites (TFBSs), also called DNA motifs, is an important step in understanding these mechanisms. Furthermore, many human diseases are attributed to mutations in TFBSs, which makes identifying those DNA motifs significant for disease treatment. Uncertainty and variations in specific nucleotides of TFBSs present difficulties for DNA motif searching. In this project, we present an algorithm, XPRIME-EM (Eliciting EXpert PRior Information for Motif Exploration using the Expectation-Maximization Algorithm), which can discover known and de novo (unknown) DNA motifs simultaneously from a collection of DNA sequences using a modified EM algorithm and describe the variation nature of DNA motifs using position specific weight matrix (PWM). XPRIME improves the efficiency of locating and describing motifs by prevent the overlap of multiple motifs, a phenomenon termed a phase shift, and generates stronger motifs by considering the correlations between nucleotides at different positions within each motif. Moreover, a Bayesian formulation of the XPRIME algorithm allows for the elicitation of prior information for motifs of interest from literature and experiments into motif searching. We are the first research team to incorporate human genome-wide nucleosome occupancy information into the PWM based DNA motif searching. 2012-06-22T07:00:00Z text application/pdf https://scholarsarchive.byu.edu/etd/3589 https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=4588&context=etd http://lib.byu.edu/about/copyright/ All Theses and Dissertations BYU ScholarsArchive DNA motif modified EM algorithm human nucleosome occupancy information Statistics and Probability
collection NDLTD
format Others
sources NDLTD
topic DNA motif
modified EM algorithm
human nucleosome occupancy information
Statistics and Probability
spellingShingle DNA motif
modified EM algorithm
human nucleosome occupancy information
Statistics and Probability
Zhou, Wei
XPRIME-EM: Eliciting Expert Prior Information for Motif Exploration Using the Expectation-Maximization Algorithm
description Understanding the possible mechanisms of gene transcription regulation is a primary challenge for current molecular biologists. Identifying transcription factor binding sites (TFBSs), also called DNA motifs, is an important step in understanding these mechanisms. Furthermore, many human diseases are attributed to mutations in TFBSs, which makes identifying those DNA motifs significant for disease treatment. Uncertainty and variations in specific nucleotides of TFBSs present difficulties for DNA motif searching. In this project, we present an algorithm, XPRIME-EM (Eliciting EXpert PRior Information for Motif Exploration using the Expectation-Maximization Algorithm), which can discover known and de novo (unknown) DNA motifs simultaneously from a collection of DNA sequences using a modified EM algorithm and describe the variation nature of DNA motifs using position specific weight matrix (PWM). XPRIME improves the efficiency of locating and describing motifs by prevent the overlap of multiple motifs, a phenomenon termed a phase shift, and generates stronger motifs by considering the correlations between nucleotides at different positions within each motif. Moreover, a Bayesian formulation of the XPRIME algorithm allows for the elicitation of prior information for motifs of interest from literature and experiments into motif searching. We are the first research team to incorporate human genome-wide nucleosome occupancy information into the PWM based DNA motif searching.
author Zhou, Wei
author_facet Zhou, Wei
author_sort Zhou, Wei
title XPRIME-EM: Eliciting Expert Prior Information for Motif Exploration Using the Expectation-Maximization Algorithm
title_short XPRIME-EM: Eliciting Expert Prior Information for Motif Exploration Using the Expectation-Maximization Algorithm
title_full XPRIME-EM: Eliciting Expert Prior Information for Motif Exploration Using the Expectation-Maximization Algorithm
title_fullStr XPRIME-EM: Eliciting Expert Prior Information for Motif Exploration Using the Expectation-Maximization Algorithm
title_full_unstemmed XPRIME-EM: Eliciting Expert Prior Information for Motif Exploration Using the Expectation-Maximization Algorithm
title_sort xprime-em: eliciting expert prior information for motif exploration using the expectation-maximization algorithm
publisher BYU ScholarsArchive
publishDate 2012
url https://scholarsarchive.byu.edu/etd/3589
https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=4588&context=etd
work_keys_str_mv AT zhouwei xprimeemelicitingexpertpriorinformationformotifexplorationusingtheexpectationmaximizationalgorithm
_version_ 1719185246696505344