Alignment and analysis of noncoding DNA sequences in Drosophila

In this project, a fast accurate global pairwise alignment of noncoding DNA sequences, MCALIGN2, is developed based on explicit models of indel evolution. A pair-hidden-Markov-Model (pair-HMM) of seven states and a golden-Section-Search algorithm are employed in this method to search for the most pr...

Full description

Bibliographic Details
Main Author: Wang, Jun
Published: University of Edinburgh 2010
Subjects:
Online Access:http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.663450
id ndltd-bl.uk-oai-ethos.bl.uk-663450
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-6634502016-02-03T03:17:06ZAlignment and analysis of noncoding DNA sequences in DrosophilaWang, Jun2010In this project, a fast accurate global pairwise alignment of noncoding DNA sequences, MCALIGN2, is developed based on explicit models of indel evolution. A pair-hidden-Markov-Model (pair-HMM) of seven states and a golden-Section-Search algorithm are employed in this method to search for the most probable alignment between two homologous sequences. This method is then used to align and analyze noncoding DNA sequences in <i>Drosophila</i>. Comparative genomic analysis in this project shows that INE-1 elements, one of the most abundant TEs in <i>Drosophila</i>, along with sites within short introns and fourfold degenerate sites are the fastest evolving nucleotides in the genomes of <i>Drosophila</i> <i>melanogaster</i>, <i>D. simulans</i> and <i>D. sechellia</i>. Fourfold sites tend to be evolving (relatively) slightly more slowly than the other two classes of nucleotides, probably due to selection acting on protein translation efficiency. The observed substitution rate in these fastest evolving sites appears to be strongly influenced by the recombinational environment in which they are located. This rate may be influenced by several factors including ancestral polymorphisms, variation in mutation rate, natural selection and random genetic drift. The relative importance of these factors varies depending on the time since speciation. This project also fully investigates the distribution and rate of evolution of three major TE classes (LTR, non-LTR retrotransposons and DNA transposons) in the <i>Drosophila</i> euchromatic genomeusing a gene-centric approach. The study demonstrates that LTR elements outnumber non-LTR and DNA elements in all intergenic, intronic and exonic regions, and LTR elements also show relatively lower mean divergences than the other two classes between <i>D. melanogaster</i> and <i>D. yakuba</i>. The findings suggest that some TEs, rather than being “junk” and “selfish”, may be conserved between species, and therefore, play vital roles in gene regulation and host genome evolution.572.8University of Edinburghhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.663450http://hdl.handle.net/1842/11944Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 572.8
spellingShingle 572.8
Wang, Jun
Alignment and analysis of noncoding DNA sequences in Drosophila
description In this project, a fast accurate global pairwise alignment of noncoding DNA sequences, MCALIGN2, is developed based on explicit models of indel evolution. A pair-hidden-Markov-Model (pair-HMM) of seven states and a golden-Section-Search algorithm are employed in this method to search for the most probable alignment between two homologous sequences. This method is then used to align and analyze noncoding DNA sequences in <i>Drosophila</i>. Comparative genomic analysis in this project shows that INE-1 elements, one of the most abundant TEs in <i>Drosophila</i>, along with sites within short introns and fourfold degenerate sites are the fastest evolving nucleotides in the genomes of <i>Drosophila</i> <i>melanogaster</i>, <i>D. simulans</i> and <i>D. sechellia</i>. Fourfold sites tend to be evolving (relatively) slightly more slowly than the other two classes of nucleotides, probably due to selection acting on protein translation efficiency. The observed substitution rate in these fastest evolving sites appears to be strongly influenced by the recombinational environment in which they are located. This rate may be influenced by several factors including ancestral polymorphisms, variation in mutation rate, natural selection and random genetic drift. The relative importance of these factors varies depending on the time since speciation. This project also fully investigates the distribution and rate of evolution of three major TE classes (LTR, non-LTR retrotransposons and DNA transposons) in the <i>Drosophila</i> euchromatic genomeusing a gene-centric approach. The study demonstrates that LTR elements outnumber non-LTR and DNA elements in all intergenic, intronic and exonic regions, and LTR elements also show relatively lower mean divergences than the other two classes between <i>D. melanogaster</i> and <i>D. yakuba</i>. The findings suggest that some TEs, rather than being “junk” and “selfish”, may be conserved between species, and therefore, play vital roles in gene regulation and host genome evolution.
author Wang, Jun
author_facet Wang, Jun
author_sort Wang, Jun
title Alignment and analysis of noncoding DNA sequences in Drosophila
title_short Alignment and analysis of noncoding DNA sequences in Drosophila
title_full Alignment and analysis of noncoding DNA sequences in Drosophila
title_fullStr Alignment and analysis of noncoding DNA sequences in Drosophila
title_full_unstemmed Alignment and analysis of noncoding DNA sequences in Drosophila
title_sort alignment and analysis of noncoding dna sequences in drosophila
publisher University of Edinburgh
publishDate 2010
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.663450
work_keys_str_mv AT wangjun alignmentandanalysisofnoncodingdnasequencesindrosophila
_version_ 1718176426244440064