Statistical Analysis of Species Level Phylogenetic Trees
Main Author: | |
---|---|
Language: | English |
Published: |
Bowling Green State University / OhioLINK
2017
|
Subjects: | |
Online Access: | http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1503051433382274 |
id |
ndltd-OhioLink-oai-etd.ohiolink.edu-bgsu1503051433382274 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-OhioLink-oai-etd.ohiolink.edu-bgsu15030514333822742021-08-03T07:03:57Z Statistical Analysis of Species Level Phylogenetic Trees Ferguson, Meg Elizabeth Statistics statistics phylogenetics squid missing data In this thesis, statistical methods are used to analyze the generation of species-level phylogenies. Two software packages, one phylogenetic and one statistical, are used to investigate the difference in phylogeny topology across three methods. Maximum likelihood estimation, neighbor-joining, and UPGMA methodologies are applied in this comparison to study the accuracy of each software package in correctly placing taxa with the true phylogeny. Four genes are used to compare with variable length sequences and genes amongst forty-seven squid species. In addition, missing data techniques are employed to assess the impact missing data has on phylogeny generation. Two software platforms were used to generate phylogenies for genes 16S rRNA, 18s rRNA, 28S rRNA, and the mitochondrial gene cytochrome c oxidase I (COI). The phylogenetic software platform MEGA was utilized as well as the statistical software platform, R; within R, the packages ape, phangorn, and seqinr were used in tree generation. Results show discrepancies between phylogenies generated across the four single-gene trees and multiple-gene trees; only phylogenies generated using missing data in the form of partial sequences grouped all families correctly. Results from this study highlight the struggle in determining the best software package to use for phylogenetic analyses. It was discovered that in general, MEGA generated a more accurate single-gene phylogeny from gene 18S rRNA while R generated a more accurate single-gene phylogeny from gene 28S rRNA. Results also showed that sequences with 50% missing characters could be accurately placed within generated phylogenies. 2017-11-14 English text Bowling Green State University / OhioLINK http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1503051433382274 http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1503051433382274 unrestricted This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws. |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
topic |
Statistics statistics phylogenetics squid missing data |
spellingShingle |
Statistics statistics phylogenetics squid missing data Ferguson, Meg Elizabeth Statistical Analysis of Species Level Phylogenetic Trees |
author |
Ferguson, Meg Elizabeth |
author_facet |
Ferguson, Meg Elizabeth |
author_sort |
Ferguson, Meg Elizabeth |
title |
Statistical Analysis of Species Level Phylogenetic Trees |
title_short |
Statistical Analysis of Species Level Phylogenetic Trees |
title_full |
Statistical Analysis of Species Level Phylogenetic Trees |
title_fullStr |
Statistical Analysis of Species Level Phylogenetic Trees |
title_full_unstemmed |
Statistical Analysis of Species Level Phylogenetic Trees |
title_sort |
statistical analysis of species level phylogenetic trees |
publisher |
Bowling Green State University / OhioLINK |
publishDate |
2017 |
url |
http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1503051433382274 |
work_keys_str_mv |
AT fergusonmegelizabeth statisticalanalysisofspecieslevelphylogenetictrees |
_version_ |
1719452656164929536 |