Statistical approach of functional profiling for a microbial community.

Metagenomics is a relatively new but fast growing field within environmental biology and medical sciences. It enables researchers to understand the diversity of microbes, their functions, cooperation, and evolution in a particular ecosystem. Traditional methods in genomics and microbiology are not e...

Full description

Bibliographic Details
Main Authors: Lingling An, Nauromal Pookhao, Hongmei Jiang, Jiannong Xu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4157783?pdf=render
id doaj-5f16daf36a744b6daed3a69a38f41d42
record_format Article
spelling doaj-5f16daf36a744b6daed3a69a38f41d422020-11-25T01:46:56ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0199e10658810.1371/journal.pone.0106588Statistical approach of functional profiling for a microbial community.Lingling AnNauromal PookhaoHongmei JiangJiannong XuMetagenomics is a relatively new but fast growing field within environmental biology and medical sciences. It enables researchers to understand the diversity of microbes, their functions, cooperation, and evolution in a particular ecosystem. Traditional methods in genomics and microbiology are not efficient in capturing the structure of the microbial community in an environment. Nowadays, high-throughput next-generation sequencing technologies are powerfully driving the metagenomic studies. However, there is an urgent need to develop efficient statistical methods and computational algorithms to rapidly analyze the massive metagenomic short sequencing data and to accurately detect the features/functions present in the microbial community. Although several issues about functions of metagenomes at pathways or subsystems level have been investigated, there is a lack of studies focusing on functional analysis at a low level of a hierarchical functional tree, such as SEED subsystem tree.A two-step statistical procedure (metaFunction) is proposed to detect all possible functional roles at the low level from a metagenomic sample/community. In the first step a statistical mixture model is proposed at the base of gene codons to estimate the abundances for the candidate functional roles, with sequencing error being considered. As a gene could be involved in multiple biological processes the functional assignment is therefore adjusted by utilizing an error distribution in the second step. The performance of the proposed procedure is evaluated through comprehensive simulation studies. Compared with other existing methods in metagenomic functional analysis the new approach is more accurate in assigning reads to functional roles, and therefore at more general levels. The method is also employed to analyze two real data sets.metaFunction is a powerful tool in accurate profiling functions in a metagenomic sample.http://europepmc.org/articles/PMC4157783?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Lingling An
Nauromal Pookhao
Hongmei Jiang
Jiannong Xu
spellingShingle Lingling An
Nauromal Pookhao
Hongmei Jiang
Jiannong Xu
Statistical approach of functional profiling for a microbial community.
PLoS ONE
author_facet Lingling An
Nauromal Pookhao
Hongmei Jiang
Jiannong Xu
author_sort Lingling An
title Statistical approach of functional profiling for a microbial community.
title_short Statistical approach of functional profiling for a microbial community.
title_full Statistical approach of functional profiling for a microbial community.
title_fullStr Statistical approach of functional profiling for a microbial community.
title_full_unstemmed Statistical approach of functional profiling for a microbial community.
title_sort statistical approach of functional profiling for a microbial community.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2014-01-01
description Metagenomics is a relatively new but fast growing field within environmental biology and medical sciences. It enables researchers to understand the diversity of microbes, their functions, cooperation, and evolution in a particular ecosystem. Traditional methods in genomics and microbiology are not efficient in capturing the structure of the microbial community in an environment. Nowadays, high-throughput next-generation sequencing technologies are powerfully driving the metagenomic studies. However, there is an urgent need to develop efficient statistical methods and computational algorithms to rapidly analyze the massive metagenomic short sequencing data and to accurately detect the features/functions present in the microbial community. Although several issues about functions of metagenomes at pathways or subsystems level have been investigated, there is a lack of studies focusing on functional analysis at a low level of a hierarchical functional tree, such as SEED subsystem tree.A two-step statistical procedure (metaFunction) is proposed to detect all possible functional roles at the low level from a metagenomic sample/community. In the first step a statistical mixture model is proposed at the base of gene codons to estimate the abundances for the candidate functional roles, with sequencing error being considered. As a gene could be involved in multiple biological processes the functional assignment is therefore adjusted by utilizing an error distribution in the second step. The performance of the proposed procedure is evaluated through comprehensive simulation studies. Compared with other existing methods in metagenomic functional analysis the new approach is more accurate in assigning reads to functional roles, and therefore at more general levels. The method is also employed to analyze two real data sets.metaFunction is a powerful tool in accurate profiling functions in a metagenomic sample.
url http://europepmc.org/articles/PMC4157783?pdf=render
work_keys_str_mv AT linglingan statisticalapproachoffunctionalprofilingforamicrobialcommunity
AT nauromalpookhao statisticalapproachoffunctionalprofilingforamicrobialcommunity
AT hongmeijiang statisticalapproachoffunctionalprofilingforamicrobialcommunity
AT jiannongxu statisticalapproachoffunctionalprofilingforamicrobialcommunity
_version_ 1725017128551055360