Metaprotein expression modeling for label-free quantitative proteomics

<p>Abstract</p> <p>Background</p> <p>Label-free quantitative proteomics holds a great deal of promise for the future study of both medicine and biology. However, the data generated is extremely intricate in its correlation structure, and its proper analysis is complex....

Full description

Bibliographic Details
Main Authors: Lucas Joseph E, Thompson J, Dubois Laura G, McCarthy Jeanette, Tillmann Hans, Thompson Alexander, Shire Norah, Hendrickson Ron, Dieguez Francisco, Goldman Phyllis, Schwarz Kathleen, Patel Keyur, McHutchison John, Moseley M
Format: Article
Language:English
Published: BMC 2012-05-01
Series:BMC Bioinformatics
Subjects:
Srm
Mrm
Online Access:http://www.biomedcentral.com/1471-2105/13/74
id doaj-7f53a18c46d842418d5ea3067e847e9c
record_format Article
spelling doaj-7f53a18c46d842418d5ea3067e847e9c2020-11-24T22:16:23ZengBMCBMC Bioinformatics1471-21052012-05-011317410.1186/1471-2105-13-74Metaprotein expression modeling for label-free quantitative proteomicsLucas Joseph EThompson JDubois Laura GMcCarthy JeanetteTillmann HansThompson AlexanderShire NorahHendrickson RonDieguez FranciscoGoldman PhyllisSchwarz KathleenPatel KeyurMcHutchison JohnMoseley M<p>Abstract</p> <p>Background</p> <p>Label-free quantitative proteomics holds a great deal of promise for the future study of both medicine and biology. However, the data generated is extremely intricate in its correlation structure, and its proper analysis is complex. There are issues with missing identifications. There are high levels of correlation between many, but not all, of the peptides derived from the same protein. Additionally, there may be systematic shifts in the sensitivity of the machine between experiments or even through time within the duration of a single experiment.</p> <p>Results</p> <p>We describe a hierarchical model for analyzing unbiased, label-free proteomics data which utilizes the covariance of peptide expression across samples as well as MS/MS-based identifications to group peptides—a strategy we call metaprotein expression modeling. Our metaprotein model acknowledges the possibility of misidentifications, post-translational modifications and systematic differences between samples due to changes in instrument sensitivity or differences in total protein concentration. In addition, our approach allows us to validate findings from unbiased, label-free proteomics experiments with further unbiased, label-free proteomics experiments. Finally, we demonstrate the clinical/translational utility of the model for building predictors capable of differentiating biological phenotypes as well as for validating those findings in the context of three novel cohorts of patients with Hepatitis C.</p> <p>Conclusions</p> <p>Mass-spectrometry proteomics is quickly becoming a powerful tool for studying biological and translational questions. Making use of all of the information contained in a particular set of data will be critical to the success of those endeavors. Our proposed model represents an advance in the ability of statistical models of proteomic data to identify and utilize correlation between features. This allows validation of predictors without translation to targeted assays in addition to informing the choice of targets when it is appropriate to generate those assays.</p> http://www.biomedcentral.com/1471-2105/13/74ProteomicsFactorHepatitisOpen platformStatisticsStatistical modelSrmMrm
collection DOAJ
language English
format Article
sources DOAJ
author Lucas Joseph E
Thompson J
Dubois Laura G
McCarthy Jeanette
Tillmann Hans
Thompson Alexander
Shire Norah
Hendrickson Ron
Dieguez Francisco
Goldman Phyllis
Schwarz Kathleen
Patel Keyur
McHutchison John
Moseley M
spellingShingle Lucas Joseph E
Thompson J
Dubois Laura G
McCarthy Jeanette
Tillmann Hans
Thompson Alexander
Shire Norah
Hendrickson Ron
Dieguez Francisco
Goldman Phyllis
Schwarz Kathleen
Patel Keyur
McHutchison John
Moseley M
Metaprotein expression modeling for label-free quantitative proteomics
BMC Bioinformatics
Proteomics
Factor
Hepatitis
Open platform
Statistics
Statistical model
Srm
Mrm
author_facet Lucas Joseph E
Thompson J
Dubois Laura G
McCarthy Jeanette
Tillmann Hans
Thompson Alexander
Shire Norah
Hendrickson Ron
Dieguez Francisco
Goldman Phyllis
Schwarz Kathleen
Patel Keyur
McHutchison John
Moseley M
author_sort Lucas Joseph E
title Metaprotein expression modeling for label-free quantitative proteomics
title_short Metaprotein expression modeling for label-free quantitative proteomics
title_full Metaprotein expression modeling for label-free quantitative proteomics
title_fullStr Metaprotein expression modeling for label-free quantitative proteomics
title_full_unstemmed Metaprotein expression modeling for label-free quantitative proteomics
title_sort metaprotein expression modeling for label-free quantitative proteomics
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2012-05-01
description <p>Abstract</p> <p>Background</p> <p>Label-free quantitative proteomics holds a great deal of promise for the future study of both medicine and biology. However, the data generated is extremely intricate in its correlation structure, and its proper analysis is complex. There are issues with missing identifications. There are high levels of correlation between many, but not all, of the peptides derived from the same protein. Additionally, there may be systematic shifts in the sensitivity of the machine between experiments or even through time within the duration of a single experiment.</p> <p>Results</p> <p>We describe a hierarchical model for analyzing unbiased, label-free proteomics data which utilizes the covariance of peptide expression across samples as well as MS/MS-based identifications to group peptides—a strategy we call metaprotein expression modeling. Our metaprotein model acknowledges the possibility of misidentifications, post-translational modifications and systematic differences between samples due to changes in instrument sensitivity or differences in total protein concentration. In addition, our approach allows us to validate findings from unbiased, label-free proteomics experiments with further unbiased, label-free proteomics experiments. Finally, we demonstrate the clinical/translational utility of the model for building predictors capable of differentiating biological phenotypes as well as for validating those findings in the context of three novel cohorts of patients with Hepatitis C.</p> <p>Conclusions</p> <p>Mass-spectrometry proteomics is quickly becoming a powerful tool for studying biological and translational questions. Making use of all of the information contained in a particular set of data will be critical to the success of those endeavors. Our proposed model represents an advance in the ability of statistical models of proteomic data to identify and utilize correlation between features. This allows validation of predictors without translation to targeted assays in addition to informing the choice of targets when it is appropriate to generate those assays.</p>
topic Proteomics
Factor
Hepatitis
Open platform
Statistics
Statistical model
Srm
Mrm
url http://www.biomedcentral.com/1471-2105/13/74
work_keys_str_mv AT lucasjosephe metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT thompsonj metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT duboislaurag metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT mccarthyjeanette metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT tillmannhans metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT thompsonalexander metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT shirenorah metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT hendricksonron metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT dieguezfrancisco metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT goldmanphyllis metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT schwarzkathleen metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT patelkeyur metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT mchutchisonjohn metaproteinexpressionmodelingforlabelfreequantitativeproteomics
AT moseleym metaproteinexpressionmodelingforlabelfreequantitativeproteomics
_version_ 1725790358323855360