Empirical bayes analysis of quantitative proteomics experiments

Background: Advances in mass spectrometry-based proteomics have enabled the incorporation of proteomic data into systems approaches to biology. However, development of analytical methods has lagged behind. Here we describe an empirical Bayes framework for quantitative proteomics data analysis. The m...

Full description

Bibliographic Details
Main Authors: Golub, Todd R. (Author), Schreiber, Stuart L. (Author), Gould, Robert J. (Author), Schenone, Monica (Author), Ong, Shao-En (Author), Margolin, Adam A. (Author), Carr, Steven A (Author)
Other Authors: Koch Institute for Integrative Cancer Research at MIT (Contributor), Carr, Steven A. (Contributor)
Format: Article
Language:English
Published: Public Library of Science, 2010-03-10T20:01:05Z.
Subjects:
Online Access:Get fulltext
Description
Summary:Background: Advances in mass spectrometry-based proteomics have enabled the incorporation of proteomic data into systems approaches to biology. However, development of analytical methods has lagged behind. Here we describe an empirical Bayes framework for quantitative proteomics data analysis. The method provides a statistical description of each experiment, including the number of proteins that differ in abundance between 2 samples, the experiment's statistical power to detect them, and the false-positive probability of each protein. Methodology/Principal Findings: We analyzed 2 types of mass spectrometric experiments. First, we showed that the method identified the protein targets of small-molecules in affinity purification experiments with high precision. Second, we re-analyzed a mass spectrometric data set designed to identify proteins regulated by microRNAs. Our results were supported by sequence analysis of the 3' UTR regions of predicted target genes, and we found that the previously reported conclusion that a large fraction of the proteome is regulated by microRNAs was not supported by our statistical analysis of the data. Conclusions/Significance: Our results highlight the importance of rigorous statistical analysis of proteomic data, and the method described here provides a statistical framework to robustly and reliably interpret such data.
National Institutes of Health (grants RL1CA133834, RL1GM084437, and UL1RR024924)