Identification of biomarkers from mass spectrometry data using a "common" peak approach

<p>Abstract</p> <p>Background</p> <p>Proteomic data obtained from mass spectrometry have attracted great interest for the detection of early-stage cancer. However, as mass spectrometry data are high-dimensional, identification of biomarkers is a key problem.</p> &...

Full description

Bibliographic Details
Main Authors: Eguchi Shinto, Fujisawa Hironori, Fushiki Tadayoshi
Format: Article
Language:English
Published: BMC 2006-07-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/7/358
id doaj-8fb2d484d1704b598a60c902bb8b474e
record_format Article
spelling doaj-8fb2d484d1704b598a60c902bb8b474e2020-11-25T00:09:33ZengBMCBMC Bioinformatics1471-21052006-07-017135810.1186/1471-2105-7-358Identification of biomarkers from mass spectrometry data using a "common" peak approachEguchi ShintoFujisawa HironoriFushiki Tadayoshi<p>Abstract</p> <p>Background</p> <p>Proteomic data obtained from mass spectrometry have attracted great interest for the detection of early-stage cancer. However, as mass spectrometry data are high-dimensional, identification of biomarkers is a key problem.</p> <p>Results</p> <p>This paper proposes the use of "common" peaks in data as biomarkers. Analysis is conducted as follows: data preprocessing, identification of biomarkers, and application of AdaBoost to construct a classification function. Informative "common" peaks are selected by AdaBoost. AsymBoost is also examined to balance false negatives and false positives. The effectiveness of the approach is demonstrated using an ovarian cancer dataset.</p> <p>Conclusion</p> <p>Continuous covariates and discrete covariates can be used in the present approach. The difference between the result for the continuous covariates and that for the discrete covariates was investigated in detail. In the example considered here, both covariates provide a good prediction, but it seems that they provide different kinds of information. We can obtain more information on the structure of the data by integrating both results.</p> http://www.biomedcentral.com/1471-2105/7/358
collection DOAJ
language English
format Article
sources DOAJ
author Eguchi Shinto
Fujisawa Hironori
Fushiki Tadayoshi
spellingShingle Eguchi Shinto
Fujisawa Hironori
Fushiki Tadayoshi
Identification of biomarkers from mass spectrometry data using a "common" peak approach
BMC Bioinformatics
author_facet Eguchi Shinto
Fujisawa Hironori
Fushiki Tadayoshi
author_sort Eguchi Shinto
title Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_short Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_full Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_fullStr Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_full_unstemmed Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_sort identification of biomarkers from mass spectrometry data using a "common" peak approach
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2006-07-01
description <p>Abstract</p> <p>Background</p> <p>Proteomic data obtained from mass spectrometry have attracted great interest for the detection of early-stage cancer. However, as mass spectrometry data are high-dimensional, identification of biomarkers is a key problem.</p> <p>Results</p> <p>This paper proposes the use of "common" peaks in data as biomarkers. Analysis is conducted as follows: data preprocessing, identification of biomarkers, and application of AdaBoost to construct a classification function. Informative "common" peaks are selected by AdaBoost. AsymBoost is also examined to balance false negatives and false positives. The effectiveness of the approach is demonstrated using an ovarian cancer dataset.</p> <p>Conclusion</p> <p>Continuous covariates and discrete covariates can be used in the present approach. The difference between the result for the continuous covariates and that for the discrete covariates was investigated in detail. In the example considered here, both covariates provide a good prediction, but it seems that they provide different kinds of information. We can obtain more information on the structure of the data by integrating both results.</p>
url http://www.biomedcentral.com/1471-2105/7/358
work_keys_str_mv AT eguchishinto identificationofbiomarkersfrommassspectrometrydatausingacommonpeakapproach
AT fujisawahironori identificationofbiomarkersfrommassspectrometrydatausingacommonpeakapproach
AT fushikitadayoshi identificationofbiomarkersfrommassspectrometrydatausingacommonpeakapproach
_version_ 1725411350435332096