The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging

Fourier transform infrared spectroscopy (FT-IR) is widely used in the analysis of the chemical composition of biological materials and has the potential to reveal new aspects of the molecular basis of diseases, including different types of cancer. The potential of FT-IR in cancer research lies in it...

Full description

Bibliographic Details
Main Authors: Danuta Liberda, Ewa Pięta, Katarzyna Pogoda, Natalia Piergies, Maciej Roman, Paulina Koziol, Tomasz P. Wrobel, Czeslawa Paluszkiewicz, Wojciech M. Kwiatek
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Cells
Subjects:
Online Access:https://www.mdpi.com/2073-4409/10/4/953
id doaj-c423bcafd5364d9cbb77fbaac0ec2523
record_format Article
spelling doaj-c423bcafd5364d9cbb77fbaac0ec25232021-04-20T23:02:50ZengMDPI AGCells2073-44092021-04-011095395310.3390/cells10040953The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared ImagingDanuta Liberda0Ewa Pięta1Katarzyna Pogoda2Natalia Piergies3Maciej Roman4Paulina Koziol5Tomasz P. Wrobel6Czeslawa Paluszkiewicz7Wojciech M. Kwiatek8Institute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandFourier transform infrared spectroscopy (FT-IR) is widely used in the analysis of the chemical composition of biological materials and has the potential to reveal new aspects of the molecular basis of diseases, including different types of cancer. The potential of FT-IR in cancer research lies in its capability of monitoring the biochemical status of cells, which undergo malignant transformation and further examination of spectral features that differentiate normal and cancerous ones using proper mathematical approaches. Such examination can be performed with the use of chemometric tools, such as partial least squares discriminant analysis (PLS-DA) classification and partial least squares regression (PLSR), and proper application of preprocessing methods and their correct sequence is crucial for success. Here, we performed a comparison of several state-of-the-art methods commonly used in infrared biospectroscopy (denoising, baseline correction, and normalization) with the addition of methods not previously used in infrared biospectroscopy classification problems: Mie extinction extended multiplicative signal correction, Eiler’s smoothing, and probabilistic quotient normalization. We compared all of these approaches and their effect on the data structure, classification, and regression capability on experimental FT-IR spectra collected from five different prostate normal and cancerous cell lines. Additionally, we tested the influence of added spectral noise. Overall, we concluded that in the case of the data analyzed here, the biggest impact on data structure and performance of PLS-DA and PLSR was caused by the baseline correction; therefore, much attention should be given, especially to this step of data preprocessing.https://www.mdpi.com/2073-4409/10/4/953FT-IR spectroscopyprostate cancer cellspreprocessingPLS-DAPLSREMSC
collection DOAJ
language English
format Article
sources DOAJ
author Danuta Liberda
Ewa Pięta
Katarzyna Pogoda
Natalia Piergies
Maciej Roman
Paulina Koziol
Tomasz P. Wrobel
Czeslawa Paluszkiewicz
Wojciech M. Kwiatek
spellingShingle Danuta Liberda
Ewa Pięta
Katarzyna Pogoda
Natalia Piergies
Maciej Roman
Paulina Koziol
Tomasz P. Wrobel
Czeslawa Paluszkiewicz
Wojciech M. Kwiatek
The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging
Cells
FT-IR spectroscopy
prostate cancer cells
preprocessing
PLS-DA
PLSR
EMSC
author_facet Danuta Liberda
Ewa Pięta
Katarzyna Pogoda
Natalia Piergies
Maciej Roman
Paulina Koziol
Tomasz P. Wrobel
Czeslawa Paluszkiewicz
Wojciech M. Kwiatek
author_sort Danuta Liberda
title The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging
title_short The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging
title_full The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging
title_fullStr The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging
title_full_unstemmed The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging
title_sort impact of preprocessing methods for a successful prostate cell lines discrimination using partial least squares regression and discriminant analysis based on fourier transform infrared imaging
publisher MDPI AG
series Cells
issn 2073-4409
publishDate 2021-04-01
description Fourier transform infrared spectroscopy (FT-IR) is widely used in the analysis of the chemical composition of biological materials and has the potential to reveal new aspects of the molecular basis of diseases, including different types of cancer. The potential of FT-IR in cancer research lies in its capability of monitoring the biochemical status of cells, which undergo malignant transformation and further examination of spectral features that differentiate normal and cancerous ones using proper mathematical approaches. Such examination can be performed with the use of chemometric tools, such as partial least squares discriminant analysis (PLS-DA) classification and partial least squares regression (PLSR), and proper application of preprocessing methods and their correct sequence is crucial for success. Here, we performed a comparison of several state-of-the-art methods commonly used in infrared biospectroscopy (denoising, baseline correction, and normalization) with the addition of methods not previously used in infrared biospectroscopy classification problems: Mie extinction extended multiplicative signal correction, Eiler’s smoothing, and probabilistic quotient normalization. We compared all of these approaches and their effect on the data structure, classification, and regression capability on experimental FT-IR spectra collected from five different prostate normal and cancerous cell lines. Additionally, we tested the influence of added spectral noise. Overall, we concluded that in the case of the data analyzed here, the biggest impact on data structure and performance of PLS-DA and PLSR was caused by the baseline correction; therefore, much attention should be given, especially to this step of data preprocessing.
topic FT-IR spectroscopy
prostate cancer cells
preprocessing
PLS-DA
PLSR
EMSC
url https://www.mdpi.com/2073-4409/10/4/953
work_keys_str_mv AT danutaliberda theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT ewapieta theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT katarzynapogoda theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT nataliapiergies theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT maciejroman theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT paulinakoziol theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT tomaszpwrobel theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT czeslawapaluszkiewicz theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT wojciechmkwiatek theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT danutaliberda impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT ewapieta impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT katarzynapogoda impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT nataliapiergies impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT maciejroman impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT paulinakoziol impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT tomaszpwrobel impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT czeslawapaluszkiewicz impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
AT wojciechmkwiatek impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging
_version_ 1721517301553954816