The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging
Fourier transform infrared spectroscopy (FT-IR) is widely used in the analysis of the chemical composition of biological materials and has the potential to reveal new aspects of the molecular basis of diseases, including different types of cancer. The potential of FT-IR in cancer research lies in it...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-04-01
|
Series: | Cells |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-4409/10/4/953 |
id |
doaj-c423bcafd5364d9cbb77fbaac0ec2523 |
---|---|
record_format |
Article |
spelling |
doaj-c423bcafd5364d9cbb77fbaac0ec25232021-04-20T23:02:50ZengMDPI AGCells2073-44092021-04-011095395310.3390/cells10040953The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared ImagingDanuta Liberda0Ewa Pięta1Katarzyna Pogoda2Natalia Piergies3Maciej Roman4Paulina Koziol5Tomasz P. Wrobel6Czeslawa Paluszkiewicz7Wojciech M. Kwiatek8Institute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandInstitute of Nuclear Physics Polish Academy of Sciences, PL-31342 Krakow, PolandFourier transform infrared spectroscopy (FT-IR) is widely used in the analysis of the chemical composition of biological materials and has the potential to reveal new aspects of the molecular basis of diseases, including different types of cancer. The potential of FT-IR in cancer research lies in its capability of monitoring the biochemical status of cells, which undergo malignant transformation and further examination of spectral features that differentiate normal and cancerous ones using proper mathematical approaches. Such examination can be performed with the use of chemometric tools, such as partial least squares discriminant analysis (PLS-DA) classification and partial least squares regression (PLSR), and proper application of preprocessing methods and their correct sequence is crucial for success. Here, we performed a comparison of several state-of-the-art methods commonly used in infrared biospectroscopy (denoising, baseline correction, and normalization) with the addition of methods not previously used in infrared biospectroscopy classification problems: Mie extinction extended multiplicative signal correction, Eiler’s smoothing, and probabilistic quotient normalization. We compared all of these approaches and their effect on the data structure, classification, and regression capability on experimental FT-IR spectra collected from five different prostate normal and cancerous cell lines. Additionally, we tested the influence of added spectral noise. Overall, we concluded that in the case of the data analyzed here, the biggest impact on data structure and performance of PLS-DA and PLSR was caused by the baseline correction; therefore, much attention should be given, especially to this step of data preprocessing.https://www.mdpi.com/2073-4409/10/4/953FT-IR spectroscopyprostate cancer cellspreprocessingPLS-DAPLSREMSC |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Danuta Liberda Ewa Pięta Katarzyna Pogoda Natalia Piergies Maciej Roman Paulina Koziol Tomasz P. Wrobel Czeslawa Paluszkiewicz Wojciech M. Kwiatek |
spellingShingle |
Danuta Liberda Ewa Pięta Katarzyna Pogoda Natalia Piergies Maciej Roman Paulina Koziol Tomasz P. Wrobel Czeslawa Paluszkiewicz Wojciech M. Kwiatek The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging Cells FT-IR spectroscopy prostate cancer cells preprocessing PLS-DA PLSR EMSC |
author_facet |
Danuta Liberda Ewa Pięta Katarzyna Pogoda Natalia Piergies Maciej Roman Paulina Koziol Tomasz P. Wrobel Czeslawa Paluszkiewicz Wojciech M. Kwiatek |
author_sort |
Danuta Liberda |
title |
The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging |
title_short |
The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging |
title_full |
The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging |
title_fullStr |
The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging |
title_full_unstemmed |
The Impact of Preprocessing Methods for a Successful Prostate Cell Lines Discrimination Using Partial Least Squares Regression and Discriminant Analysis Based on Fourier Transform Infrared Imaging |
title_sort |
impact of preprocessing methods for a successful prostate cell lines discrimination using partial least squares regression and discriminant analysis based on fourier transform infrared imaging |
publisher |
MDPI AG |
series |
Cells |
issn |
2073-4409 |
publishDate |
2021-04-01 |
description |
Fourier transform infrared spectroscopy (FT-IR) is widely used in the analysis of the chemical composition of biological materials and has the potential to reveal new aspects of the molecular basis of diseases, including different types of cancer. The potential of FT-IR in cancer research lies in its capability of monitoring the biochemical status of cells, which undergo malignant transformation and further examination of spectral features that differentiate normal and cancerous ones using proper mathematical approaches. Such examination can be performed with the use of chemometric tools, such as partial least squares discriminant analysis (PLS-DA) classification and partial least squares regression (PLSR), and proper application of preprocessing methods and their correct sequence is crucial for success. Here, we performed a comparison of several state-of-the-art methods commonly used in infrared biospectroscopy (denoising, baseline correction, and normalization) with the addition of methods not previously used in infrared biospectroscopy classification problems: Mie extinction extended multiplicative signal correction, Eiler’s smoothing, and probabilistic quotient normalization. We compared all of these approaches and their effect on the data structure, classification, and regression capability on experimental FT-IR spectra collected from five different prostate normal and cancerous cell lines. Additionally, we tested the influence of added spectral noise. Overall, we concluded that in the case of the data analyzed here, the biggest impact on data structure and performance of PLS-DA and PLSR was caused by the baseline correction; therefore, much attention should be given, especially to this step of data preprocessing. |
topic |
FT-IR spectroscopy prostate cancer cells preprocessing PLS-DA PLSR EMSC |
url |
https://www.mdpi.com/2073-4409/10/4/953 |
work_keys_str_mv |
AT danutaliberda theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT ewapieta theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT katarzynapogoda theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT nataliapiergies theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT maciejroman theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT paulinakoziol theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT tomaszpwrobel theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT czeslawapaluszkiewicz theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT wojciechmkwiatek theimpactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT danutaliberda impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT ewapieta impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT katarzynapogoda impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT nataliapiergies impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT maciejroman impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT paulinakoziol impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT tomaszpwrobel impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT czeslawapaluszkiewicz impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging AT wojciechmkwiatek impactofpreprocessingmethodsforasuccessfulprostatecelllinesdiscriminationusingpartialleastsquaresregressionanddiscriminantanalysisbasedonfouriertransforminfraredimaging |
_version_ |
1721517301553954816 |