Segmentation of patients with small cell lung cancer into responders and non-responders using the optimal cross-validation technique

Abstract Background The timing of treating cancer patients is an essential factor in the efficacy of treatment. So, patients who will not respond to current therapy should receive a different treatment as early as possible. Machine learning models can be built to classify responders and nonresponder...

詳細記述

書誌詳細
出版年:BMC Medical Research Methodology
主要な著者: Elham Majd, Li Xing, Xuekui Zhang
フォーマット: 論文
言語:英語
出版事項: BMC 2024-04-01
主題:
オンライン・アクセス:https://doi.org/10.1186/s12874-024-02185-7
_version_ 1850027171069820928
author Elham Majd
Li Xing
Xuekui Zhang
author_facet Elham Majd
Li Xing
Xuekui Zhang
author_sort Elham Majd
collection DOAJ
container_title BMC Medical Research Methodology
description Abstract Background The timing of treating cancer patients is an essential factor in the efficacy of treatment. So, patients who will not respond to current therapy should receive a different treatment as early as possible. Machine learning models can be built to classify responders and nonresponders. Such classification models predict the probability of a patient being a responder. Most methods use a probability threshold of 0.5 to convert the probabilities into binary group membership. However, the cutoff of 0.5 is not always the optimal choice. Methods In this study, we propose a novel data-driven approach to select a better cutoff value based on the optimal cross-validation technique. To illustrate our novel method, we applied it to three clinical trial datasets of small-cell lung cancer patients. We used two different datasets to build a scoring system to segment patients. Then the models were applied to segment patients into the test data. Results We found that, in test data, the predicted responders and non-responders had significantly different long-term survival outcomes. Our proposed novel method segments patients better than the standard approach using a cutoff of 0.5. Comparing clinical outcomes of responders versus non-responders, our novel method had a p-value of 0.009 with a hazard ratio of 0.668 for grouping patients using the Cox proportion hazard model and a p-value of 0.011 using the accelerated failure time model which approved a significant difference between responders and non-responders. In contrast, the standard approach had a p-value of 0.194 with a hazard ratio of 0.823 using the Cox proportion hazard model and a p-value of 0.240 using the accelerated failure time model indicating the responders and non-responders do not differ significantly in survival. Conclusion In summary, our novel prediction method can successfully segment new patients into responders and non-responders. Clinicians can use our prediction to decide if a patient should receive a different treatment or stay with the current treatment.
format Article
id doaj-art-8e1eef4fda3f48eabcf9f4dde45abdaf
institution Directory of Open Access Journals
issn 1471-2288
language English
publishDate 2024-04-01
publisher BMC
record_format Article
spelling doaj-art-8e1eef4fda3f48eabcf9f4dde45abdaf2025-08-20T00:37:35ZengBMCBMC Medical Research Methodology1471-22882024-04-0124111110.1186/s12874-024-02185-7Segmentation of patients with small cell lung cancer into responders and non-responders using the optimal cross-validation techniqueElham Majd0Li Xing1Xuekui Zhang2Department of Mathematics and Statistics, University of VictoriaDepartment of Mathematics and Statistics, University of SaskatchewanDepartment of Mathematics and Statistics, University of VictoriaAbstract Background The timing of treating cancer patients is an essential factor in the efficacy of treatment. So, patients who will not respond to current therapy should receive a different treatment as early as possible. Machine learning models can be built to classify responders and nonresponders. Such classification models predict the probability of a patient being a responder. Most methods use a probability threshold of 0.5 to convert the probabilities into binary group membership. However, the cutoff of 0.5 is not always the optimal choice. Methods In this study, we propose a novel data-driven approach to select a better cutoff value based on the optimal cross-validation technique. To illustrate our novel method, we applied it to three clinical trial datasets of small-cell lung cancer patients. We used two different datasets to build a scoring system to segment patients. Then the models were applied to segment patients into the test data. Results We found that, in test data, the predicted responders and non-responders had significantly different long-term survival outcomes. Our proposed novel method segments patients better than the standard approach using a cutoff of 0.5. Comparing clinical outcomes of responders versus non-responders, our novel method had a p-value of 0.009 with a hazard ratio of 0.668 for grouping patients using the Cox proportion hazard model and a p-value of 0.011 using the accelerated failure time model which approved a significant difference between responders and non-responders. In contrast, the standard approach had a p-value of 0.194 with a hazard ratio of 0.823 using the Cox proportion hazard model and a p-value of 0.240 using the accelerated failure time model indicating the responders and non-responders do not differ significantly in survival. Conclusion In summary, our novel prediction method can successfully segment new patients into responders and non-responders. Clinicians can use our prediction to decide if a patient should receive a different treatment or stay with the current treatment.https://doi.org/10.1186/s12874-024-02185-7Best overall responseClinical trialsCross-validationOverall survival
spellingShingle Elham Majd
Li Xing
Xuekui Zhang
Segmentation of patients with small cell lung cancer into responders and non-responders using the optimal cross-validation technique
Best overall response
Clinical trials
Cross-validation
Overall survival
title Segmentation of patients with small cell lung cancer into responders and non-responders using the optimal cross-validation technique
title_full Segmentation of patients with small cell lung cancer into responders and non-responders using the optimal cross-validation technique
title_fullStr Segmentation of patients with small cell lung cancer into responders and non-responders using the optimal cross-validation technique
title_full_unstemmed Segmentation of patients with small cell lung cancer into responders and non-responders using the optimal cross-validation technique
title_short Segmentation of patients with small cell lung cancer into responders and non-responders using the optimal cross-validation technique
title_sort segmentation of patients with small cell lung cancer into responders and non responders using the optimal cross validation technique
topic Best overall response
Clinical trials
Cross-validation
Overall survival
url https://doi.org/10.1186/s12874-024-02185-7
work_keys_str_mv AT elhammajd segmentationofpatientswithsmallcelllungcancerintorespondersandnonrespondersusingtheoptimalcrossvalidationtechnique
AT lixing segmentationofpatientswithsmallcelllungcancerintorespondersandnonrespondersusingtheoptimalcrossvalidationtechnique
AT xuekuizhang segmentationofpatientswithsmallcelllungcancerintorespondersandnonrespondersusingtheoptimalcrossvalidationtechnique