Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC

Abstract Background Early diagnosis is crucial for effective medical management of cancer patients. Tissue biopsy has been widely used for cancer diagnosis, but its invasive nature limits its application, especially when repeated biopsies are needed. Over the past few years, genomic explorations hav...

Full description

Bibliographic Details
Main Authors: Chitrita Goswami, Smriti Chawla, Deepshi Thakral, Himanshu Pant, Pramod Verma, Prabhat Singh Malik, Jayadeva ▮, Ritu Gupta, Gaurav Ahuja, Debarka Sengupta
Format: Article
Language:English
Published: BMC 2020-10-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-020-07147-z
id doaj-299bb670c54a462791afb91469cc7718
record_format Article
spelling doaj-299bb670c54a462791afb91469cc77182020-11-25T03:05:57ZengBMCBMC Genomics1471-21642020-10-0121111210.1186/s12864-020-07147-zMolecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLCChitrita Goswami0Smriti Chawla1Deepshi Thakral2Himanshu Pant3Pramod Verma4Prabhat Singh Malik5Jayadeva ▮6Ritu Gupta7Gaurav Ahuja8Debarka Sengupta9Department of Computer Science and Engineering, Indraprastha Institute of Information TechnologyDepartment of Computational Biology, Indraprastha Institute of Information TechnologyLaboratory Oncology Unit, All India Institute of Medical SciencesDepartment of Electrical Engineering, Indian Institute of TechnologyLaboratory Oncology Unit, All India Institute of Medical SciencesDepartment of Medical Oncology, All India Institute of Medical SciencesDepartment of Electrical Engineering, Indian Institute of TechnologyLaboratory Oncology Unit, All India Institute of Medical SciencesDepartment of Computational Biology, Indraprastha Institute of Information TechnologyDepartment of Computer Science and Engineering, Indraprastha Institute of Information TechnologyAbstract Background Early diagnosis is crucial for effective medical management of cancer patients. Tissue biopsy has been widely used for cancer diagnosis, but its invasive nature limits its application, especially when repeated biopsies are needed. Over the past few years, genomic explorations have led to the discovery of various blood-based biomarkers. Tumor Educated Platelets (TEPs) have, of late, generated considerable interest due to their ability to infer tumor existence and subtype accurately. So far, a majority of the studies involving TEPs have offered marker-panels consisting of several hundreds of genes. Profiling large numbers of genes incur a significant cost, impeding its diagnostic adoption. As such, it is important to construct minimalistic molecular signatures comprising a small number of genes. Results To address the aforesaid challenges, we analyzed publicly available TEP expression profiles and identified a panel of 11 platelet-genes that reliably discriminates between cancer and healthy samples. To validate its efficacy, we chose non-small cell lung cancer (NSCLC), the most prevalent type of lung malignancy. When applied to platelet-gene expression data from a published study, our machine learning model could accurately discriminate between non-metastatic NSCLC cases and healthy samples. We further experimentally validated the panel on an in-house cohort of metastatic NSCLC patients and healthy controls via real-time quantitative Polymerase Chain Reaction (RT-qPCR) (AUC = 0.97). Model performance was boosted significantly after artificial data-augmentation using the EigenSample method (AUC = 0.99). Lastly, we demonstrated the cancer-specificity of the proposed gene-panel by benchmarking it on platelet transcriptomes from patients with Myocardial Infarction (MI). Conclusion We demonstrated an end-to-end bioinformatic plus experimental workflow for identifying a minimal set of TEP associated marker-genes that are predictive of the existence of cancers. We also discussed a strategy for boosting the predictive model performance by artificial augmentation of gene expression data.http://link.springer.com/article/10.1186/s12864-020-07147-zLiquid biopsyTumour educated plateletNSCLCMolecular diagnosticsGene-signature
collection DOAJ
language English
format Article
sources DOAJ
author Chitrita Goswami
Smriti Chawla
Deepshi Thakral
Himanshu Pant
Pramod Verma
Prabhat Singh Malik
Jayadeva ▮
Ritu Gupta
Gaurav Ahuja
Debarka Sengupta
spellingShingle Chitrita Goswami
Smriti Chawla
Deepshi Thakral
Himanshu Pant
Pramod Verma
Prabhat Singh Malik
Jayadeva ▮
Ritu Gupta
Gaurav Ahuja
Debarka Sengupta
Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
BMC Genomics
Liquid biopsy
Tumour educated platelet
NSCLC
Molecular diagnostics
Gene-signature
author_facet Chitrita Goswami
Smriti Chawla
Deepshi Thakral
Himanshu Pant
Pramod Verma
Prabhat Singh Malik
Jayadeva ▮
Ritu Gupta
Gaurav Ahuja
Debarka Sengupta
author_sort Chitrita Goswami
title Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_short Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_full Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_fullStr Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_full_unstemmed Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_sort molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of nsclc
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2020-10-01
description Abstract Background Early diagnosis is crucial for effective medical management of cancer patients. Tissue biopsy has been widely used for cancer diagnosis, but its invasive nature limits its application, especially when repeated biopsies are needed. Over the past few years, genomic explorations have led to the discovery of various blood-based biomarkers. Tumor Educated Platelets (TEPs) have, of late, generated considerable interest due to their ability to infer tumor existence and subtype accurately. So far, a majority of the studies involving TEPs have offered marker-panels consisting of several hundreds of genes. Profiling large numbers of genes incur a significant cost, impeding its diagnostic adoption. As such, it is important to construct minimalistic molecular signatures comprising a small number of genes. Results To address the aforesaid challenges, we analyzed publicly available TEP expression profiles and identified a panel of 11 platelet-genes that reliably discriminates between cancer and healthy samples. To validate its efficacy, we chose non-small cell lung cancer (NSCLC), the most prevalent type of lung malignancy. When applied to platelet-gene expression data from a published study, our machine learning model could accurately discriminate between non-metastatic NSCLC cases and healthy samples. We further experimentally validated the panel on an in-house cohort of metastatic NSCLC patients and healthy controls via real-time quantitative Polymerase Chain Reaction (RT-qPCR) (AUC = 0.97). Model performance was boosted significantly after artificial data-augmentation using the EigenSample method (AUC = 0.99). Lastly, we demonstrated the cancer-specificity of the proposed gene-panel by benchmarking it on platelet transcriptomes from patients with Myocardial Infarction (MI). Conclusion We demonstrated an end-to-end bioinformatic plus experimental workflow for identifying a minimal set of TEP associated marker-genes that are predictive of the existence of cancers. We also discussed a strategy for boosting the predictive model performance by artificial augmentation of gene expression data.
topic Liquid biopsy
Tumour educated platelet
NSCLC
Molecular diagnostics
Gene-signature
url http://link.springer.com/article/10.1186/s12864-020-07147-z
work_keys_str_mv AT chitritagoswami molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT smritichawla molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT deepshithakral molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT himanshupant molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT pramodverma molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT prabhatsinghmalik molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT jayadeva molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT ritugupta molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT gauravahuja molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT debarkasengupta molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
_version_ 1724676266151378944