Precise diagnosis of three top cancers using dbGaP data
Abstract The challenge of decoding information about complex diseases hidden in huge number of single nucleotide polymorphism (SNP) genotypes is undertaken based on five dbGaP studies. Current genome-wide association studies have successfully identified many high-risk SNPs associated with diseases,...
Main Authors: | , , , , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2021-01-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-020-80832-x |
id |
doaj-24e7d0fce6ac404098fc1a383345b51d |
---|---|
record_format |
Article |
spelling |
doaj-24e7d0fce6ac404098fc1a383345b51d2021-01-17T12:35:56ZengNature Publishing GroupScientific Reports2045-23222021-01-011111810.1038/s41598-020-80832-xPrecise diagnosis of three top cancers using dbGaP dataXu-Qing Liu0Xin-Sheng Liu1Jian-Ying Rong2Feng Gao3Yan-Dong Wu4Chun-Hua Deng5Hong-Yan Jiang6Xiao-Feng Li7Ye-Qin Chen8Zhi-Guo Zhao9Yu-Ting Liu10Hai-Wen Chen11Jun-Liang Li12Yu Huang13Cheng-Yao Ji14Wen-Wen Liu15Xiao-Hu Luo16Li-Li Xiao17Huaiyin Institute of TechnologyNanjing University of Aeronautics and AstronauticsJiangsu Vocational College of Electronics and InformationHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyJiangsu Vocational College of Electronics and InformationHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyHuaiyin Institute of TechnologyAbstract The challenge of decoding information about complex diseases hidden in huge number of single nucleotide polymorphism (SNP) genotypes is undertaken based on five dbGaP studies. Current genome-wide association studies have successfully identified many high-risk SNPs associated with diseases, but precise diagnostic models for complex diseases by these or more other SNP genotypes are still unavailable in the literature. We report that lung cancer, breast cancer and prostate cancer as the first three top cancers worldwide can be predicted precisely via 240–370 SNPs with accuracy up to 99% according to leave-one-out and 10-fold cross-validation. Our findings (1) confirm an early guess of Dr. Mitchell H. Gail that about 300 SNPs are needed to improve risk forecasts for breast cancer, (2) reveal an incredible fact that SNP genotypes may contain almost all information that one wants to know, and (3) show a hopeful possibility that complex diseases can be precisely diagnosed by means of SNP genotypes without using phenotypical features. In short words, information hidden in SNP genotypes can be extracted in efficient ways to make precise diagnoses for complex diseases.https://doi.org/10.1038/s41598-020-80832-x |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xu-Qing Liu Xin-Sheng Liu Jian-Ying Rong Feng Gao Yan-Dong Wu Chun-Hua Deng Hong-Yan Jiang Xiao-Feng Li Ye-Qin Chen Zhi-Guo Zhao Yu-Ting Liu Hai-Wen Chen Jun-Liang Li Yu Huang Cheng-Yao Ji Wen-Wen Liu Xiao-Hu Luo Li-Li Xiao |
spellingShingle |
Xu-Qing Liu Xin-Sheng Liu Jian-Ying Rong Feng Gao Yan-Dong Wu Chun-Hua Deng Hong-Yan Jiang Xiao-Feng Li Ye-Qin Chen Zhi-Guo Zhao Yu-Ting Liu Hai-Wen Chen Jun-Liang Li Yu Huang Cheng-Yao Ji Wen-Wen Liu Xiao-Hu Luo Li-Li Xiao Precise diagnosis of three top cancers using dbGaP data Scientific Reports |
author_facet |
Xu-Qing Liu Xin-Sheng Liu Jian-Ying Rong Feng Gao Yan-Dong Wu Chun-Hua Deng Hong-Yan Jiang Xiao-Feng Li Ye-Qin Chen Zhi-Guo Zhao Yu-Ting Liu Hai-Wen Chen Jun-Liang Li Yu Huang Cheng-Yao Ji Wen-Wen Liu Xiao-Hu Luo Li-Li Xiao |
author_sort |
Xu-Qing Liu |
title |
Precise diagnosis of three top cancers using dbGaP data |
title_short |
Precise diagnosis of three top cancers using dbGaP data |
title_full |
Precise diagnosis of three top cancers using dbGaP data |
title_fullStr |
Precise diagnosis of three top cancers using dbGaP data |
title_full_unstemmed |
Precise diagnosis of three top cancers using dbGaP data |
title_sort |
precise diagnosis of three top cancers using dbgap data |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2021-01-01 |
description |
Abstract The challenge of decoding information about complex diseases hidden in huge number of single nucleotide polymorphism (SNP) genotypes is undertaken based on five dbGaP studies. Current genome-wide association studies have successfully identified many high-risk SNPs associated with diseases, but precise diagnostic models for complex diseases by these or more other SNP genotypes are still unavailable in the literature. We report that lung cancer, breast cancer and prostate cancer as the first three top cancers worldwide can be predicted precisely via 240–370 SNPs with accuracy up to 99% according to leave-one-out and 10-fold cross-validation. Our findings (1) confirm an early guess of Dr. Mitchell H. Gail that about 300 SNPs are needed to improve risk forecasts for breast cancer, (2) reveal an incredible fact that SNP genotypes may contain almost all information that one wants to know, and (3) show a hopeful possibility that complex diseases can be precisely diagnosed by means of SNP genotypes without using phenotypical features. In short words, information hidden in SNP genotypes can be extracted in efficient ways to make precise diagnoses for complex diseases. |
url |
https://doi.org/10.1038/s41598-020-80832-x |
work_keys_str_mv |
AT xuqingliu precisediagnosisofthreetopcancersusingdbgapdata AT xinshengliu precisediagnosisofthreetopcancersusingdbgapdata AT jianyingrong precisediagnosisofthreetopcancersusingdbgapdata AT fenggao precisediagnosisofthreetopcancersusingdbgapdata AT yandongwu precisediagnosisofthreetopcancersusingdbgapdata AT chunhuadeng precisediagnosisofthreetopcancersusingdbgapdata AT hongyanjiang precisediagnosisofthreetopcancersusingdbgapdata AT xiaofengli precisediagnosisofthreetopcancersusingdbgapdata AT yeqinchen precisediagnosisofthreetopcancersusingdbgapdata AT zhiguozhao precisediagnosisofthreetopcancersusingdbgapdata AT yutingliu precisediagnosisofthreetopcancersusingdbgapdata AT haiwenchen precisediagnosisofthreetopcancersusingdbgapdata AT junliangli precisediagnosisofthreetopcancersusingdbgapdata AT yuhuang precisediagnosisofthreetopcancersusingdbgapdata AT chengyaoji precisediagnosisofthreetopcancersusingdbgapdata AT wenwenliu precisediagnosisofthreetopcancersusingdbgapdata AT xiaohuluo precisediagnosisofthreetopcancersusingdbgapdata AT lilixiao precisediagnosisofthreetopcancersusingdbgapdata |
_version_ |
1724334599035682816 |