DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sites

Abstract Background The hepatitis B virus (HBV) is one of the main causes of viral hepatitis and liver cancer. HBV integration is one of the key steps in the virus-promoted malignant transformation. Results An attention-based deep learning model, DeepHBV, was developed to predict HBV integration sit...

Full description

Bibliographic Details
Main Authors: Canbiao Wu, Xiaofang Guo, Mengyuan Li, Jingxian Shen, Xiayu Fu, Qingyu Xie, Zeliang Hou, Manman Zhai, Xiaofan Qiu, Zifeng Cui, Hongxian Xie, Pengmin Qin, Xuchu Weng, Zheng Hu, Jiuxing Liang
Format: Article
Language:English
Published: BMC 2021-07-01
Series:BMC Ecology and Evolution
Subjects:
Online Access:https://doi.org/10.1186/s12862-021-01869-8
id doaj-cb44f2c3dbe64691b8ecc84e9532b230
record_format Article
spelling doaj-cb44f2c3dbe64691b8ecc84e9532b2302021-08-29T11:05:11ZengBMCBMC Ecology and Evolution2730-71822021-07-0121111010.1186/s12862-021-01869-8DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sitesCanbiao Wu0Xiaofang Guo1Mengyuan Li2Jingxian Shen3Xiayu Fu4Qingyu Xie5Zeliang Hou6Manman Zhai7Xiaofan Qiu8Zifeng Cui9Hongxian Xie10Pengmin Qin11Xuchu Weng12Zheng Hu13Jiuxing Liang14Institute for Brain Research and Rehabilitation, South China Normal UniversityDepartment of Medical Oncology of the Eastern Hospital, the First Affiliated Hospital, Sun Yat-Sen UniversityDepartment of Gynecological Oncology, the First Affiliated Hospital, Sun Yat-Sen UniversityInstitute for Brain Research and Rehabilitation, South China Normal UniversityDepartment of Thoracic Surgery, the First Affiliated Hospital, Sun Yat-Sen UniversityInstitute for Brain Research and Rehabilitation, South China Normal UniversityInstitute for Brain Research and Rehabilitation, South China Normal UniversityInstitute for Brain Research and Rehabilitation, South China Normal UniversityInstitute for Brain Research and Rehabilitation, South China Normal UniversityDepartment of Gynecological Oncology, the First Affiliated Hospital, Sun Yat-Sen UniversityGenerulor Company Bio-X LabSchool of Psychology, South China Normal UniversityInstitute for Brain Research and Rehabilitation, South China Normal UniversityDepartment of Gynecological Oncology, the First Affiliated Hospital, Sun Yat-Sen UniversityInstitute for Brain Research and Rehabilitation, South China Normal UniversityAbstract Background The hepatitis B virus (HBV) is one of the main causes of viral hepatitis and liver cancer. HBV integration is one of the key steps in the virus-promoted malignant transformation. Results An attention-based deep learning model, DeepHBV, was developed to predict HBV integration sites. By learning local genomic features automatically, DeepHBV was trained and tested using HBV integration site data from the dsVIS database. Initially, DeepHBV showed an AUROC of 0.6363 and an AUPR of 0.5471 for the dataset. The integration of genomic features of repeat peaks and TCGA Pan-Cancer peaks significantly improved model performance, with AUROCs of 0.8378 and 0.9430 and AUPRs of 0.7535 and 0.9310, respectively. The transcription factor binding sites (TFBS) were significantly enriched near the genomic positions that were considered. The binding sites of the AR-halfsite, Arnt, Atf1, bHLHE40, bHLHE41, BMAL1, CLOCK, c-Myc, COUP-TFII, E2A, EBF1, Erra, and Foxo3 were highlighted by DeepHBV in both the dsVIS and VISDB datasets, revealing a novel integration preference for HBV. Conclusions DeepHBV is a useful tool for predicting HBV integration sites, revealing novel insights into HBV integration-related carcinogenesis.https://doi.org/10.1186/s12862-021-01869-8Deep learningHBV integration sitesGenomic featuresBioinformatics
collection DOAJ
language English
format Article
sources DOAJ
author Canbiao Wu
Xiaofang Guo
Mengyuan Li
Jingxian Shen
Xiayu Fu
Qingyu Xie
Zeliang Hou
Manman Zhai
Xiaofan Qiu
Zifeng Cui
Hongxian Xie
Pengmin Qin
Xuchu Weng
Zheng Hu
Jiuxing Liang
spellingShingle Canbiao Wu
Xiaofang Guo
Mengyuan Li
Jingxian Shen
Xiayu Fu
Qingyu Xie
Zeliang Hou
Manman Zhai
Xiaofan Qiu
Zifeng Cui
Hongxian Xie
Pengmin Qin
Xuchu Weng
Zheng Hu
Jiuxing Liang
DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sites
BMC Ecology and Evolution
Deep learning
HBV integration sites
Genomic features
Bioinformatics
author_facet Canbiao Wu
Xiaofang Guo
Mengyuan Li
Jingxian Shen
Xiayu Fu
Qingyu Xie
Zeliang Hou
Manman Zhai
Xiaofan Qiu
Zifeng Cui
Hongxian Xie
Pengmin Qin
Xuchu Weng
Zheng Hu
Jiuxing Liang
author_sort Canbiao Wu
title DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sites
title_short DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sites
title_full DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sites
title_fullStr DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sites
title_full_unstemmed DeepHBV: a deep learning model to predict hepatitis B virus (HBV) integration sites
title_sort deephbv: a deep learning model to predict hepatitis b virus (hbv) integration sites
publisher BMC
series BMC Ecology and Evolution
issn 2730-7182
publishDate 2021-07-01
description Abstract Background The hepatitis B virus (HBV) is one of the main causes of viral hepatitis and liver cancer. HBV integration is one of the key steps in the virus-promoted malignant transformation. Results An attention-based deep learning model, DeepHBV, was developed to predict HBV integration sites. By learning local genomic features automatically, DeepHBV was trained and tested using HBV integration site data from the dsVIS database. Initially, DeepHBV showed an AUROC of 0.6363 and an AUPR of 0.5471 for the dataset. The integration of genomic features of repeat peaks and TCGA Pan-Cancer peaks significantly improved model performance, with AUROCs of 0.8378 and 0.9430 and AUPRs of 0.7535 and 0.9310, respectively. The transcription factor binding sites (TFBS) were significantly enriched near the genomic positions that were considered. The binding sites of the AR-halfsite, Arnt, Atf1, bHLHE40, bHLHE41, BMAL1, CLOCK, c-Myc, COUP-TFII, E2A, EBF1, Erra, and Foxo3 were highlighted by DeepHBV in both the dsVIS and VISDB datasets, revealing a novel integration preference for HBV. Conclusions DeepHBV is a useful tool for predicting HBV integration sites, revealing novel insights into HBV integration-related carcinogenesis.
topic Deep learning
HBV integration sites
Genomic features
Bioinformatics
url https://doi.org/10.1186/s12862-021-01869-8
work_keys_str_mv AT canbiaowu deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT xiaofangguo deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT mengyuanli deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT jingxianshen deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT xiayufu deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT qingyuxie deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT zelianghou deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT manmanzhai deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT xiaofanqiu deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT zifengcui deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT hongxianxie deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT pengminqin deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT xuchuweng deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT zhenghu deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
AT jiuxingliang deephbvadeeplearningmodeltopredicthepatitisbvirushbvintegrationsites
_version_ 1721187198073569280