Spike protein-based epitopes predicted against SARS-CoV-2 through literature mining

Background: With the diffusion of SARS-CoV-2 around the world, human health is being threatened. As there is no effective vaccine yet, the development of the vaccine is urgently in progress. Materials and methods: Immunoinformatics methods were applied to predict epitopes from the Spike protein thro...

Full description

Bibliographic Details
Main Authors: Wendong Li, Lin Li, Ting Sun, Yufei He, Guang Liu, Zixuan Xiao, Yubo Fan, Jing Zhang
Format: Article
Language:English
Published: Elsevier 2020-12-01
Series:Medicine in Novel Technology and Devices
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2590093520300229
Description
Summary:Background: With the diffusion of SARS-CoV-2 around the world, human health is being threatened. As there is no effective vaccine yet, the development of the vaccine is urgently in progress. Materials and methods: Immunoinformatics methods were applied to predict epitopes from the Spike protein through mining literature associated with B- and T-cell epitopes prediction published or preprinted since the outbreak of the virus till June 1, 2020. 3D structure of the Spike protein were obtained (PDB ID: 6VSB) for prediction of discontinuous B-cell epitopes and localization of epitopes in the hotspot regions. Results: Methods provided by the Immune Epitope Database (IEDB) server were the most frequently used to predict epitopes. Sequence alignment of the epitopes extracted from literature with the Spike protein demonstrated that the epitopes in different studies converged to multiple short hotspot regions. There were three hotspot regions found in RBD of the Spike protein harboring B-cell linear epitopes (‘RQIAPGQTGKIADYNYKLPD’, ‘SYGFQPTNGVGYQ’ and ‘YAWNRKRISNCVA’) predicted to have high antigenicity score. Two T-cell epitopes (‘KPFERDISTEIYQ’ and ‘NYNYLYRLFR’) predicted to be highly antigenic in the original studies were discovered in the hotspot region. Toxicity and allergenicity analysis confirmed all the five epitopes are of non-toxin, and four of them are of non-allergen. The five epitopes identified in hotspot regions of RBD were found fully exposed based on the 3D structure of the Spike protein. Conclusion: The five epitopes we discovered from literature mining may be potential candidates for diagnostics and vaccine development against SARS-CoV-2.
ISSN:2590-0935