Anti-COVID-19 multi-epitope vaccine designs employing global viral genome sequences

Tahreem Zaheer; Maaz Waseem; Walifa Waqar; Hamza Arshad Dar; Muhammad Shehroz; Kanwal Naz; Zaara Ishaq; Tahir Ahmad; Nimat Ullah; Syeda Marriam Bakhtiar; Syed Aun Muhammad; Amjad Ali

doi:10.7717/peerj.9541

Anti-COVID-19 multi-epitope vaccine designs employing global viral genome sequences

Tahreem Zaheer¹, Maaz Waseem¹, Walifa Waqar¹, Hamza Arshad Dar¹, Muhammad Shehroz², Kanwal Naz¹, Zaara Ishaq¹, Tahir Ahmad¹, Nimat Ullah¹, Syeda Marriam Bakhtiar³, Syed Aun Muhammad⁴, Amjad Ali ¹

1Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad, Pakistan

2Department of Biotechnology, Virtual University of Pakistan, Peshawar, Pakistan

3Department of Bioinformatics and Biosciences, Capital University of Science and Technology, Islamabad, Pakistan

4Institute of Molecular Biology and Biotechnology, Bahauddin Zakariya University, Multan, Pakistan

DOI: 10.7717/peerj.9541

Published: 2020-08-03
Accepted: 2020-06-23
Received: 2020-04-09

Academic Editor: Brenda Oppert

Subject Areas: Bioinformatics, Biotechnology, Immunology
Keywords: SARS-CoV-2, COVID-19, Global pandemic, Immunoinformatics, Immunity

Copyright: © 2020 Zaheer et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Zaheer T, Waseem M, Waqar W, Dar HA, Shehroz M, Naz K, Ishaq Z, Ahmad T, Ullah N, Bakhtiar SM, Muhammad SA, Ali A. 2020. Anti-COVID-19 multi-epitope vaccine designs employing global viral genome sequences. PeerJ 8:e9541 https://doi.org/10.7717/peerj.9541

The authors have chosen to make the review history of this article public.

Abstract

Background

The coronavirus SARS-CoV-2 is a member of the Coronaviridae family that has caused a global public health emergency. Currently, there is no approved treatment or vaccine available against it. The current study aimed to cover the diversity of SARS-CoV-2 strains reported from all over the world and to design a broad-spectrum multi-epitope vaccine using an immunoinformatics approach.

Methods

For this purpose, all available complete genomes were retrieved from GISAID and NGDC followed by genome multiple alignments to develop a global consensus sequence to compare with the reference genome. Fortunately, comparative genomics and phylogeny revealed a significantly high level of conservation between the viral strains. All the Open Reading Frames (ORFs) of the reference sequence NC_045512.2 were subjected to epitope mapping using CTLpred and HLApred, respectively. The predicted CTL epitopes were then screened for antigenicity, immunogenicity and strong binding affinity with HLA superfamily alleles. HTL predicted epitopes were screened for antigenicity, interferon induction potential, overlapping B cell epitopes and strong HLA DR binding potential. The shortlisted epitopes were arranged into two multi-epitope sequences, Cov-I-Vac and Cov-II-Vac, and molecular docking was performed with Toll-Like Receptor 8 (TLR8).

Results

The designed multi-epitopes were found to be antigenic and non-allergenic. Both multi-epitopes were stable and predicted to be soluble in an Escherichia coli expression system. The molecular docking with TLR8 also demonstrated that they have a strong binding affinity and immunogenic potential. These in silico analyses suggest that the proposed multi-epitope vaccine can effectively evoke an immune response.

Introduction

COVID-19 has been declared a pandemic by the World Health Organization (WHO) as of the 11 March 2020 (WHO, 2020). The outbreak of SARS-CoV-2 infection starting in Wuhan, China has now affected millions of lives across the world. The common symptoms patients exhibit include fatigue, fever, dry cough, shortness of breath and upper airway congestion (Chan, Wong & Tang, 2020). The delay in symptom appearance due to the incubation period of the virus has caused more harm as infected people without symptoms might have already infected many others even before knowing they are carrying the infection. The number of infected people is increasing daily. Scientists are working tirelessly to find a cure, but as of now, the impasse continues.

Coronaviruses (CoVs) are a family of viruses causing diseases ranging from the common cold to more severe illnesses such as Severe Acute Respiratory Syndrome (SARS, caused by SARS-CoV) and Middle East Respiratory Syndrome (MERS, caused by MERS-CoV) (Ceraolo & Giorgi, 2020b; Sahin, 2020). Belonging to the family of Betacoronavirus, the latest CoV, recently named as SARS-CoV-2, is an enveloped spherical virion about 60–140 nm in diameter. It has a positive-sense single-stranded RNA genome of ~30 kb. The virus is believed to have originated from bats and is phylogenetically linked closest to the bat-SL-CoVZC45 and bat-SL-CoVZXC21 (Zhu et al., 2020). The virus is highly contagious and spreads from person-to-person through the respiratory droplets when an infected person sneezes or coughs or when a person encounters contaminated surfaces with the virus. The disease state caused by viral infection has been termed as COVID-19 (Chan, Wong & Tang, 2020).

The genome contains six open reading frames (ORFs), including ORF1ab, ORF3a, ORF6, ORF7a, ORF8, ORF10, encoding spike glycoprotein trimer S, nucleoprotein N, membrane protein M and envelopes small membrane protein pentamer E (Zhu et al., 2020). SARS-CoV-2 causes wide-ranging infections like mild upper respiratory tract disease, severe viral pneumonia with respiratory failure and severe cases, even death (Huang et al., 2020).

Considering the gravity of the situation of COVID-19 worldwide, there is a dire need for the development of an effective vaccine or antiviral drugs that are effective in treating this viral infection. Currently, there is no FDA approved treatment or licensed vaccine against COVID-19. Vaccination is the most desired option available at our disposal to eradicate the diseases associated with infectious microorganisms (Greenwood, 2014). Reverse vaccinology is a valuable technique that adopts genome-level analysis to identify potential antigenic determinants of the pathogenic microorganism (Rappuoli, 2001). Previously designed multi-epitope vaccines against various emerging viruses have been reported as safe and generate potent immune responses.

To elicit strong immunogenic responses and have long-term efficacy, vaccines against coronavirus must activate both humoral and cell-mediated immune responses (Amanna & Slifka, 2011). Virus-specific T cells, the effector T cells, provide vaccine-mediated protection at the peak of antiviral response and the antiviral cytokines production is also increased (Sallusto, Geginat & Lanzavecchia, 2004), but a successful vaccine would activate B-cell mediated humoral immunity as well. In the case of re-infection, memory B cells can be re-activated via B cell receptors and provide protection (Amanna, Carlson & Slifka, 2007). Therefore, it is a challenge to activate both humoral as well as cell mediated immunity of required levels to effectively provide protection against COVID-19.

In this study, we have utilized the 475 genomes of SARS-CoV-2 available in NGDC (available at: https://bigd.big.ac.cn/) and GISAID (available at https://www.epicov.org/epi3/cfrontend) database as of 15 March 2020 to identify conserved immunogenic epitopes representing the novel coronavirus strains (collection) and to design a potent multi-epitope vaccine against the COVID-19 infection.

Materials and Methods

Retrieval of sequences and multiple alignment

GISAID (available at https://www.epicov.org/epi3/cfrontend) and NGDC databases were mined to retrieve updated complete genomes of SARS-CoV-2. The retrieved genomes were then aligned using the Multiple Alignment using Fast Fourier Transform tool (MAFFT) (available at: https://www.ebi.ac.uk/Tools/msa/mafft/) (Katoh et al., 2002). Default parameters were used encompassing 200 PAM/κ = 2 scoring matrix and a gap penalty of 1.53. These aligned sequences were visualized and analyzed in Unipro UGENE v.34 software (Okonechnikov et al., 2012) to generate a consensus sequence of all genomes and later, nBLAST was performed to analyze its diversity with Reference sequence NC_045512.2. The ORFs from this sequence were retrieved and subjected to further analysis of predicted proteins.

CTL epitope mapping

CTLpred (available at: http://crdd.osdd.net/raghava/ctlpred/) was used for prediction of CD8 T cell epitopes (Bhasin & Raghava, 2004). Consensus approach was used that predicted CTL epitopes based on both Support Vector Machine (SVM) and Artificial Neural Network (ANN) methods using default thresholds (0.36 for SVM method and 0.51 for ANN method). For further analyses top 100 peptides were selected. These predicted epitopes were also verified by HLApred (available at: http://crdd.osdd.net/raghava/hlapred/) and ProPred1 (available at: http://crdd.osdd.net/raghava/propred1/) (Singh & Raghava, 2003).

Antigenicity

All CTL epitopes were screened for antigenicity using VaxiJen (available at: http://www.ddg-pharmfac.net/vaxijen/) (Doytchinova & Flower, 2007). VaxiJen is an online server that predicts the antigenicity of peptides using the physicochemical properties of the proteins irrespective of their lengths and the need of alignments. We used VaxiJen server for our CTL epitopes using the default threshold 0.5 for viruses.

Immunogenicity

After screening for the antigenic potential of CTL epitopes, epitopes having immunogenic potential were screened using the IEDB analysis resource for MHC class I Immunogenicity (available at: http://tools.iedb.org/immunogenicity/) (Calis et al., 2013). This tool works best for 9-mer epitopes but larger lengths can be employed too. Only those epitopes that showed a strong binding affinity with HLA super family alleles were shortlisted. Default threshold was used for prediction of results. Epitopes having percentage rank less than 0.5 were qualified as strong binders.

Strong binding affinity with HLA alleles and population coverage analysis

CTL Epitopes that were both antigenic and immunogenic were screened for their binding affinity with HLA superfamily alleles using Net MHC 4.0 (available at: http://www.cbs.dtu.dk/services/NetMHC/) (Nielsen et al., 2003; Andreatta & Nielsen, 2016). The tool predicts results on the basis of an ANN that has been trained using 81 different HLA alleles. MHC binding affinity of shortlisted epitopes was checked against HLA superfamily alleles (i.e., HLA-A0101, HLA-A0201, HLA-A0301, HLA-A2402, HLA-A2601, HLA-B0702, HLA-B0801, HLA-B2705, HLA-B3901, HLA-B4001, HLA-B5801, HLA-B1501). HLA supertype representative are clusters of HLA alleles based on their overlapping peptide binding specificity, hence can cover 80% of population in a less computationally exhaustive environment. The concept has been effectively utilized in prediction of promiscuously recognized epitopes against SARS, HCV, vaccinia virus and HBV (Frahm et al., 2007; Sidney et al., 2008). Furthermore, population coverage analysis was performed using IEDB resource server. The tool predicts population coverage of epitopes based upon the restriction database (Bui et al., 2006).

HTL epitope mapping and population coverage analysis

The ORFs that didn’t yielded prioritized CTL epitopes were subjected to Helper T Lymphocytes (HTL) epitope (CD4 T cell epitopes) prediction by the HLApred server. This was done to cover the whole proteome of the virus. All alleles of MHC-II were selected, and experimental as well as predicted binders were included in the analysis. A default threshold was used, and all predicted epitopes that shared no identity with humans were selected. The predicted epitopes were screened for their antigenic potential using VaxiJen as discussed earlier. The antigenic epitopes were further screened for their IFN-gamma induction potential using IFNepitope using hybrid approach that involves both SVM and Motif based approach. The prediction model chosen for analysis was IFN-gamma vs non IFN-gamma (available at: http://crdd.osdd.net/raghava/ifnepitope/scan.php) (Dhanda, Vir & Raghava, 2013). The epitopes screened positive for IFN-gamma were checked for overlapped B cell epitopes predicted by ABCpred (available at: http://crdd.osdd.net/raghava/abcpred/) using a default threshold (Saha & Raghava, 2006b). Lastly, the shortlisted epitopes were checked for their strong binding affinity with all the HLA DR alleles present in Net MHCII server (available at: http://www.cbs.dtu.dk/services/NetMHCII/). These alleles were DRB1_0101, DRB1_0103, DRB1_0301, DRB1_0401, DRB1_0402, DRB1_0403, DRB1_0404, DRB1_0405, DRB1_0701, DRB1_0801, DRB1_0802, DRB1_0901, DRB1_1001, DRB1_1101, DRB1_1201, DRB1_1301, DRB1_1302, DRB1_1501, DRB1_1602, DRB3_0101, DRB3_0202, DRB3_0301, DRB4_0101, DRB4_0103, DRB5_0101. HLA-DR alleles are present on the surface of antigen-presenting cells and present the epitopes to helper T-cells initiating an immune response (Smith-Garvin, Koretzky & Jordan, 2009). HLA DRB alleles were reported to be involved in viral clearance, hence all the available HLA DRB alleles in server were selected for analysis (Barrett, Ryan & Crowe, 1999; Keicho et al., 2009). Lastly, Population coverage analysis was performed using IEDB population coverage analysis tool.

Multi-epitope vaccine design and construction

Epitopes were arranged as per their arrangement in the reference sequence, the CTL epitopes were linked together using a flexible linker GGGGS as recommended by Chen, Zaro & Shen (2013), and HTL epitopes with GPGPG linker to construct a multi-epitope (Saadi, Karkhah & Nouri, 2017). Moreover, a second multi-epitope construct was also designed where β-defensin, an adjuvant, was added at the N terminus of multi-epitope with EAAAK linker (Shey et al., 2019).

Sequence-based physio-chemical properties of the multi-epitope design

Different physicochemical properties of multi-epitopes were predicted using ProtParam tool including prediction of in vivo half-life, theoretical isoelectric point (pI), molecular weight, instability index, aliphatic index and Grand Average of Hydropathy (GRAVY) index of peptides (Wilkins et al., 1999; Gasteiger et al., 2005). The degradation of peptides was based on amino acid at N-terminus hence N-end rule in Protparam was employed to predict in vivo half-life of both peptides. Instability index with a threshold of 40 was used in order to determine the stability of the protein in a cellular environment. The aliphatic and GRAVY index of both peptides was calculated based on their amino acid profiles.

Allergenicity of multi-epitope

Both multi-epitope constructs (with and without adjuvant) were screened for their allergenicity using two online servers; AlgPred (available at: http://crdd.osdd.net/raghava/algpred/) (Saha & Raghava, 2006a) and Allergen FP (available at: http://ddg-pharmfac.net/AllergenFP/) (Dimitrov et al., 2014). In AlgPred, all four approaches were used including mapping of IgE epitopes and PID approach (allergenicity prediction is based on similarity with experimental IgE epitopes), BLAST search on allergen representative peptides (makes prediction by performing BLAST search against 2890 allergen representative peptides), MEME/MAST motif approach (predicts allergenic motifs using MAST) and hybrid approach. Allergen FP predicts the allergenicity of peptide using an alignment-free method by implementing a four-step algorithm and compared by Tanimoto coefficient.

Antigenicity of multi-epitope design and modeling

The antigenicity profiles of both multi-epitope constructs were predicted again to confirm if the epitopes were antigenic or not. We used two servers this time; VaxiJen (available at: http://www.ddg-pharmfac.net/vaxijen/) (Doytchinova & Flower, 2007) and AntigenPro (Magnan et al., 2010). Scratch Protein Predictor server available at http://scratch.proteomics.ics.uci.edu/ hosted the tool Antigen Pro for antigenicity estimation. AntigenPro, like VaxiJen, predicts antigenicity using physicochemical properties, irrespective of the length of peptides as well as its alignment.

Secondary structure prediction

For evaluation of further structural characteristics of designed multi-epitopes, PDBsum (available at http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/) (Laskowski et al., 2018) was used. PDBsum provides insight into the unique structural aspects of proteins, peptides and their ligands.

Tertiary structure prediction and refinement

Tertiary structures of both vaccines were predicted by Scratch Protein Predictor-hosted 3Dpro tool (available at http://scratch.proteomics.ics.uci.edu/) (Cheng et al., 2005) and refined by Galaxy refine (available at http://galaxy.seoklab.org/cgi-bin/submit.cgi?type=REFINE) (Ko et al., 2012). The model showing highest Ramachandran favored residues and minimum poor rotamers were selected. The refined structures were checked for their stability and flexibility using molecular dynamics (MD) simulation studies. Structural flexibility of a protein/peptide is important for its molecular recognition and its function, so a coarse-grained protein model implemented in a webserver CABS-Flex 2.0 (available at: http://biocomp.chem.uw.edu.pl/CABSflex/) (Kuriata et al., 2018) was used for near-native dynamics of both vaccines. Default distance restrained parameters were used, number of cycles and cycles between trajectory frames were raised to 100. Temperature of simulation was also kept default.

Molecular docking with Toll-Like Receptor 8

The models were then checked for their interaction with Toll-Like Receptor 8 (TLR8). Active and passive residues for these analyses were predicted using CPORT (De Vries & Bonvin, 2011). Molecular docking was performed by using guru level interface of HADDOCK 2.2 (available at: http://haddock.science.uu.nl/services/HADDOCK/haddockserver-guru.html) (Van Zundert et al., 2016) with default parameters and a representative structure from the top-ranked docked cluster with minimum HADDOCK score was refined using Refinement Interface (available at: http://haddock.science.uu.nl/services/HADDOCK/haddockserver-refinement.html) (Van Zundert et al., 2016). Moreover, NMA analysis of refined complexes was also performed using iMOD in order to determine their deformation potential (Kremer, Mastronarde & McIntosh, 1996) (available at: https://bio3d.colorado.edu/imod/paper/). The interacting residues between each complex were predicted using PDBsum analysis (Laskowski et al., 2018).

Codon optimization and in silico cloning in Escherichia coli

Codon optimization of both multi-epitopes was performed using JCAT (available at: http://www.jcat.de/) (Grote et al., 2005) and codons were adapted as per E. coli K12 strain. Default parameters were used and plasmid UC19 was used for in silico cloning of both multi-epitopes. Moreover, a His6 tag was added at the both ends of sequences for the purification of multi-epitope proteins.

Results

Retrieval of sequences and multiple sequence alignments

The recruited 475 complete SARS-CoV-2 genomes showed a high level of conservancy upon multiple sequence alignments and in the phylogenetic tree (Fig. 1). The consensus sequence was found to be 99% identical, with 10 gaps and no mismatches with Ref seq NC_045512.2 from Wuhan. There were 11 ORFs available in Refseq NC_045512.2 for protein coding sequence (ORF1ab (YP_009724389.1), Spike Glycoprotein (YP_009724390.1), ORF3a (YP_009724391.1), Envelope Protein (YP_009724392.1), Membrane Glycoprotein (YP_009724393.1), ORF6 (YP_009724393.1), ORF7a (YP_009724393.1), ORF7b (YP_009725318.1), ORF8 (YP_009724396.1), ORF9 (YP_009724397.2) and ORF10 (YP_009725255.1)). These ORFs were subjected to epitope mapping.

Figure 1: Phylogenetic tree analysis of 475 complete genomes of SARS-CoV-2 based on full genome nucleotide sequences using the UPGMA.
Outer ring displays genomes with different colors for each country.

Download full-size image

DOI: 10.7717/peerj.9541/fig-1

CTL epitope mapping

A total of 13 CTL epitopes were shortlisted for designing multi-epitopes. These shortlisted epitopes were predicted to be antigenic (as determined by VaxiJen tool), immunogenic (as determined by MHC-I IEDB Immunogenicity tool), and had potential to bind with HLA superfamily alleles (as predicted by NetMHC) (Table 1). All the screened epitopes had 88.42% population coverage. The details of epitopes predicted from each ORFs filtered at each stage along with their properties have been provided in Table S1.

Table 1:

Shortlisted CTL epitopes for multi-epitope design.

Sr. No.	Position	Epitope	CTLpred score (SVM/ANN)	MHC	Antigenicity	Immunogenicity	Strong Binding
ORF1ab
1	1561	SLREVRTIK	0.97/1.3171709	15	0.7073	0.31699	67.66 0.40 <= SB HLA-A0301
2	2363	WLMWLIINL	1.00/1.1904209	10	1.0138	0.38891	6.60 0.06 <= SB HLA-A0201
3	214	TLSEQLDFI	0.90/0.83436257	11	1.0499	0.01093	26.42 0.40 <= SB HLA-A0201
4	5679	YVFCTVNAL	0.77/0.92267704	10	0.5377	0.07781	10.00 0.03 <= SB HLA-B3901
5	2351	FSYFAVHFI	0.66/0.97709163	13	0.8806	0.28926	133.13 0.50 <= SB HLA-B5801
6	2335	TRFFYVLGL	0.87/0.7530986	10	0.5606	0.2015	40.93 0.17 <= SB HLA-B2705
7	5255	AIDAYPLTK	0.82/0.78760102	10	0.5972	0.04585	95.81 0.50 <= SB HLA-A0301
Surface glycoprotein
8	350	VYAWNRKRI	0.93/0.49736819	11	0.5003	0.12625	0.30 <= SB HLA-A2402
9	1060	VVFLHVTYV	0.59/0.77948535	10	1.5122	0.1278	0.50 <= SB HLA-A0201
Membrane glycoprotein
10	136	SELVIGAVI	0.92/0.48139255	13	0.6409	0.25658	0.15 <= SB HLA-B4001
ORF6
11	3	HLVDFQVTI	0.59/1.0193905	13	1.4119	0.0982	0.40 <= SB HLA-A0201
ORF7a
12	101	FLIVAAIVF	0.53/0.37735924	9	0.586	0.29611	0.25 <= SB HLA-B1501
ORF7b
13	26	IIFWFSLEL	0.88/0.96108642	9	0.8291	0.2683	0.40 <= SB HLA-A0201

DOI: 10.7717/peerj.9541/table-1

Note:

CTL epitopes presenting antigenicity, immunogenicity and strong immunogenic potential shortlisted for designing multi-epitopes.

HTL epitope mapping

A total of 10 HTL epitopes were scrutinized for multi-epitopes. These shortlisted epitopes were predicted to lack homology with Human proteome (as determined by HLApred), had overlapping B cell epitopes (as determined by ABCpred), showed potential to induce IFN-gamma response (as determined by IFNepitope server) and demonstrated strong binding affinity with HLA DR alleles (as predicted by NetMHCII) (Table 2). All the epitopes had 89.70% population coverage (as predicted by IEDB population coverage analysis tool). The detail of step wise scrutiny and all the initially predicted epitopes is provided in Table S2.

Table 2:

HTL epitopes prioritized for multi-epitope vaccine design.

Sr. No.	Epitope	Antigenic	IFN	Binding affinity	Affinity (nM)	MHC
Envelope
1	FLLVTLAIL	0.9645	0.103076	DRB1_0103, DRB3_0301	9,440.3, 654.2	11
ORF8
2	LGIITTVAA	0.8663	0.10392924	DRB1_0801	646.6	30
3	WYIRVGARK	1.1953	0.10404837	DRB5_0101, DRB4_0103, DRB1_0801	661.7, 25.5, 228.8	10
ORF9
4	IGYYRRATR	0.888	0.47631827	DRB4_0103, DRB1_0801	62.7, 480.3	12
5	VILLNKHID	0.5902	0.18486858	DRB1_0801	309.6	22
ORF3a
6	IITLKKRWQ	1.7866	0.16918449	DRB1_0801	668.3	22
7	ITLKKRWQL	1.9347	0.41796429	DRB1_1301, DRB4_0103	32.8, 191.2	14
8	LLLVAAGLE	0.8981	0.03978107	DRB1_0801	763	46
9	VRATATIPI	0.7515	0.18967277	DRB1_0701, DRB1_1302, DRB3_0301	13.2, 527.3, 120.8	25
10	FLCWHTNCY	0.8927	0.26613992	DRB1_0402	8,117.1	9

DOI: 10.7717/peerj.9541/table-2

Note:

CD4 T cell epitopes having strong antigenicity, overlapping B cell epitopes and IFN induction potential that were screened to design multi epitopes.

Multi-epitope structure design and modeling

Two multi-epitope constructs were designed with and without β-defensin adjuvant to analyze and compare whether the designed multi-epitope is efficacious only in presence of adjuvant or has the ability to elicit immune response solely. Cov-I-Vac was the multi-epitope construct without adjuvant and Cov-II-Vac was with adjuvant (Fig. 2).

Figure 2: Graphical representation of Cov-I-Vac and Cov-II-Vac.
All the available ORFs were subjected to epitope mapping and shortlisted epitopes were used to design Cov-I-Vac. Furthermore, beta defensin adjuvant was added at N-terminus to design Cov-II-Vac. Cov-I-vac and Cov-II-vac have CTL as well as HTL epitopes to incite cell mediated as well as humoral immunity.

Download full-size image

DOI: 10.7717/peerj.9541/fig-2

Sequence-based physio-chemical properties of multi-epitope

Physio-chemical analysis of Cov-I-Vac and Cov-II-Vac revealed that both constructs are reasonably stable (38.48 and 39.91 respectively), have molecular weight <50 kDa, lower GRAVY indexes (i.e., 0.394 and 0.249, respectively), with a pI of 9.97 and 10.03, respectively. The N terminus of Cov-I-Vac has Threonine so its half-life in mammalian reticulocytes was predicted to be 7.5 h (if tested in vitro) using Protparam. Meanwhile, Cov-II-Vac, because of Glycine at N terminus, has a half-life of >30 h in mammalian reticulocytes, as predicted using N-end rule. However, no difference in in vivo half-life was predicted, and was found to be >20 h in yeast and >10 h in E. coli.

Antigenicity, allergenicity and immunogenicity of Cov-I-Vac and Cov-II-Vac

The antigenicity potential was found to be retained in both multi-epitopes and was estimated as 0.6153 and 0.5947 for Cov-I-Vac and Cov-II-Vac, respectively. Both constructs were reported as antigenic by AntigenPro as well with an antigenicity score of 0.516406 and 0.386605.

Moreover, both vaccines were soluble upon overexpression in E. coli. The safety profile also showed that constructs do not exhibit any experimentally reported allergens and thus projected to be non-allergens.

Secondary structure of Cov-I-Vac and Cov-II-Vac

The secondary structure prediction of Cov-I-Vac shows that it has a sheet and beta hairpin between two strands (Ala160-Il161, Gly164-Gly165), six helices, 42 beta turns and two gamma turns (Fig. 3A). The secondary structure of Cov-II-Vac shows that it has a sheet and two beta hairpins among three strands (Val193-Thr198, Gly201-Val208, Gly216-Gly217), 10 helices, five helix-helix interactions, and 37 beta turns (Fig. 3B).

Figure 3: Secondary structure of multi-epitope vaccines designed against SARS-CoV-2.
(A) Secondary structure of Cov-I-Vac. (B) Secondary structure of Cov-II-Vac.

Download full-size image

DOI: 10.7717/peerj.9541/fig-3

Tertiary structure prediction and refinement

The tertiary structure of Cov-I-Vac had 10.2% poor rotamers with 90.5% Rama favored residues; however, the refined structure selected for further analysis had an RMSD of 0.541 with 0% poor rotamers and 93% Rama favored residues. Similarly, the initial structure of Cov-II-Vac had 5.9% poor rotamers with 89% Rama favored residues, but the refined shortlisted structure with RMSD 0.548 had 92.9% Rama favored residues and 0% poor rotamers.

CABS-flex analysis

The trajectory of 10 models was generated after simulation (Fig. 4). The Root Mean Square Fluctuation (RMSF) of this simulation was found to be between 3.9 and 5.8 Å.

Figure 4: Fast dynamic simulation of Cov-I-Vac.
(A) Trajectory analysis of Cov-I-Vac showing that multi-epitope remained fairly stable during the perturbations. (B) RMSF of multi-epitope during simulation.

Download full-size image

DOI: 10.7717/peerj.9541/fig-4

The trajectory of Cov-II-Vac after fast simulation

Trajectory analysis of fast simulation of Cov-II-Vac was analyzed and the RMSF of the structure was found to be 0.75–7.25 Å. The maximum fluctuation was observed at residue 227 while minimum fluctuation was residue 39–59 (Fig. 5).

Figure 5: Trajectory analysis of Cov-II-Vac using CABS-flex analysis.
(A) Trajectory of multi-epitope protein (B) RMSF of multi-epitope during simulation.

Download full-size image

DOI: 10.7717/peerj.9541/fig-5

The active and passive residues of Cov-I-Vac, Cov-II-Vac and TLR8 are provided in Table S3. The molecular docking of Cov-I-Vac with TLR8 resulted in the top-ranked cluster with a HADDOCK score of −40.8 +/− 5.5. Low values of HADDOCK score, especially those that are negative, usually imply significantly high interaction between proteins. HADDOCK refined the representative structure of this complex and clustered the resulting 20 structures into one complex that represented 100% of the refined water models generated. After undergoing molecular refinements, the HADDOCK score of the Cov-I-Vac-TLR8 complex improved significantly to reach −251.3 +/− 2.3. The details of the refined interaction along with statistical parameters are provided in Table 3. Cov-II-Vac also showed significant interaction with TLR8 that was −83.5 +/− 6.3 which was a higher interaction score than Cov-I-Vac. However, after refinements, the interaction of Cov-II-Vac with TLR8 was lower compared to Cov-I-Vac-TLR8 interaction, that is, −217.4 +/− 3.5. The details of HADDOCK score with all its parameters are mentioned in Table 3.

Table 3:

Molecular docking score of multi-epitopes with TLR8.

Parameters	Cov-I-Vac	Cov-II-Vac
HADDOCK score	−251.3 ± 2.3	−217.4 ± 3.5
Cluster size	20	20
RMSD from the overall lowest-energy structure	0.3 ± 0.2	0.3 ± 0.2
Van der Waals energy	−165.3 ± 6.6	−149.7 ± 2.7
Electrostatic energy	−305.3 ± 21.6	−160.7 ± 21.6
Desolvation energy	−25.0 ± 5.4	−35.6 ± 3.9
Restraints violation energy	0.9 ± 0.09	0.6 ± 0.25
Buried Surface Area	4,417.1 ± 74.1	4,225.6 ± 40.0
Z-Score	0	0

DOI: 10.7717/peerj.9541/table-3

Note:

Both Cov-I-Vac and Cov-II-Vac show a strong binding affinity with Immune receptopr TLR8.

For an in depth analysis of protein–protein interactions, PDBSum analysis revealed that a total of 36 residues of Cov-I-Vac interact with 43 residues of TLR8, with one salt bridge, 15 hydrogen bonds and 210 non bonded interactions. The detail of the interaction is illustrated in Fig. 6 and atomic level interaction is provided in Table S4. Similarly, 36 residues of Cov-II-Vac interacted with 41 residues of TLR8, prominent among them were residues of multi-epitopes of 12 hydrogen bonds and 168 non bonded contacts. The interacting residues of the complex are illustrated in Fig. 7 while distances of detailed atomic interaction are provided in Table S5.

Figure 6: Protein-protein interaction of Cov-I-Vac and TLR8.
(A) Refined complex of Cov-I-Vac interacting with TLR8. (B) Atomic level interaction showing Hydrogen bond and non-bonded contacts between residues of Cov-I-Vac and TLR8. (C) List of all the residues of both interacting proteins.

Download full-size image

DOI: 10.7717/peerj.9541/fig-6

Figure 7: Protein-protein interaction of Cov-II-Vac and TLR8.
(A) Refined complex of Cov-II-Vac interacting with TLR8. (B) Atomic level interaction showing Hydrogen bond and non-bonded contacts between residues of Cov-II-Vac and TLR8. (C) List of all the residues of both interacting proteins.

Download full-size image

DOI: 10.7717/peerj.9541/fig-7

NMA analysis

NMA analysis of both multi-epitopes showed that Cov-I-Vac has higher stability than Cov-II-Vac, as the Eigenvalue, the energy required to deform the structure, was higher for Cov-I-Vas as shown in Fig. 8.

Figure 8: Normal Mode analysis of designed multi epitopes.
(A) Eigen score of Cov-I-Vac and TLR8 complex. (B) Eigen score of Cov-II-Vac and TLR-8 complex.

Download full-size image

DOI: 10.7717/peerj.9541/fig-8

Codon optimization and E. coli expression

Codon optimization results of both vaccines Cov-I-Vac and Cov-II-Vac (with GC content 58.99% and 58.49%, respectively) were within the optimum limits (Pandey, Bhatt & Prajapati, 2018). The CAI value was predicted to be 1.0, which projects high level expression of our designed vaccine constructs in E. coli K12 strain. The total length of the Cov-I-Vac clone was 3.6 kbp while Cov- II-Vac clone was 3.8 kbp. Both sequences were designed to be added to the plasmid between restriction sites AfiIII-pciI and BspQI-sapI (Fig. 9).

Figure 9: In silico cloned multi epitopes in plasmid pUC19.
The CAI values indicate that both multi-epitopes have high expression in *E. coli* expression system. (A) Cov-I- vac cloned in pUC19. (B) Cov-II-Vac cloned in pUC19.

Download full-size image

DOI: 10.7717/peerj.9541/fig-9

Discussion

The COVID-19 pandemic has affected 213 countries around the world with almost 6.3 million patients and more than 378 thousand deaths (Worldometersinfo, 2020). The current situation urges scientists all over the world to find an urgent solution to stop this pandemic and develop effective therapeutics (WHO, 2020). To ensure viral clearance, cell mediated and humoral responses must be induced by the action of CD8 and CD4 T cells (Ikram et al., 2018). The reliability of such responses has enabled the use immunoinformatics approaches for vaccine development against viral diseases. The present study encompassing 475 complete genomes from all around the world suggests that while there is emerging variation in SARS-CoV-2, the rate with which this variation arises is quite slow (Ceraolo & Giorgi, 2020a). The consensus sequence generated from the alignment of all complete genomes was found to be 99% identical to the reference sequence reported from Wuhan.

The phylogenetic analysis of 475 genomes of SARS-CoV-2 reveals that genome sequences reported from China belong to clades containing those from almost all of the countries of the world and covering the whole span of a phylogenetic tree. The relatedness of SARS-Cov-2 strains was expected considering that the outbreak originated from China (Nezafat et al., 2016; Mackenzie & Smith, 2020; Wu et al., 2020a).

The present study was mainly conducted to propose multi-epitope based vaccines against SARS-CoV-2 by an immunoinformatics approach similar to other studies (Tomar & De, 2010; Dar et al., 2016; Ikram et al., 2018). These vaccines are advantageous compared to other monovalent vaccines because they provide superior protection by eliciting strong humoral and cell mediated immunity (Amanna & Slifka, 2011). Three subunit vaccines have been previously reported that were based on one, three, and 10 proteins (Abdelmageed et al., 2020; ul Qamar et al., 2020).

In recent years, epitope-based vaccines, developed via immunoinformatics approach, have gained global recognition as an alternate therapeutic strategy because of the various advantages they possess. A multi-epitope vaccine was designed against MERS, a virus belonging to the same family of viruses as SARS-CoV-2, using in silico tools (Srivastava et al., 2018). Another multi-epitope peptide vaccine was designed against brucellosis which elicited strong T cell responses (Saadi, Karkhah & Nouri, 2017). Shey et al. (2019) reported a similar approach used for prophylactic/therapeutic multi-epitope vaccine design for Onchocerciasis caused by microflaria. Promising results were shown in mice models by a multi-epitope vaccine developed by Lin et al. (2016) against Epstein-Barr virus, designed using bioinformatics tools. These studies highlight the importance of multi-epitope vaccine design to combat infections, and validate the methodology adopted in our study.

The vaccine designed in our study used all predicted ORFs and therefore covers the complete proteome. Epitopes selected were 13 CTL based and 10 HTL based that elicit cell mediated and humoral immune responses, respectively. Therefore, our subunit vaccines are projected to be effective and immunogenic against SARS-Cov-2. Coding sequences of the proteins of SARS-CoV-2 from NCBI were evaluated for their antigenicity. Highly antigenic sequences were used to predict B and T cell epitopes, the key step in vaccine development. Antigenic T cell epitopes were screened to overlap with B-cell and IFN-gamma epitopes (Van Regenmortel, 1996). HTL epitopes overlapping with B cell epitopes were further proceeded for multi-epitope construction. HTL epitopes overlapped with B-cell epitopes have been used in vaccine design studies to generate humoral response (Lehtinen et al., 1995; Dar et al., 2019). Cellular immunity mediated by CTL and humoral immunity mediated by HTL epitopes can aid viral clearance from body. Subunit vaccines proposed against HIV using HTL and CTL epitopes are under clinical trials (Newman et al., 2002).

The multi epitope vaccines, proposed in this study, were designed with the help of linkers; epitopes were linked with GGGGS (CTL based) and GPGPG (HTL based) linkers. Linkers were added to help maintain the proper functioning of each epitope after being imported to the human body independently (Pandey, Bhatt & Prajapati, 2018). GGGGS linker was added between CTL epitopes to ensure flexibility as well as solubility and pH stability (Chen et al., 2017). Addition of the GPGPG linker induces strong HTL responses as it not only facilitates the processing and presentation of epitopes but also prevents the formation of junctional epitopes in order to maintain immunogenicity (Ikram et al., 2018; Wu et al., 2020b).

An adjuvant, β-Defensin was added to the N-terminus of Cov-I-Vac to make it Cov-II-Vac, using EAAAK linker, in order to increase the immunogenicity of the vaccine. N-terminus amino acid plays a role in ubiquitin mediated degradation (Wilkins et al., 1999). The EAAAK linker effectively separates the functional domains and decreases the chances of interaction among them (Arai et al., 2001). It also increases the bioactivity and expression of epitopes (Bai & Shen, 2006; Amet, Lee & Shen, 2009).

β-defensins are effector antimicrobial peptides having antimicrobial activity including antiviral, antibacterial and antifungal activity (Lehrer & Lu, 2012). They recruits immature dendritic cells and naïve T-cell at the site of an infection (Allaker, 2008) and use both Th1 and Th2 dependent pathways to boost the synthesis of antigen-specific immunoglobulins (Tani et al., 2000). Kim et al. (2018) demonstrated that the use of human β-defensins can initiate both innate and adaptive immune response against the conjugated antigen. Several studies suggest that β-defensins efficiently induce a prolonged humoral as well as cellular immune response against a pathogen (Allaker, 2008) which makes them a potent antiviral vaccine adjuvant.

The two multi-epitopes Cov-I-Vac and Cov-II-Vac are 317 and 367 amino acids long respectively with a molecular mass of 31.6 kDa and 37.2 kDa, respectively. Both predicted proteins are basic according to theoretical pI values. The instability index and aliphatic index scores calculated using in silico analysis indicated that the vaccine proteins can prove to be stable and thermostable in a test tube, if tested experimentally. The GRAVY score for our vaccines was positive and suggests that both are hydrophobic, which may necessitate the use of micelles for better interaction of the vaccine protein within the polar environment of the body (Pandey et al., 2018). The efficacy of subunit vaccines can be improved by addition of nanoparticles-based antigen delivery system as they have smaller size and an ability to deliver the antigens to antigen presenting cells in lymph nodes. Self-assembled micelles, in this regard, are adjuvant of choice for amphiphilic molecules. Historically, these systems have been used to encapsulate hydrophobic drugs in micelle core as reviewed by Trimaille and Verrier (Trimaille & Verrier, 2015).

Cov-I-Vac and Cov-II-Vac were found to be antigenic, immunogenic, non-allergenic and strong MHC binders. These factors suggest that the multi-epitopic vaccines will have a robust immune response without allergic reaction.

Molecular Dynamic Simulation studies are mainly conducted to check the stability of vaccine by simulating it in a cellular environment (Ikram et al., 2018). However, these simulation studies take longer time so fast, dynamic simulation, CABS-flex analyses were conducted in this study that give similar results like MD simulations performed at 10ns. Moreover, the RMSF obtained from CABS-flex analysis is comparable to the RMSF of NMR ensembles (Jamroz, Kolinski & Kmiecik, 2014).

The efficacy of a multi-epitope vaccine depends on the population for which the vaccine is prepared. The human CTL and HTL based T cell epitopes included in our multi-epitope vaccines were 99.29% similar to those of the world population, indicating that it holds great potential to be an effective approach to combat the ongoing pandemic. In order to induce effective immunological responses in the body, the strong binding of the vaccine products with immune receptors is necessary. The molecular docking analysis of our designed multi-epitope based vaccine showed significant interaction with an immune receptor, TLR8, further predicting the efficacy of both vaccines. However, the vaccine Cov-I-Vac showed a stronger interaction with TLR8. The docking scores indicate that the epitopes predicted in this study have higher binding affinity with TLR8 and thus, have the ability to elicit an effective immune response. TLR8 recognizes the single stranded RNA and acts as a major switch to elicit protective immune responses. It initiates antiviral responses via MyD88-dependent pathway leading to the production of type I and III interferons and other pro-inflammatory cytokines via NF-κB activation (such as IL-1, TNF-α, IL-6) (Lester & Li, 2014; Prompetchara, Ketloy & Palaga, 2020). These immune responses are otherwise suppressed in case of novel SARS-CoV-2 infection, therefore, interaction of predicted vaccine with TLR8 would produce a diverse set of immune mediators leading to initiation of a protective immune response and result in viral clearance, ideally (Felsenstein et al., 2020).

The CAI value after in silico cloning in vector suggests that translational efficacy, mRNA codons, of both multi-epitope vaccines is compatible with the host system. Moreover, it also indicates that synthesis of both vaccines is experimentally possible with higher expression within the E. coli K-12 system. The advantages of using pUC19 plasmid for cloning of Cov-I-Vac and Cov-II-Vac are high copy number, smaller sized plasmid and color differentiation upon over expression (color of recombinant colonies appears white while non-recombinant ones appear blue) (Yanisch-Perron, Vieira & Messing, 1985). The in silico cloned multi-epitopes can be used for experimental validation of these results. In silico cloning of vaccines in vectors has also been performed in studies suggested by Dar et al. (2019).

We believe that immunoinformatics approaches are useful to design effective vaccine candidates that when tested in experimental studies yield proposed outcomes (Ikram et al., 2018). All the ORFs of SARS-CoV-2 were subjected to epitope mapping; however, it is noteworthy to mention that no epitope from 3′ UTRs had strong binding affinity with HLA super family alleles. Therefore, no epitopes from this region are part of either CovI-Vac and CovII-Vac vaccines. However, the multi-epitopes do contain strong HLA superfamily allele binding regions from structural as well as non-structural proteins. These structural, as well as non-structural proteins, play an important part in viral assembly and attachment and specific immune responses against them are required to combat the infection (Lorente et al., 2016; Astuti, 2020). Despite the numerous benefits associated with this computational study, experimental validation is required to verify these results.

Conclusions

The study was aimed to predict a vaccine against SARS-CoV-2 using immunoinformatics approaches. Two multi-epitopes were designed, one without adjuvant (Cov-I-Vac) and the other with β-defensin adjuvant (Cov-II-Vac). The in silico designed multi-epitopes Cov-I-Vac and Cov-II-Vac are predicted to be antigenic, immunogenic, thermostable and safe for human applications. The multi-epitopes cover the predicted proteome of SARS-CoV-2 (Fig. 2). Cov-I-Vac demonstrates better interaction with TLR8 and more energy is required to deform the structure as compared to Cov-II-Vac. Future prospects of the study involve experimental validation of these results.

Supplemental Information

Detail of scruitiny of CD8 T cell epitopes.

DOI: 10.7717/peerj.9541/supp-1

Download

Interacting residues of Cov-II-Vac with TLR8 describing atomic level interactions.

DOI: 10.7717/peerj.9541/supp-2

Download

Active and Passive residues of Cov-I-Vac, Cov-II-Vac and TLR8.

DOI: 10.7717/peerj.9541/supp-3

Download

Interacting residues of Cov-I-Vac with TLR8 describing atomic level interactions.

DOI: 10.7717/peerj.9541/supp-4

Download

Interacting residues of Cov-II-Vac with TLR8 describing atomic level interactions.

DOI: 10.7717/peerj.9541/supp-5

Download

[1] Abdelmageed MI, Abdelmoneim AH, Mustafa MI, Elfadol NM, Murshed NS, Shantier SW, Makhawi AM. 2020. Design of multi-epitope-based peptide vaccine against E protein of human 2019-nCoV: an immunoinformatics approach. bioRxiv

[2] Allaker RP. 2008. Host defence peptides—a bridge between the innate and adaptive immune responses. Transactions of the Royal Society of Tropical Medicine and Hygiene 102(1):3-4

[3] Amanna IJ, Carlson NE, Slifka MK. 2007. Duration of humoral immunity to common viral and vaccine antigens. New England Journal of Medicine 357(19):1903-1915

[4] Amanna IJ, Slifka MK. 2011. Contributions of humoral and cellular immunity to vaccine-induced protection in humans. Virology 411(2):206-215

[5] Amet N, Lee HF, Shen WC. 2009. Insertion of the designed helical linker led to increased expression of tf-based fusion proteins. Pharmaceutical Research 26(3):523-528

[6] Andreatta M, Nielsen M. 2016. Gapped sequence alignment using artificial neural networks: application to the MHC class I system. Bioinformatics 32(4):511-517

[7] Arai R, Ueda H, Kitayama A, Kamiya N, Nagamune T. 2001. Design of the linkers which effectively separate domains of a bifunctional fusion protein. Protein Engineering, Design and Selection 14(8):529-532

[8] Astuti I. 2020. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2): an overview of viral structure and host response. Diabetes & Metabolic Syndrome: Clinical Research & Reviews 14(4):407-412

[9] Bai Y, Shen WC. 2006. Improving the oral efficacy of recombinant granulocyte colony-stimulating factor and transferrin fusion protein by spacer optimization. Pharmaceutical Research 23(9):2116-2121

[10] Barrett S, Ryan E, Crowe J. 1999. Association of the HLA-DRB1* 01 allele with spontaneous viral clearance in an Irish cohort infected with hepatitis C virus via contaminated anti-D immunoglobulin. Journal of Hepatology 30(6):979-983

[11] Bhasin M, Raghava GPS. 2004. Prediction of CTL epitopes using QM, SVM and ANN techniques. Vaccine 22(23–24):3195-3204

[12] Bui HH, Sidney J, Dinh K, Southwood S, Newman MJ, Sette A. 2006. Predicting population coverage of T-cell epitope-based diagnostics and vaccines. BMC Bioinformatics 7(1):153

[13] Calis JJA, Maybeno M, Greenbaum JA, Weiskopf D, De Silva AD, Sette A, Keşmir C, Peters B. 2013. Properties of MHC class I presented peptides that enhance immunogenicity. PLOS Computational Biology 9(10):e1003266

[14] Ceraolo C, Giorgi FM. 2020a. Genomic variance of the 2019-nCoV coronavirus. Journal of Medical Virology 92(5):522-528

[15] Ceraolo C, Giorgi FM. 2020b. Phylogenomic analysis of the 2019-nCoV coronavirus. bioRxiv

[16] Chan KW, Wong VT, Tang SCW. 2020. COVID-19: an update on the epidemiological, clinical, preventive and therapeutic evidence and guidelines of integrative Chinese–Western medicine for the management of 2019 novel coronavirus disease. American Journal of Chinese Medicine 48(3):737-762

[17] Chen H, Chen Z, Wu B, Ullah J, Zhang T, Jia J, Wang H, Tan T. 2017. Influences of various peptide linkers on the Thermotoga maritima MSB8 nitrilase displayed on the spore surface of Bacillus subtilis. Journal of Molecular Microbiology and Biotechnology 27(1):64-71

[18] Chen X, Zaro JL, Shen WC. 2013. Fusion protein linkers: property, design and functionality. Advanced Drug Delivery Reviews 65(10):1357-1369

[19] Cheng J, Randall AZ, Sweredoski MJ, Baldi P. 2005. SCRATCH: a protein structure and structural feature prediction server. Nucleic Acids Research 33(Web Server):W72-W76

[20] Dar H, Zaheer T, Rehman MT, Ali A, Javed A, Khan GA, Babar MM, Waheed Y. 2016. Prediction of promiscuous T-cell epitopes in the Zika virus polyprotein: an in silico approach. Asian Pacific Journal of Tropical Medicine 9(9):844-850

[21] Dar HA, Zaheer T, Shehroz M, Ullah N, Naz K, Muhammad SA, Zhang T, Ali A. 2019. Immunoinformatics-aided design and evaluation of a potential multi-epitope vaccine against klebsiella pneumoniae. Vaccines (Basel) 7(3):88

[22] De Vries SJ, Bonvin AM. 2011. CPORT: a consensus interface predictor and its performance in prediction-driven docking with HADDOCK. PLOS ONE 6(3):e17695

[23] Dhanda SK, Vir P, Raghava GPS. 2013. Designing of interferon-gamma inducing MHC class-II binders. Biology Direct 8(1):30

[24] Dimitrov I, Naneva L, Doytchinova I, Bangov I. 2014. AllergenFP: allergenicity prediction by descriptor fingerprints. Bioinformatics 30(6):846-851

[25] Doytchinova IA, Flower DR. 2007. VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinformatics 8(1):4

[26] Felsenstein S, Herbert JA, McNamara PS, Hedrich CM. 2020. COVID-19: Immunology and treatment options. Clinical Immunology 215:108448

[27] Frahm N, Yusim K, Suscovich TJ, Adams S, Sidney J, Hraber P, Hewitt HS, Linde CH, Kavanagh DG, Woodberry T, Henry LM, Faircloth K, Listgarten J, Kadie C, Jojic N, Sango K, Brown N V, Pae E, Zaman MT, Bihl F, Khatri A, John M, Mallal S, Marincola FM, Walker BD, Sette A, Heckerman D, Korber BT, Brander C. 2007. Extensive HLA class I allele promiscuity among viral CTL epitopes. European Journal of Immunology 37(9):2419-2433

[28] Gasteiger E, Hoogland C, Gattiker A, Wilkins MR, Appel RD, Bairoch A. 2005. Protein identification and analysis tools in the ExPASy server. In: Walker JM, ed. The Proteomics Protocols Handbook. New York: Humana Press. 571-607

[29] Greenwood B. 2014. The contribution of vaccination to global health: past, present and future. Philosophical Transactions of the Royal Society B: Biological Sciences 369(1645):20130433

[30] Grote A, Hiller K, Scheer M, Münch R, Nörtemann B, Hempel DC, Jahn D. 2005. JCat: a novel tool to adapt codon usage of a target gene to its potential expression host. Nucleic Acids Research 33(Web Server):W526-W531

[31] Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, Zhang L, Fan G, Xu J, Gu X, Cheng Z, Yu T, Xia J, Wei Y, Wu W, Xie X, Yin W, Li H, Liu M, Xiao Y, Gao H, Guo L, Xie J, Wang G, Jiang R, Gao Z, Jin Q, Wang J, Cao B. 2020. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395(10223):497-506

[32] Ikram A, Zaheer T, Awan FM, Obaid A, Naz A, Hanif R, Paracha RZ, Ali A, Naveed AK, Janjua HA. 2018. Exploring NS3/4A, NS5A and NS5B proteins to design conserved subunit multi-epitope vaccine against HCV utilizing immunoinformatics approaches. Scientific Reports 8(1):1-14

[33] Jamroz M, Kolinski A, Kmiecik S. 2014. CABS-flex predictions of protein flexibility compared with NMR ensembles. Bioinformatics 30(15):2150-2154

[34] Katoh K, Misawa K, Kuma K, Miyata T. 2002. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research 30(14):3059-3066

[35] Keicho N, Itoyama S, Kashiwase K, Phi NC, Long HT, Van Ban V, Hoa BK, Le Hang NT, Hijikata M, Sakurada S, Satake M. 2009. Association of human leukocyte antigen class II alleles with severe acute respiratory syndrome in the Vietnamese population. Human Immunology 70(7):527-531

[36] Kim J, Yang YL, Jang SH, Jang YS. 2018. Human β-defensin 2 plays a regulatory role in innate antiviral immunity and is capable of potentiating the induction of antigen-specific immunity. Virology Journal 15(1):124

[37] Ko J, Park H, Heo L, Seok C. 2012. GalaxyWEB server for protein structure prediction and refinement. Nucleic Acids Research 40(W1):W294-W297

[38] Kremer JR, Mastronarde DN, McIntosh JR. 1996. Computer visualization of three-dimensional image data using IMOD. Journal of Structural Biology 116(1):71-76

[39] Kuriata A, Gierut AM, Oleniecki T, Ciemny MP, Kolinski A, Kurcinski M, Kmiecik S. 2018. CABS-flex 2.0: a web server for fast simulations of flexibility of protein structures. Nucleic Acids Research 46(W1):W338-W343

[40] Laskowski RA, Jabłońska J, Pravda L, Vařeková RS, Thornton JM. 2018. PDBsum: structural summaries of PDB entries. Protein Science 27(1):129-134

[41] Lehrer RI, Lu W. 2012. α-Defensins in human innate immunity. Immunological Reviews 245(1):84-112

[42] Lehtinen M, Hibma MH, Stellato G, Kuoppala T, Paavonen J. 1995. Human T helper cell epitopes overlap B cell and putative cytotoxic T cell epitopes in the E2 protein of human papillomavirus type 16. Biochemical and Biophysical Research Communications 209(2):541-546

[43] Lester SN, Li K. 2014. Toll-like receptors in antiviral innate immunity. Journal of Molecular Biology 426(6):1246-1264

[44] Lin X, Chen S, Xue X, Lu L, Zhu S, Li W, Chen X, Zhong X, Jiang P, Sename TS, Zheng Y. 2016. Chimerically fused antigen rich of overlapped epitopes from latent membrane protein 2 (LMP2) of Epstein–Barr virus as a potential vaccine and diagnostic agent. Cellular & Molecular Immunology 13(4):492-501

[45] Lorente E, Barriga A, Barnea E, Mir C, Gebe JA, Admon A, López D. 2016. Structural and nonstructural viral proteins are targets of T-helper immune response against human respiratory syncytial virus. Molecular and Cellular Proteomics 15(6):2141-2151

[46] Mackenzie JS, Smith DW. 2020. COVID-19: a novel zoonotic disease caused by a coronavirus from China: what we know and what we don’t. Microbiology Australia 41(1):45-50

[47] Magnan CN, Zeller M, Kayala MA, Vigil A, Randall A, Felgner PL, Baldi P. 2010. High-throughput prediction of protein antigenicity using protein microarray data. Bioinformatics 26(23):2936-2943

[48] Newman MJ, Livingston B, McKinney DM, Chesnut RW, Sette A, Subsets-immunology TL. 2002. T-lymphocyte epitope identification and their use in vaccine development for HIV-1. Frontiers in Bioscience 7(4):d1503-d1515

[49] Nezafat N, Karimi Z, Eslami M, Mohkam M, Zandian S, Ghasemi Y. 2016. Designing an efficient multi-epitope peptide vaccine against Vibrio cholerae via combined immunoinformatics and protein interaction based approaches. Computational Biology and Chemistry 62:82-95

[50] Nielsen M, Lundegaard C, Worning P, Lauemøller SL, Lamberth K, Buus S, Brunak S, Lund O. 2003. Reliable prediction of T-cell epitopes using neural networks with novel sequence representations. Protein Science 12(5):1007-1017

[51] Okonechnikov K, Golosova O, Fursov M, team U. 2012. Unipro UGENE: a unified bioinformatics toolkit. Bioinformatics 28(8):1166-1167

[52] Pandey RK, Ali M, Ojha R, Bhatt TK, Prajapati VK. 2018. Development of multi-epitope driven subunit vaccine in secretory and membrane protein of Plasmodium falciparum to convey protection against malaria infection. Vaccine 36(30):4555-4565

[53] Pandey RK, Bhatt TK, Prajapati VK. 2018. Novel immunoinformatics approaches to design multi-epitope subunit vaccine for malaria by investigating anopheles salivary protein. Scientific Reports 8(1):1-11

[54] Prompetchara E, Ketloy C, Palaga T. 2020. Immune responses in COVID-19 and potential vaccines: lessons learned from SARS and MERS epidemic. Asian Pacific Journal of Allergy and Immunology 38(1):1-9

[55] Rappuoli R. 2001. Reverse vaccinology, a genome-based approach to vaccine development. Vaccine 19(17–19):2688-2691

[56] Saadi M, Karkhah A, Nouri HR. 2017. Development of a multi-epitope peptide vaccine inducing robust T cell responses against brucellosis using immunoinformatics based approaches. Infection, Genetics and Evolution 51:227-234

[57] Saha S, Raghava GPS. 2006a. AlgPred: prediction of allergenic proteins and mapping of IgE epitopes. Nucleic Acids Research 34(Web Server):W202-W209

[58] Saha S, Raghava GPS. 2006b. Prediction of continuous B-cell epitopes in an antigen using recurrent neural network. Proteins: Structure, Function, and Bioinformatics 65(1):40-48

[59] Sahin AR. 2020. 2019 novel coronavirus (COVID-19) outbreak: a review of the current literature. Eurasian Journal of Medicine and Oncology 4(1):1-7

[60] Sallusto F, Geginat J, Lanzavecchia A. 2004. Central memory and effector memory T cell subsets: function, generation, and maintenance. Annual Review of Immunology 22(1):745-763

[61] Shey RA, Ghogomu SM, Esoh KK, Nebangwa ND, Shintouo CM, Nongley NF, Asa BF, Ngale FN, Vanhamme L, Souopgui J. 2019. In-silico design of a multi-epitope vaccine candidate against onchocerciasis and related filarial diseases. Scientific Reports 9(1):1-18

[62] Sidney J, Peters B, Frahm N, Brander C, Sette A. 2008. HLA class I supertypes: a revised and updated classification. BMC Immunology 9(1):1

[63] Singh H, Raghava GPS. 2003. ProPred1: prediction of promiscuous MHC class-I binding sites. Bioinformatics 19(8):1009-1014

[64] Smith-Garvin JE, Koretzky GA, Jordan MS. 2009. T cell activation. Annual Review of Immunology 27(1):591-619

[65] Srivastava S, Kamthania M, Singh S, Saxena AK, Sharma N. 2018. Structural basis of development of multi-epitope vaccine against middle east respiratory syndrome using in silico approach. Infection and Drug Resistance 11:2377-2391

[66] Tani K, Murphy WJ, Chertov O, Salcedo R, Koh CY, Utsunomiya I, Funakoshi S, Asai O, Herrmann SH, Wang JM, Kwak LW, Oppenheim JJ. 2000. Defensins act as potent adjuvants that promote cellular and humoral immune responses in mice to a lymphoma idiotype and carrier antigens. International Immunology 12(5):691-700

[67] Tomar N, De RK. 2010. Immunoinformatics: an integrated scenario. Immunology 131(2):153-168

[68] Trimaille T, Verrier B. 2015. Micelle-based adjuvants for subunit vaccine delivery. Vaccines 3(4):803-813

[69] ul Qamar MT, Shahid F, Ashfaq UA, Aslam S, Fatima I, Fareed MM, Zohaib A, Chen LL. 2020. Structural modeling and conserved epitopes prediction against SARS-COV-2 structural proteins for vaccine development. Research Square [preprint]

[70] Van Regenmortel MHV. 1996. Mapping epitope structure and activity: from one-dimensional prediction to four-dimensional description of antigenic specificity. Methods: A Companion to Methods in Enzymology 9(3):465-472

[71] Van Zundert GCP, Rodrigues JPGLM, Trellet M, Schmitz C, Kastritis PL, Karaca E, Melquiond ASJ, van Dijk M, de Vries SJ, Bonvin AMJJ. 2016. The HADDOCK2.2 web server: user-friendly integrative modeling of biomolecular complexes. Journal of Molecular Biology 428(4):720-725

[72] Wilkins MR, Gasteiger E, Bairoch A, Sanchez JC, Williams KL, Appel RD, Hochstrasser DF. 1999. Protein identification and analysis tools in the ExPASy server. Methods in Molecular Biology 112:531-552

[73] WHO. 2020. Coronavirus disease (COVID-19) pandemic.

[74] Worldometersinfo. 2020. Coronavirus cases.