The ORF1ab regions of 15 COVID‐2019 sequences have been downloaded from the GISAID (https://www.gisaid.org/) and GenBank (http://www.ncbi.nlm.nih.gov/genbank/) databanks. A dataset has been built by adding five sequences of the severe acute respiratory syndrome (SARS) virus and five sequences of Bat SARS‐like virus sharing the highest sequence similarity with the COVID‐2019 sequence (Table 1). The pairwise percentage of similarity has been calculated using the Basic Local Alignment Search Tool (https://blast.ncbi.nlm.nih.gov/Blast.cgi); duplicated sequences have been removed from the dataset. The 25 sequences have been aligned using the multiple alignment using fast Fourier transform (MAFFT) online tool4 and manually edited using Bioedit program v7.0.5.5
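The duplicate removal and pairwise similarity steps can be illustrated with a minimal sketch. It assumes the sequences are already aligned to equal length and computes a simple global percent identity over ungapped columns, which is a simplification and not the local alignment score that BLAST reports. The sequence names and fragments are hypothetical toy data, not the ORF1ab dataset.

```python
from itertools import combinations


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def drop_duplicates(records: dict) -> dict:
    """Keep the first record for each distinct ungapped sequence."""
    seen = set()
    unique = {}
    for name, seq in records.items():
        key = seq.replace("-", "").upper()
        if key not in seen:
            seen.add(key)
            unique[name] = seq
    return unique


# Toy aligned fragments with hypothetical names (assumption, not real data).
toy = {
    "covid_a": "ATGGAGAGCC",
    "covid_b": "ATGGAGAGCC",  # exact duplicate of covid_a
    "bat_like": "ATGGAGAGTC",
    "sars": "ATGGA-AGTT",
}
unique = drop_duplicates(toy)
pairwise = {
    (a, b): percent_identity(unique[a], unique[b])
    for a, b in combinations(unique, 2)
}
for pair, identity in pairwise.items():
    print(pair, round(identity, 2))
```

Removing duplicates before the pairwise comparison keeps identical sequences from being counted more than once in the similarity ranking.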
2.2 Selective pressure analysis
The selective pressure analysis was focused on the polyprotein ORF1ab because it differs from the most similar bat Coronavirus (QHR63299) by only 103 amino acid residues, 64 of which are conservative changes. In particular, non‐structural protein 2 (nsp2) differs from the bat Coronavirus by 11 residues, while nsp3 differs by 64 residues, of which 44 are conservative changes.
Adaptive Evolution Server (http://www.datamonkey.org/) was used to find possible sites under positive or negative selection pressure. For this purpose the following test has been used: fast‐unconstrained Bayesian approximation (FUBAR).6 This analysis made it possible to infer the site‐specific pervasive selection and the episodic diversifying selection across the region of interest, and to identify episodic selection at individual sites.7 Statistically significant positive or negative selection was based on P < .05.
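Counting residue differences between two aligned proteins and separating conservative from non‐conservative changes can be sketched as follows. The physicochemical grouping used here is an assumption made for illustration; the text does not state which definition of "conservative" was applied, and other groupings or substitution matrices would give different counts. The two short peptides are toy data.

```python
# Hypothetical physicochemical groups (assumption, not taken from the study).
GROUPS = ("AVLIM", "FWY", "STNQ", "KRH", "DE", "G", "P", "C")
GROUP_OF = {aa: index for index, members in enumerate(GROUPS) for aa in members}


def classify_differences(ref: str, query: str) -> list:
    """Return (position, ref_aa, query_aa, is_conservative) for each differing column.

    Positions are 1-based. Columns containing a gap are skipped.
    """
    if len(ref) != len(query):
        raise ValueError("sequences must be aligned to the same length")
    differences = []
    for position, (r, q) in enumerate(zip(ref.upper(), query.upper()), start=1):
        if r == q or r == "-" or q == "-":
            continue
        group_r = GROUP_OF.get(r)
        group_q = GROUP_OF.get(q)
        conservative = group_r is not None and group_r == group_q
        differences.append((position, r, q, conservative))
    return differences


# Toy peptides (assumption): four differences, three within the same group.
diffs = classify_differences("MKTLVDQ", "MRTIVEA")
total = len(diffs)
conservative_count = sum(1 for d in diffs if d[3])
print(total, "differences,", conservative_count, "conservative")
```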
2.3 Structural modelling
Homology modelling has been attempted with SwissModel8 and HHPred9 servers. Models for ORF1ab nsp2 and nsp3 proteins available at the I‐Tasser web site (corresponding to codes QHD43415_2 and QHD43415_3)10 have been considered. PDB proteins structurally close to the target have been evaluated using the TM‐score11 while the RAMPAGE12 online tool has been used to assess the folding quality of the model.
To test for the presence of transmembrane helical segments in Coronavirus ORF1ab nsp2 and nsp3, TMHMM,13 MEMSAT,14 and MEMPACK15 online tools have been used. Three‐dimensional structures have been analyzed and displayed using PyMOL.16
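As an illustration of how structural closeness is scored, the TM‐score for one fixed superposition can be written as a short function. This is a minimal sketch: it assumes the distances between aligned residue pairs (in ångström) have already been obtained from a superposition, whereas the full TM‐score also searches for the superposition that maximizes the sum. The function names are hypothetical.

```python
def d0_for_length(target_length: int) -> float:
    """Length-dependent distance scale d0 = 1.24 * (L - 15)^(1/3) - 1.8."""
    if target_length <= 15:
        raise ValueError("target too short for the d0 formula")
    d0 = 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8
    if d0 <= 0:
        raise ValueError("target too short for the d0 formula")
    return d0


def tm_score(distances, target_length: int) -> float:
    """TM-score for one given superposition.

    distances: distances (in angstrom) between aligned residue pairs.
    target_length: number of residues in the target protein; residues
    without an aligned partner contribute zero to the sum.
    """
    d0 = d0_for_length(target_length)
    total = sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances)
    return total / target_length


# Toy values (assumption): a 100-residue target with all pairs at distance d0.
example = tm_score([d0_for_length(100)] * 100, 100)
print(round(example, 3))
```

The score lies between 0 and 1, and values above about 0.5 are commonly read as indicating the same overall fold.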
3.1 Selective pressure analysis
The FUBAR analysis performed on the ORF1ab region found potential sites under positive selective pressure (P < .05), in particular:
At amino acid position 501, COVID‐2019 has a glutamine (Q) residue, the Bat SARS‐like coronavirus has a threonine (T) residue and the SARS virus has an alanine (A) residue.
At position 723, COVID‐2019 has a serine (S) residue while the Bat SARS‐like virus and the SARS virus have a glycine (G) residue.
At amino acid position 1010, COVID‐2019 has a proline (P) residue, the Bat SARS‐like coronavirus has a histidine (H) residue and the SARS virus has an isoleucine (I) residue.
Significant (P < .05) pervasive negative selection at 2416 sites (55%) has been evidenced and confirmed by FUBAR analysis.
3.2 Structural modelling
To map the structural variability of the ORF1ab region of the virus and its sites under selection pressure, homology modelling has been attempted. Unfortunately, neither SwissModel nor HHPred found suitable templates for the amino acid region containing the sites under selective pressure. For that reason, the corresponding models available on the I‐Tasser web site have been used. Moreover, some regions of the nsp2 and nsp3 proteins structurally homologous to other known viral proteins have been identified through HHPred analysis and have been mapped within the ORF1ab nsp2 and nsp3 sequences (Figure 1).
The results of the analysis suggest the presence of a segment within the nsp2 and the nsp3 regions that has no evident homologous structures. In an attempt to characterize these regions structurally as far as possible, TMHMM, MEMSAT, and MEMPACK analyses have been carried out and have shown the presence of several potential transmembrane helices (Figure 1). In particular, four transmembrane helices were predicted by MEMSAT in nsp2 while six helices were predicted by MEMSAT and TMHMM in nsp3 (Figure 1).
Regarding the amino acids under positive selective pressure found by the FUBAR analysis: at position 501 (position 321 of the nsp2 protein), the Bat SARS‐like coronavirus has an apolar amino acid while SARS and COVID‐2019 have a polar amino acid. It can be speculated that, due to its side chain length, polarity, and potential to form H‐bonds, the glutamine amino acid (Q) may confer higher stability to the protein. The mutations fall within the nsp2 protein, in the region homologous to the endosome‐associated protein similar to that of the avian infectious bronchitis virus (PDB 3ld1), which plays a key role in viral pathogenicity (Figure 2). In the nsp2 structure model available at the I‐Tasser site, this position appears to be exposed to the solvent.
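The relation between polyprotein numbering and mature protein numbering quoted above (position 501 of ORF1ab corresponding to position 321 of nsp2) is a fixed offset. A minimal sketch of the conversion follows; the function names are hypothetical, and the nsp2 start is inferred only from that one pair of positions rather than taken from an annotation file.

```python
def infer_mature_start(polyprotein_position: int, mature_position: int) -> int:
    """First polyprotein residue of the mature protein, from one known pair."""
    return polyprotein_position - mature_position + 1


def polyprotein_to_mature(position: int, mature_start: int) -> int:
    """Convert a 1-based polyprotein position to 1-based mature protein numbering."""
    if position < mature_start:
        raise ValueError("position lies before the start of the mature protein")
    return position - mature_start + 1


# Inferred from the pair stated in the text: 501 (ORF1ab) <-> 321 (nsp2).
nsp2_start = infer_mature_start(501, 321)
print(nsp2_start, polyprotein_to_mature(501, nsp2_start))
```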
4 DISCUSSION
The ongoing COVID‐2019 epidemic is causing worldwide concern because of its high contagiousness. Since its first appearance in Wuhan, China, about 1 month ago, the virus has infected thousands of people, with the number of new cases growing rapidly every day. Because of this acceleration in human‐to‐human transmission in China, together with evident spread to other countries, the World Health Organization declared the epidemic a global health emergency.18, 19
Many questions remain open. The most frequent are how dangerous this virus can be and how much it differs from the SARS virus, whose epidemic alarmed the whole world some years ago. In this study some interesting findings have been evidenced that help to fill gaps in knowledge about the new COVID‐2019, which is still causing infection all over the world.20, 21
The positive selective pressure in this protein could account for some clinical features of this virus compared with SARS and Bat SARS‐like CoV.22
First, the study indicates which sites are most likely to undergo an amino acid change, providing insight into some important proteins of COVID‐2019 that are involved in the mechanisms of viral entry and viral replication. These data can contribute to a better understanding of how this virus exerts its pathogenicity.
Furthermore, to identify a potential molecular target it is fundamental to follow the molecular evolution of the virus, which suggests some sites of interest for potential therapy or vaccine development.
The structural similarity of the region in which the positive selective pressure falls, as well as the stabilizing mutation falling in the endosome‐associated‐protein‐like domain of the nsp2 protein, could explain why this virus is more contagious than SARS.
The destabilizing mutation occurring near the phosphatase domain of the nsp3 protein could suggest a potential mechanism differentiating COVID‐2019 from SARS.
The results of this study could fill some gaps in knowledge about COVID‐2019, especially at the present moment, when the epidemic is ongoing and the scientific community is trying to learn more about this new viral pathogen. During an epidemic, every effort must be made to strengthen the fight against the virus. This can be achieved by understanding the main drivers of the pathogen's emergence, spread, and ability to overcome human defenses.