Leta i den här bloggen

onsdag 4 mars 2020

Ajatuksia COVID-19 viruksen nsp2 ja nsp3 tekijästä ja niiden muutamista aminohappoeroista BSLV:n ja SARS-CoV:n vastaaviin nsp2 ja nsp3 proteiineihin 3

https://onlinelibrary.wiley.com/doi/full/10.1002/jmv.25719


 The ORF1ab of 15 COVID‐2019 sequences have been downloaded from GISAID (https://www.gisaid.org/) and GenBank (http://www.ncbi.nlm.nih.gov/genbank/) databanks. A dataset has been built using the five sequences of the severe acute respiratory syndrome (SARS) virus and five sequences from Bat SARS‐like virus sharing the highest sequence similarity to the COVID‐2019 sequence (Table 1). The pairwise percentage of similarity has been calculated using Basic Local Alignment Search Tool (https://blast.ncbi.nlm.nih.gov/Blast.cgi); duplicated sequences have been removed from the dataset. The 25 sequences have been aligned using a multiple sequence alignment multiple alignment using fast fourier transform online tool4 and manually edited using Bioedit program v7.0.5.5

2.2 Selective pressure analysis

The selective pressure analysis was focused on the polyprotein ORF1ab because it differs from the most similar bat Coronavirus (QHR63299) for only 103 amino acid residues, 64 of them are conservative changes. In particular, non structural protein 2 (nsp2) differs from bat Coronavirus for 11 residues while nsp3 for 64 residues of which 44 are conservative changes.
Adaptive Evolution Server (http://www.datamonkey.org/) was used to find eventual sites under of positive or negative selection pressure. At this purpose the following tests has been used: fast‐unconstrained Bayesian approximation (FUBAR).6 These tests allowed to infer the site‐specific pervasive selection, the episodic diversifying selection across the region of interest and to identify episodic selection at individual sites.7 Statistically significant positive or negative selection was based on P < .05.

2.3 Structural modelling

Homology modelling has been attempted with SwissModel8 and HHPred9 servers. Models for ORF1ab nsp2 and nsp3 proteins available at the I‐Tasser web site (corresponding to codes QHD43415_2 and QHD43415_3)10 have been considered. PDB Proteins structurally close to the target have been evaluated using the TM‐score11 while the RAMPAGE12 online tool has been used to assess the folding quality of the model.
To test for the presence of transmembrane helical segments in Coronavirus ORF1ab nsp2 and nsp3, TMHMM,13 MEMSAT,14 and MEMPACK15 online tools have been used. Three‐dimensional structures have been analyzed and displayed using PyMOL.16

3.1 Selective pressure analysis

Regarding the FUBAR analysis performed on the ORF1ab region, the presence of potential sites under positive selective pressure have been found (P < .05), in particular:

 on the amino acidic position 501 the COVID‐2019 has a glutamine (E) residue, the Bat SARS‐like coronavirus has a threonine (T) residue and the SARS virus has an alanine (A) residue.

At position 723 in the COVID‐2019 there is a serine (S) residue while the Bat SARS‐like virus and the SARS virus have a glycine (G) residue.

On the aminoacidic position 1010, the COVID‐2019 has a proline (P) residue, the Bat SARS‐like coronavirus has a histidine(H) residue and the SARS virus has an isoleucine (I) residue. Significant (P < .05) pervasive negative selection in 2416 sites (55%) has been evidenced and confirmed by FUBAR analysis.

3.2 Structural modelling

To map the structural variability of the ORF1ab region of the virus and its sites under selection pressure, homology modelling has been attempted. Unfortunately, neither SwissModel nor HHPred found suitable templates for the amino acid region containing the sites under selective pressure. For that reason, the corresponding models available on the I‐Tasser web site has been used. Moreover, some regions of the nsp2 and nsp3 proteins structurally homologous to other known viral proteins have been identified through HHpred analysis and have been mapped within the ORF1ab nsp2 and nsp3 sequences (Figure 1).

 image
A, HHpred mapping of the homologous structures onto the ORF1ab sequence shown as a blue line on the top of the panel. Numbering above the line refers to the entire ORF. Red and Blue strips represent the PDB homologous structures and the nsp2 and nsp3 sequences, respectively. PDB codes are reported within the corresponding red stripes. Numbering below the blue line is relative to each single nsp. Orange lines indicate approximately the positions of the transmembrane helices predicted by MEMSAT. Label refers to panel B. B, diagram of the topology of predicted transmembrane helices. Number refer to the corresponding nsp sequences. nsp, non structural protein; ORF, open reading frame
 

The results of the analysis suggest the presence of a segment within the nsp2 and the nsp3 regions that has no evident homologous structures. In an attempt to structurally characterize as far as possible these regions, TMHMM, MEMSAT, and MEMPACK analyses have been utilized and have shown the presence of several potential trans‐membrane helices (Figure 1). In particular, our transmembrane helices were predicted by MEMSAT in nsp2 while six helices were predicted by MEMSTA and TMHMM in nsp3 (Figure 1).

Referring to the amino acids under positive selective pressure found using the FUBAR analysis: the amino acid in position 501 (position 321 of the nsp2protein), the corresponding site in the Bat SARS‐like coronavirus has an apolar amino acid while the SARS and COVID‐2019 has a polar amino acid. It can be speculated, that due to its side chain length, polarity, and potential to form H‐bonds the glutamine amino acid(Q) may confer higher stability to the protein. The mutations fall within the protein nsp2 on the region homologous to the endosome‐associated protein similar to the avian infectious bronchitis virus (PDB 3ld1) that plays a key‐role in the viral pathogenicity. (Figure 2) In the nsp2 structure model available at the I‐Tasser site, this position appears to be exposed to the solvent.

Regarding the amino acid in position 1010 (corresponding to position 192 of the nsp3 protein), the homologous region of the Bat SARS‐like coronavirus and SARS virus have a polar(H) and an apolar amino acid (I), respectively, while the COVID‐2019 has proline (P). In this case, it may be speculated that due to the steric bulge and stiffness of the proline, the molecular structure of the COVID‐2019 may undergo a local conformation perturbation compared with the proteins of the other two viruses. In Nsp3, the mutation falls near the protein similar to a phosphatase present also in the SARS coronavirus (PDB code 2acf) playing a key‐role in the replication process of the virus in infected cells17 (Figure 3). In the I‐Tasser model, the position is partially accessible to the solvent. It should be emphasized that all these considerations are speculative and they need to be substantiated by the availability of the experimental crystallographic structure of the corresponding proteins. 

4 DISCUSSION

The COVID‐2019 ongoing epidemic is worrying worldwide for its high contagiosity. From its first appearance in Wuhan, China, about 1 month ago, the virus infected thousands people with new cases number rapidly growing every day. For this acceleration in human‐to‐human transmission in China but with evident spreading also in other countries, World Health Organization declared the epidemic a global health emergency.18, 19
Many questions are open and need an answer, of these the most frequent is how much this virus can be dangerous and how much it differs from SARS virus which epidemic scared all the world some years ago. In this study some interesting findings have been evidenced to support and fill gaps in knowledge about the new COVID‐2019 that is still causing infection all over the world.20, 21

The positive selective pressure in this protein could justify some clinical features of this virus compared with SARS and Bat SARS‐like CoV.22
First which are the probably most common sites undergoing to an aminoacidic change, providing an insight of some important proteins of the COVID‐2019 that are involved in the mechanism of viral entry and viral replication. This data can contribute for a better understanding of how this virus acts in its pathogenicity.
Furthermore, to identify a potential molecular target is fundamental to follow the molecular evolution of the virus suggesting some interesting sites for potential therapy or vaccine.
The structural similarity of the region in which falls the positive selective pressure as so as the stabilizing mutation falling in the endosome‐associated‐protein‐like domain of the nsp2 protein, could explain why this virus is more contagious than SARS.
The destabilizing mutation happening near the phosphatase domain of the nsp3 proteins could suggest a potential mechanism differentiating COVID‐2019 from SARS.
The results of this study could fill some gaps about COVID‐2019 knowledge especially in the actual moment when the epidemic is ongoing and the scientific community is trying to enrich knowledge about this new viral pathogen. During epidemic, all strength has to be done to enforce virus fight. This can be achieved by understanding the main drivers for pathogen appearance, spreading, and supremacy on human defense.

Inga kommentarer:

Skicka en kommentar