Leta i den här bloggen

onsdag 13 april 2022

Omikronin evoluutin ja fylogenetiikan selvittelystä Artikkeli joulukuulta 2021.

. 2022 Apr;94(4):1627-1632.
doi: 10.1002/jmv.27515. Epub 2021 Dec 15.

Omicron variant genome evolution and phylogenetics

Affiliations Abstract

Following the discovery of the SARS-CoV-2 Omicron variant (B.1.1.529), the global COVID-19 outbreak has resurfaced after appearing to be relentlessly spreading over the past 2 years. This new variant showed marked degree of mutation, compared with the previous SARS-CoV-2 variants. This study investigates the evolutionary links between Omicron variant and recently emerged SARS-CoV-2 variants.

 The entire genome sequences of SARS-CoV-2 variants were obtained, aligned using Clustal Omega, pairwise comparison was computed, differences, identity percent, gaps, and mutations were noted, and the identity matrix was generated. 

The phylogenetics of Omicron variants were determined using a variety of evolutionary substitution models. The ultrametric and metric clustering methods, such as UPGMA and neighbor-joining (NJ), using nucleotide substitution models that allowed the inclusion of nucleotide transitions and transversions as Kimura 80 models, revealed that the Omicron variant forms a new monophyletic clade that is distant from other SARS-CoV-2 variants

In contrast, the NJ method using a basic nucleotide substitution model such as Jukes–Cantor revealed a close relationship between the Omicron variant and the recently evolved Alpha variant. Based on the percentage of sequence identity, the closest variants were in the following order: Omicron, Alpha, Gamma, Delta, Beta, Mu, and then the SARS-CoV-2 USA isolate. 

A genome alignment with other variants indicated the greatest number of gaps in the Omicron variant's genome ranging from 43 to 63 gaps. It is possible, given their close relationship to the Alpha variety, that Omicron has been around for much longer than predicted, even though they created a separate monophyletic group. 

Sequencing initiatives in a systematic and comprehensive manner is highly recommended to study the evolution and mutations of the virus.

PMID: 34888894 DOI: 10.1002/jmv.27515  

Highlights

  • Omicron is a new SARS-CoV-2 variant.

  • Omicron encodes the largest number of genomic mutations among all variants.

  • The genome of Omicron formed a new monophyletic class.

  • Based on the percentage of sequence identity, the closest variant is Alpha variant.

1 INTRODUCTION

Omicron was discovered in Botswana in early November. South Africa reported it to the World Health Organization on November 24, 2021, and it was designated as a variant of concern (VOC) on November 26, 2021.1 Omicron contains a huge number of previously documented mutations in other VOCs, including at least 32 mutations in the spike protein alone, compared to the 16 mutations in the already highly infectious delta version, as well as other proteins required for viral replication such as NSP12 and NSP14.2

Several assumptions postulated the Omicron variant's probable emerging pattern, including (1) the potential circulation among chronically infected patients. (2) The introduction of the new variant in some South African countries during the winter wave, which went unreported due to lower genome sequencing in some countries. (3) Spike mutations may have increased the Spike's ability to connect to the ACE2 receptor on host cells. (4) A hidden animal reservoir could be the culprit, owing to the high number of mutations found in the Omicron form. (5) Africa's low immunization rate may have contributed to the establishment of the Omicron variant.3

Because of the emergent nature of the Omicron variation, various concerns have been raised, including the source of emergence, the effect of mutations in Omicron in the response to vaccinations, the influence of mutations on modulation of host immunity, clinical data, Omicron spreading potency and lethality. In this study, an attempt was made to trace the phylogenetic relationships of the Omicron genome. To achieve the best fit of alignment of whole genomes, many methodologies were used.

 

2.1 Collection of genomes and analytical programs

The genomes of CoV variants were retrieved from GISAID (https://www.gisaid.org/).4 The basic information of the used genomes are provided in Table 1.

 The CLC Genomics Workbench 12.0 (QIAGEN)5 and Geneious prime6 software were used to handle the sequences.

2.2 Alignment of genomes

The FASTA files containing entire genomes were uploaded to the Clustal Omega website at the European Bioinformatics Institute using the default parameters, and the results were analyzed. Using in-house software, the output files were imported, and the pairwise comparison matrix was produced. Differences and identity percent were calculated, as well as gaps and mutations were noted, and the identity matrix was generated.

3 RESULTS AND DISCUSSION

The first sequenced genome of Omicron variant was used to trace its phylogenetic relations with other SARS-CoV-2 variants (Table 1). The sample was collected in November 11, 2021 in Botswana, the genome sequence was submitted on November 23, 2021 with accession no. EPI_ISL_6640916. The other variants comprised Alpha, Beta, Gamma, Delta, Mu, GH49R, in addition to the SARS-CoV-2 USA isolate.

The FASTA files of variants were aligned by Clustal Omega. Pairwise comparison matrix revealed that the largest number of mutations were recorded with the Omicron variant. In comparison to other variants, the number of nucleotide changes in the Omicron genome was in the following order: SARS-CoV-2 USA isolate > Mu variant > Beta variant > Delta variant > Gamma variant > Alpha variant > Omicron variant, with 141, 140, 138, 132, 130, and 109 mutations, respectively (Figure 1). The Alpha variant had the greatest identity rate with Omicron variant (99.63%), followed by Gamma and Mu variants (99.56%). The SARS-CoV-2 USA isolate has the lowest identity (99.53%). Furthermore, Omicron variant showed the greatest number of gaps during genome alignment with other viruses, ranging from 43 to 63 gaps (Figure 2).

Pairwise comparative matrix of Omicron with SARS-CoV-2 variants. The upper diagonal panel is the differences in nucleotide composition between two variants. The lower diagonal panel is the percent identity. The matrix color ranges from red (larger differences) to blue (smaller differences)

The phylogenetic analysis of Omicron variant is provided in Figures 3-7. The NJ/Tamura (Figure 3), NJ/Kimura 80 (Figure 4), UPGMA/Kimura 80 (Figure 5), and UPGMA/JC (Figure 6) revealed that Omicron variant formed a new emergent group that was not originating with other variants. In contrast, the NJ/JC (Figure 7) revealed the close relation of Omicron variant with Alpha variant.

Details are in the caption following the image
Phylogenetic tree using neighbor-joining method and Tamura substitution model. The figure was generated by Geneious prime software

 

Details are in the caption following the image
Phylogenetic tree using UPGMA method and Kimura 80 substitution model. The figure was generated by CLC genomics software

 

Phylogenetic tree using UPGMA method and Jukes–Cantor substitution model. The figure was generated by CLC genomics software
Phylogenetic tree using neighbor-joining method and Jukes–Cantor substitution model. The figure was generated by CLC genomics software
 
Bioinformatics and phylogeny tools are gold standards in microbial evolution and drug discovery against selected molecular targets.8-10 Analysis of SARS-CoV-2 genome constituents highlighted the forces affecting virus evolution.11 In this study, we used a combination of tools to get insights into the evolution of Omicron variant. The UPGMA approach assumes that all lineages evolve at the same rate, and the mutation rate is not taken into account during tree construction. The tree construction depends on the pairwise distance. In contrast, the NJ considers the evolution rate during tree construction. The JC model of evolution considers all possible changes to nucleotides occurring with equal rates. While Kimura model assumes considers the transitions (e.g., changes of A to T or G to C) and transversions (e.g., changes from purines to pyrimidines). In  virus evolution, a single evolution model cannot be assumed due to the complexity of virus evolution and variations even within single genes.12 A NJ tree is expected to be insensitive to tree topology in the JC model, and a NJJC tree is thought to provide a good estimate of tree topology.

Based on sequence alignment differences and gaps, the Alpha variant has the fewest nucleotide alterations when compared to the Omicron variant. The close relationship between Omicron and the Alpha variant may indicate that the Omicron variant was in circulation for a lengthy period before it was discovered. The building of a phylogenetic tree using ultrametric distances between variants and equivalent evolution rates among branches using the UPGMA method revealed that the Omicron variant is phylogenetically distant from other variants, producing a monophyletic clade. Since UPGMA method merges pairs of sequences with small distance together, it was evident that Omicron variant was greatly distant from other variants. The NJ method based on Kimura 80, which takes into account mutation rates as well as nucleotide transitions and transversions, ensured that the Omicron variant was significantly different from other variants and formed a distant monophyletic class. In contrast, a close relationship with the Alpha variant was obtained using a simplified model of JC replacement with identical likely nucleotide mutation rate.

 

Inga kommentarer:

Skicka en kommentar