摘要
Genetic analyses showed nearly 30 amino acid mutations occurred in the spike protein of the Omicron variant of SARS-CoV-2. However, how these mutations occurred and changed during the generation and development of Omicron remains unclear. In this study, 6.7 million (all publicly available data from 2020/04/01 to 2022/04/01) SARS-CoV-2 genomes were analyzed to track the origin and evolution of Omicron variants and to reveal the genetic pathways of the generation of core mutations in Omicron. The haplotype network visualized the pre-Omicron, intact-Omicron, and post-Omicron variants and revealed their evolutionary direction. The correlation analysis showed the correlation feature of the core mutations in Omicron. Moreover, we found some core mutations, such as 142D, 417N, 440K, and 764K, reversed to ancestral residues (142G, 417K, 440N, and 764N) in the post-Omicron variant, suggesting the reverse mutations provided sources for the emergence of new variants. In summary, our analysis probed the origin and further evolution of Omicron sub-variants, which may add to our understanding of new variants and facilitate the control of the pandemic.