Tohoku J. Exp. Med., 2023 January, 259(1)

Genetic Recombination Sites Away from the Insertion/Deletion Hotspots in SARS-Related Coronaviruses

Tetsuya Akaishi,1,2 Kei Fujiwara3 and Tadashi Ishii1,2

1Department of Education and Support for Regional Medicine, Tohoku University, Sendai, Miyagi, Japan
2COVID-19 Testing Center, Tohoku University, Sendai, Miyagi, Japan
3Department of Gastroenterology and Metabolism, Nagoya City University, Nagoya, Aichi, Japan

The genome sequences of severe acute respiratory syndrome (SARS)-related coronaviruses (sarbecoviruses) have been reported to include many long and complex insertions/deletions (indels) in specific genomic regions, including open reading frame 1a (ORF1a), S1 domain of the spike, and ORF8 genes. These indel hotspots incorporate various non-classical, long, and complex indels with uncertain developmental processes. A possible explanation for these complex and diversified indels at the hotspots is genetic recombination. To determine the possible association between recombination events and development of indel hotspots, this study investigated the genome sequences of many sarbecoviruses from different countries and hosts and compared the distributions of the indel hotspots and recombination sites by performing multiple sequence alignments and recombination analyses. The genomes of 19 SARS-related coronaviruses (15 coronaviruses that infect bats, two that infect humans, one that infects pangolins, and one that infects civets), including human-infecting SARS-CoV and SARS-CoV-2, were evaluated. Hotspots of complex indels with diverse RNA sequences around gaps were clustered in non-structural protein 2 (Nsp2) and Nsp3 of ORF1a, S1, and ORF8. Phylogenetic reconstructions revealed different structures of the inferred phylogenetic trees between genomic regions, and recombination analyses identified multiple recombination sites across ORF1ab and S genes. However, the nucleotide positions of the indel hotspots were not identical with the identified recombination sites in the recombinant viruses, suggesting the involvement of different developmental processes of indel hotspots and genetic recombination. Further research is required to elucidate the developmental mechanisms underpinning clustered complex indels and recombination events in the evolutionary history of sarbecoviruses.

Keywords —— genetic recombination; multiple sequence alignment; phylogenetic reconstruction; SARS-CoV-2; SARSrelated coronaviruses

===============================

Tohoku J. Exp. Med 2023, 259, 17-26.

Correspondence: Tetsuya Akaishi, Department of Education and Support for Regional Medicine, Tohoku University, 1-1 Seiryo-machi, Aoba-ku, Sendai, Miyagi 980-8574, Japan.

e-mail: t-akaishi@med.tohoku.ac.jp