Page 100 - MI-2-3
P. 100

Microbes & Immunity                                                SARS-CoV-2 complementary classification



            transmissibility, and immune escape mechanisms. Genetic   ambiguous bases and incomplete sequences removed
            divergence thresholds were established by analyzing the   before analysis. The final alignment contained a total
            evolutionary distances between recognized subtypes   of 2,214 nucleotide positions, with discrete clustering
            or genotypes of HIV-1, HCV, and influenza A virus, as   observed in the ML phylogenetic analysis (Figure 2).
            outlined in the sections above.                      The evolutionary divergence between subtypes was

              A review of the literature was conducted in PubMed/  assessed using pairwise genetic distances (Table 2). The
            MEDLINE to identify key functional parameters that have   genetic distances among HCV subtypes ranged from
            historically defined viral lineages in other RNA viruses,   0.371 s/s (1a vs. 1b) to 0.674 s/s (3a vs. 4a), reflecting the
            including HIV-1, HCV, influenza A virus, and previous   well-documented high genetic diversity of HCV. The mean
            SARS-CoV-2 designated as VOCs. The exact search strategy,   divergence across all subtype comparisons was 0.57  s/s,
            concluded on 31 January 2025, was: ((“SARS-CoV-2” OR   which is notably higher than that observed in HIV-1 env
            “COVID-19” OR “HIV-1” OR “Hepatitis C Virus” OR    sequences.
            “HCV”  OR “Influenza A”) AND (“variant of concern”
            OR “VOC” OR “subtype” OR “genotype” OR “lineage”   3.3. Influenza A virus dataset and evolutionary
            OR “clade”) AND (“transmissibility” OR “replication   distance analysis
            fitness” OR “R0” OR “immune escape” OR “neutralizing   The influenza dataset included 49 full-length  HA
            antibodies” OR “vaccine resistance” OR “vaccine escape”   sequences, representing two major subtypes, H1 and H3,
            OR “clinical severity” OR “hospitalization” OR “mortality”   as indicated in the ML phylogenetic tree (Figure 3). The
            OR “epidemiological dominance” OR “variant persistence”   genetic divergence between these subtypes was estimated at
            OR “lineage replacement”)).                        1.956 s/s, which is substantially higher than the divergence
              Criteria were categorized into four virologically relevant   observed in both HIV-1 and HCV subtypes.
            domains: (1) Transmissibility: Evidence of increased   3.4. SARS-CoV-2 dataset and evolutionary distance
            replication fitness 40,91,108-112 ; (2) Immune escape potential:   analysis
            ≥2-fold  reduction  in  neutralizing  antibody  effectiveness,
            impacting vaccine efficacy, or convalescent immunity 92,113 ;   The SARS-CoV-2 S sequence dataset in the ML phylogenetic
            (3) Clinical impact: Increased hospitalization, mortality, or   analysis showed a clear separation into five different
            multisystem complications compared to prior circulating   statistically  supported  monophyletic  clades  (Figure  4).
            variants 114-119 ; and (4) Epidemiological dominance: Variant   Pairwise genetic distance analysis of SARS-CoV-2 variants
            must persist for >12 months, indicating sustained selective   revealed minimal divergence, with nucleotide substitution
            advantage. 89,120-122                              rates ranging from 0.002 to 0.006 s/s. The highest genetic
                                                               distance was observed between Omicron and Beta
            3. Results                                         (0.006 s/s) and Omicron and Alpha (0.006 s/s), while the
                                                               lowest was between Beta and Gamma (0.002 s/s). Variants
            3.1. HIV-1 env dataset description and ML          Delta, Beta, and Gamma exhibited similar divergence levels
            phylogenetic analysis                              relative to Alpha (0.003 – 0.004  s/s), indicating limited
            A total of 58 full-length HIV-1 env sequences were analyzed,   evolutionary separation. These findings demonstrate that
            representing six major subtypes and CRFs. The ML   SARS-CoV-2 variants remain within a constrained genetic
            phylogenetic tree robustly supported the classification of   space, with divergence levels substantially lower than those
            these subtypes into distinct lineages, with bootstrap values   observed in HIV-1 subtypes, HCV genotypes, or influenza
            exceeding 0.99, indicating high confidence in the inferred   A virus subtypes (Table 3).
            evolutionary relationships (Figure 1). The distances ranged
            from 0.157  s/s (A vs. CRF02_AG) to 0.235  s/s (D vs.   3.5. Establishing the genetic divergence threshold
            CRF01_AE), with an overall mean divergence of 0.21 s/s.   based on sequences analyzed
            The highest divergence was observed between subtypes D   To objectively classify viral genetic divergence, three
            and CRF01_AE (0.235 s/s), whereas the lowest was found   evolutionary divergence thresholds were established
            between subtypes A and CRF02_AG (0.157 s/s) (Table 1).  based on the minimum observed genetic distances in
                                                               HIV-1, HCV, and influenza A virus. Three thresholds,
            3.2. HCV E1 dataset and evolutionary distance      shown in Figure 5, are as follows: low divergence, below
            estimation                                         0.157 s/s (based on HIV-1 subtype minimum); moderate
            A total of 40 HCV E1 sequences were analyzed, representing   divergence, between 0.157 and 0.371 s/s (based on HCV
            four major subtypes: 1a, 1b, 3a, and 4a. The dataset was   subtype minimum); and high divergence, above 0.371 s/s
            curated to ensure full-length coding sequences, with all   (with influenza A virus defining the upper bound at


            Volume 2 Issue 3 (2025)                         92                           doi: 10.36922/MI025190042
   95   96   97   98   99   100   101   102   103   104   105