Page 91 - MI-2-4
P. 91

Microbes & Immunity                                         Brachyspira pilosicoli novel outer membrane proteins



            identify structural homologs across five protein databases:   proteins, to identify OMBB proteins (Figure  1). Ten
            PDB100,  CATH50,  AFDB50,  AFDB-SWISSPROT,  and    computational tools were used for predictions: Pepstats,
            AFDB-Proteome.  The top hit, based on the template   DEG database, SignalP, LipoP, CELLO, PSORTb, OMPdb,
                          53
            modeling score, was selected for functional annotation.   MCMBB, TMBETADISC-RBF, and TMbed (Table S2).
            Sequence-based functional  annotation was performed   Prediction outputs from all tools were combined for each
            using PANNZER (http://ekhidna.biocenter.helsinki.fi/  protein. Tools that specifically predict OMBB proteins
            pannzer; accessed on February 13, 2025) and eggNOG-  (e.g., OMPdb, MCMBB, TMBETADISC-RBF, and TMbed)
            mapper v2 (https://eggnog-mapper.embl.de/; accessed on   were prioritized for OMBB protein prediction (Table S2).
            February 2, 2025). 54,55                             Through  stringent  screening  criteria  and  manual
            2.4. Amino acid sequence variation among different   curation, a total of 42 OMBB proteins were selected and
            strains of B. pilosicoli                           classified into two groups: Group A (13 proteins, predicted
                                                               as OMBB by all four tools) and Group  B (29 proteins,
            Predicted OMBB proteins from the reference genome   predicted as OMBB by any three out of four tools) (Table 1).
            95/1000 were searched for similar proteins using BLASTP
            (E-value < 1×10 ; bit score > 100) across nine completed   To gain structural insights into the predicted proteins,
                         −3
            genomes of B. pilosicoli to analyze the amino acid sequence   we searched for their structures in the PDB but found
            variation in the predicted  β-barrel proteins (Table S1).   no experimentally determined models available. Using
                                                                         43
            Multiple  sequence alignment  (MSA) was  performed for   AlphaFold 3,  structural models of the predicted proteins
            orthologous sequences of each protein using ClustalW   were generated, revealing typical features of OMPs, such
            (Thompson JD, Gibson TJ, Higgins DG; EMBL,         as  β-barrel architectures with central pores, periplasmic
            Heidelberg, Germany).  Analysis of the MSA revealed   loops, and surface-exposed loops.
                               56
            amino acid substitutions among the orthologs. Mapping of   Given that most proteins were unannotated, both
            these variations onto the structural models was conducted   structure- and sequence-based approaches were employed
            using PyMOL. 46                                    to assign putative functions. Top-ranking hits were
              In addition, the 16S rRNA gene sequence of B. pilosicoli   considered for functional annotation. This analysis
            strain P43/6/78 was retrieved from the NCBI database and   revealed structural and sequence homologs of well-
            used as a query for BLASTn against available  B. pilosicoli   characterized proteins, including OMP assembly factor
            genome sequences. From the BLASTn results, strains   BamA, LPS-assembly protein D (LptD), Neisserial surface
            previously  included  in  the sequence  variation analysis   protein NspA, OM porin F (OmpF), OM phospholipase
            were identified, and their corresponding 16S rRNA gene   A (OMPLA), and vitamin B  transporter protein BtuB,
                                                                                      12
            sequences were extracted. A multi-FASTA file was created,   thereby providing valuable insights into their possible roles
            followed by MSA using Multiple Sequence Comparison by   (Tables 2 and S3).
            Log Expectation (Robert C. Edgar, USA). Subsequently, a   OMPs are located on the bacterial surface, serving as
            phylogenetic tree was constructed using the Neighbor-Joining   the primary interface between host and pathogen. Due
            method in MEGA12 software (Kumar S, Tamura K; Temple   to exposure to the host environment, these proteins are
            University, Philadelphia, USA, and Tokyo Metropolitan   subjected to strong selection pressures, making the analysis
            University, Japan) to determine the evolutionary relationships   of their sequence variability essential for understanding
            among the selected strains (Figure S1). 57         pathogen evolution (Table 3).  Building on this, we
                                                                                         59
            2.5. Structure alignment using the US-align server  analyzed the amino acid sequences of the predicted OMBB
                                                               proteins for residues exhibiting sequence variation across
            Structural models were aligned using the US-align server   nine B. pilosicoli strains (Table S4). Mapping these variations
            (https://zhanggroup.org/US-align/; accessed on July 23,   onto the structural models revealed that many variations
            2024)  an online web server to assess structural similarities   were located on extracellular loops (ECLs), which are more
                58
            and variations. Structural alignments were visualized, and   likely to interact with the host environment (Table 3).
            figures were generated using PyMOL.
                                                               3.2. Identification of OMPs
            3. Results and discussion                          3.2.1. Group A
            3.1. Prediction of OMBB proteins using a consensus-  Group  A comprised of 13 proteins, consisting of three
            based computational approach                       proteins with 16 stranded  β-barrel domain (BP951000_
            A consensus-based computational framework was applied   RS05730, BP951000_RS10215, and BP951000_RS04760);
            to the B. pilosicoli 95/1000 proteome, consisting of 2,275   seven proteins with eight-stranded  β-barrel domain


            Volume 2 Issue 4 (2025)                         83                           doi: 10.36922/MI025230050
   86   87   88   89   90   91   92   93   94   95   96