Page 104 - TD-3-4
P. 104

Tumor Discovery                                                   Bioinformatics insights into CCL2 mutations



            complexes were prepared for MD simulations using the   of CCL2 proteins – indicators to classify protein residues –
            GROMOS 43a1 force field.  RMSD measures the average   as well as changes in secondary structures between native
                                 65
            distance between the atoms of two superimposed molecular   and mutant CCL2 proteins. 66
            structures. It is commonly used in bioinformatics to assess
            the similarity of 3D structures, particularly in the study   2.10. Docking of CCL2-native and -mutant proteins
            of protein conformations.  RMSD is calculated using the   with receptor CCR2
                                 61
            formula:                                           The objective of molecular docking for the ligand–receptor
            RMSD  = 1/ N  ∑ i n =1 () iδ  2             (I)    (CCL2–CCR2) binding process is to construct an accurate
                                                               binding free energy landscape and estimate the binding
                                                               affinity and binding kinetics. We introduced a novel hybrid
              where  δi represents the distance between the    docking  (HDOCK)  protocol  that  integrates  template-
            corresponding atoms in the two structures, and N is the   based and template-free approaches to enhance protein–
            number of atoms considered. Typically, RMSD is calculated   protein docking accuracy. Unlike traditional methods that
            for the backbone heavy atoms or alpha carbon (Cα) atoms.  focus on docking monomer proteins, HDOCK utilizes
              RMSF is a measure of the average deviation of atomic   homologous complex structures of the proteins involved.
            positions from their mean positions over time in MD   Initially, homologous complexes were identified using
            simulations. It quantifies the fluctuations of individual   BLAST searches of the PDB, with a sequence identity cutoff
            atoms or residues, providing insights into the flexibility   set at 25%. Among multiple hits, the optimal template was
            and mobility of different regions within a protein structure.   selected based on the highest sequence coverage, sequence
            RMSF is calculated as the square root of the variance of the   similarity,  and  resolution.  This  selected  homologous
            fluctuations around the average position  using the formula:  complex structure was then used as a docking template,
                                           62
                                                               incorporating potential binding interface information. By
            RMSF  = (γi − ) i  2                       (II)    leveraging the structural information from the homologous
                         γ
                                                               complexes, HDOCK substantially improves the prediction
                                                               accuracy of protein–protein interactions.  The individual
                                                                                                67
              where γi represents the coordinates of atom i, and 〈γi〉 is
            its ensemble average position. High RMSF values indicate   protein structures are modeled using the homologous
                                                               complexes as templates, either by superimposing them onto
            regions with considerable mobility, whereas low values   available homologous complexes or through homology
            suggest rigidity.
                                                               modeling.  In our study, the individual component
                                                                       68
              To gain insights into the hydrogen bonding characteristics   structures of the modeled complex were separated and
            of the proteins, we analyzed the hydrogen bond geometries.   subjected to a global sampling of potential binding modes
            As the positions of hydrogen atoms are uncertain in these   using our hierarchical protein–protein docking algorithm.
            structures, they were excluded from the analysis. In total,   This approach allows for a comprehensive exploration of
            4195 hydrogen bonds were identified and categorized into   possible interaction interfaces  between the proteins. 69,70
            four types based on the donor and acceptor atom origins:   Symmetric docking was achieved by constraining the
            main-chain to main-chain (MC-MC), main-chain to side-  translational degrees of freedom to ensure symmetry
            chain,  side-chain  to  main-chain,  and  side-chain  to  side-  along a specific axis. In cases where no complex structure
            chain. The MC-MC hydrogen bonds were further classified   templates were available, the individual protein structures
            according to the secondary structures, including alpha   for docking were modeled using their homologous
            helices, beta strands, and non-secondary structures.  monomer structures with Modeller.  The binding modes
                                                                                            71
              Furthermore, we investigated the correlations between   obtained from the docking simulations were evaluated
            hydrogen bond geometry and environmental factors, such   and ranked according to their binding energy scores. To
            as atomic depth, secondary structure, and partial charge.   group similar conformations, an RMSD cutoff of 5.0 Å was
            Atomic depth was defined as the distance to the nearest bulk   applied for clustering. From the resulting clusters, the top
            water  molecule. This  analysis  provides  a comprehensive   100 binding modes were examined and the best 10 models
            understanding of the characteristics and determinants of   were selected for submission to the Critical Assessment of
            hydrogen bonding in high-resolution protein structures.  Predicted Interactions initiative.
            2.9. Examination of alterations in CCL2 protein    3. Results
            secondary structure                                The results of this study demonstrated the considerable
            The Database of Secondary Structure in Proteins tool   impact of mutations in the CCL2 gene on the molecular
            analyzes hydrogen bonding and other secondary structures   stability, functionality, and structure of the resulting


            Volume 3 Issue 4 (2024)                         6                                 doi: 10.36922/td.3891
   99   100   101   102   103   104   105   106   107   108   109