Page 67 - TD-3-1
P. 67

Tumor Discovery                                      Missense mutations in CXCR1: Impact on stability and function



            including sequence information in FASTA format (UniProt   proteins with known 3D structures.  This information is
                                                                                            42
            ID: P25024) from the National Centre for Biological   used to generate substitution probability tables, which are
            Information (NCBI). 31,32  The structural information of   subsequently employed to estimate the predicted impact of
            CXCR1 was obtained from the Protein Data Bank (PDB)   a mutation on protein stability. 41
            (http://www.rcsb.org/) with PDB ID: 2LNL. 33         The mCSM computational approach (http://structure.
            2.2. Computer-assisted pathogenicity analysis of the   bioc.cam.ac.uk/mcsm) 27  was  specifically  designed
            mutations                                          to predict the effects of missense mutations on both
                                                               protein stability and interactions with its corresponding
            Computational techniques play a vital role in assessing the   ligands.  This prediction technique utilizes graph-based
                                                                     22
            effect of mutations on proteins and distinguishing between   signatures to capture atom distance patterns, enabling
            deleterious and benign single-nucleotide variations. This   precise evaluations of the effect of mutations on protein
            capability is crucial for accurately distinguishing the impact   stability and interactions. In detail, mCSM encodes spatial
            of genetic alterations on protein function, facilitating the   relationships between atoms and provides insights into
            identification of disease-causing mutations. Meta-SNPs   the structural consequences of genetic variations at the
            are web-based tools (http://snps.biofold.org/meta-snp/)   atomic level.  These signatures enable the mapping of the
                                                                         43
            that  employ  multiple  predictive  algorithms  to  assess  the   protein residue environment and the training of predictive
            potentially detrimental effects of protein mutations.  The   models. In addition to evaluating the effect of mutations
                                                      23
            tool integrates other prediction methods, such as SNAP,    on protein stability, mCSM also takes into account possible
                                                         34
            PhD-SNP, PANTHER,  and SIFT,  into the system,     interactions with proteins and nucleic acids. Therefore,
                                          36
                               35
            making it one of the most powerful tools for predicting   the mCSM computational approach can provide valuable
            the impact of mutations on protein function. Meta-  insights into how mutations might contribute to the
            SNPs utilize a random forest algorithm consisting of 100   development of various diseases. 44
            decision trees implemented through the WEKA library.
                                                         37
                                                                                                       29
            This algorithm was trained on the SV-2009 dataset using   DUET  (http://structure.bioc.cam.ac.uk/duet)   is a
            a rigorous 20-fold cross-validation methodology.  The   computational method that combines two prediction
                                                     23
            output of this predictor is a probability score indicating   approaches, mCSM and SDM, to accurately predict the
                                                                                                        29
            the likelihood of a mutation causing a disease, with scores   effects of missense mutations on protein stability.  The
            <0.5 indicating a greater likelihood of the mutation being   predictions generated by these methods are combined
            pathogenic. 38                                     using  support  vector  machines  (SVMs)  that  have  been
                                                               trained with sequential minimal optimization.  By
                                                                                                        45
              The PROVEAN web server applies an alignment-     integrating these methods, DUET can produce a consensus
            created  scoring  methodology to  determine  whether  an   prediction, offering a reliable and optimized approach for
            amino acid replacement affects a protein’s biological   accurately predicting the effects of mutations on protein
            function.  In this approach, mutations with final scores   stability. 29
                   25
            <2.5 were considered harmful. The web server can be
            accessed at (http://provean.jcvi.org/seq_submit.php). 39  2.4. Evaluation of cancer-related mutations through
                                                               the FATHMM server
            2.3. Analysis of protein stability
                                                               We utilized the FATHMM server (http://fathmm.
            Mutations in a protein can modify its stability and free   biocompute.org.uk/cancer.html) to assess the relevance of
            energy, wherein the difference in free energy (ΔΔG) between   harmful mutations to cancer. This web-based tool provides
            the mutated (ΔGm) and wild-type (ΔGw) proteins reflects   predictions that enable differentiation between cancer-
            the effect of mutation on protein stability. 28,40  A negative   causing mutations and other nonpathogenic germline
            ΔΔG value represents a destabilizing mutation, while a   polymorphisms.  We accurately assessed the potential
                                                                            46
            positive  ΔΔG value  indicates a  stabilizing  mutation. 28,40    role of deleterious mutations in cancer development,
            Several computational approaches,  including  mCSM,   enhancing our understanding of the genetic basis of cancer
            SDM, and DUET, were used to determine the effect of   and facilitating the identification of clinically significant
            mutations on protein stability. 28                 mutations. 26
              SDM2    (http://marid.bioc.cam.ac.uk/sdm2)  is  a
            computational method designed to predict the quantitative   2.5. Protein structure modeling of the mutation site
            impact of mutations on protein stability. 28,41  SDM2 analyses   To construct a mutant model of CXCR1 with targeted
            the variation in potential amino acid substitutions within   mutations (N57D, R135C, and P302S), the crystal structure
            a specific structural environment tolerated by homologous   of the native protein was downloaded from PDB, and the


            Volume 3 Issue 1 (2024)                         3                          https://doi.org/10.36922/td.2512
   62   63   64   65   66   67   68   69   70   71   72