Page 13 - GPD-1-1
P. 13

Gene & Protein in Disease                                                   Bioinformatics study of PCNP



            were calculated using MDS for the 50-ns time interval. The   a buffer system for purification by the pI method . The
                                                                                                       [42]
            results of RMSD and RMSF are presented in Figure 2. The   instability index of PCNP protein was computed as 15.56,
            RMSD values were observed in the range of 0.2–0.9 nm   which indicates that the protein may be marginally stable
            of the protein model. In brief, the results revealed that the   as the values are lower than 40. The instability index of a
            RMSD  value  came  to equilibrium,  maintaining  around   protein greater than 40 indicates that the protein is likely
            0.6–0.8 nm throughout the MDS run of 50 ns (Figure 2A).   to be unstable.
            Although the lack of experimental structure, these MDS   In addition, the value of the aliphatic index, which is
            results suggest that the protein model is dynamically stable   the relative volume of aliphatic side chains, indicates the
            throughout the MDS period with a slightly higher RMSD   thermal stability of a protein structure. The higher the
            value. Similarly, the amino acid residue wise fluctuation,   positive factor value, the  greater  the  thermal  stability  of
            the RMSF value, was plotted and is shown in Figure 2B.   the respective protein. The aliphatic index of PCNP was
            The RMSF analysis demonstrated that notable deviations   calculated  to be 89.93,  indicating its  stability at  higher
            have occurred in the first 20–25 amino acid residues, but   temperatures . The grand average of the hydropathicity
                                                                         [43]
            this type of high fluctuation was not observed in other   of the protein was predicted to be −0.463, which indicates
            residues throughout the MDS period.
                                                               the hydrophilic nature of the protein and improved
            3.3. Physicochemical characterization and          interaction with water. The detailed results are documented
            secondary structure prediction                     in Figure S1.
            The ProtParam tool, available on the ExPASy server,   Of the total 178 amino acids, 58 (32.58%) were found
            predicts the physiochemical properties of the protein   in the alpha helix, 11  (6.18%) in the extended strand,
            under investigation. The PCNP gene encodes a protein of   8  (4.49%) in the beta turns, and 101  (56.74%) in the
            178 amino acids (C H 1304 N O S ) having a molecular   random coil region of the PCNP protein as predicted by
                            805
                                      280 4
                                   238
            weight of 18924.88 kDa. The theoretical isoelectric point   the SOPMA tool. These results demonstrate that the PCNP
            (pI) of this protein was found to be 6.86 with 28 negatively   protein secondary structure has a dominant random coil
            charged residues (Asp + Glu) and 28 positively charged   region followed by beta turns and an alpha helix. The
            residues (Arg + Lys). At this computed pI, the PCNP protein   detailed results are presented in Figure S2.
            seemed to be stable and compact and was considered
            slightly acidic. These results will be helpful in preparing   3.4. GO and pathway enrichment analysis
                                                               GO is an efficient machine learning method useful for the
            A                       B                          identification of biological processes, molecular functions,
                                                               and cellular components of a query protein. In the present
                                                               study, a total of 46 biological processes, 10 molecular
                                                               functions, and 10 cellular components of PCNP were
                                                               identified by the Network Analyst server (Table S1). All
            C                       D                          the predicted biological processes results were below 0.05
                                                               (P-value) and considered a significant outcome. The top 10
                                                               biological processes results are demonstrated in Figure 3A.
                                                               In biological processes analysis, our results indicate
                                                               that PCNP has proximate and profound involvement in
            Figure 1. The 3D structures of PCNP protein predicted by (A) AlphaFold,   essential cellular activities, such as cell cycle, cycle arrest,
            (B) RoseTTAfold, (C) Robetta servers, and (D) I-TASSER.  and cellular catabolic processes (Table S1). Out of the 10

            Table 2. The quality factors‑score values of four structures of PCNP by predicted by various corresponding servers.

             Protein  SERVER             SWISS MODEL                   PROCHECK             PROSAWEB   ERRAT
             PCNP                  QMEAN4 Score  QMEANDisCo   Favored   Allowed   Disallowed   Z‑score  ERRAT
                                                 Global Score  region (%)  region (%)  region (%)
            1       AlphaFold2        −9.33       0.34±0.06    61.5       37.8      0.7       −1.71    92.4528
            2       RoseTTAFold       −2.39       0.42±0.06    80.4       17.6       2         NA      91.3386
            3       I-TASSER          −6.37       0.28±0.06    87.2       11.5      1.4        NA      94.7059
            4       Robetta           −12.32      0.31±0.05    40.4       55.7      3.8       −3.08    82.8244
            Based on these results, I-TASSER model was selected as best structure and subjected to further MDS analysis.



            Volume 1 Issue 1 (2022)                         5                       https://doi.org/10.36922/gpd.v1i1.65
   8   9   10   11   12   13   14   15   16   17   18