Page 12 - GPD-1-1
P. 12

Gene & Protein in Disease                                                   Bioinformatics study of PCNP



            protein structure was prepared using the GROMOS96 43a1   and enriched pathway analysis of PCNP, the Network
            force field solvated by the single point charge water model and   Analyst server (https://www.networkanalyst.ca/) was
            was fixed in a periodic cubic solvated box. Other parameters   employed [34,35] . It is a publicly available online tool widely
            were  neutralized  by  adding  the  salt  (NaCl)  concentration   used for gene expression profiling, transcriptional factor
            (0.15 M), energy minimization for 5000 steps of the prepared   analysis, PPI networks, pathway analysis, toxicogenomics,
            system, and equilibration type NVT/NPT at 300 K and 1 bar   and pharmacogenomics studies [36,37] . In addition, the
            of pressure. The final MDS step was processed for 50 ns of   Enrichr web server (https://maayanlab.cloud/Enrichr/)
                                                                                                        [35]
            time. The MDS was analyzed by calculating the root mean   was utilized to validate the Network Analyst results .
            square deviation (RMSD) and root mean square fluctuation
            (RMSF) values of the protein structure.            2.7. Phylogenetic analysis
                                                               To determine the evolutionary history of PCNP between
            2.4. Physicochemical characterization and          humans and other vertebrates, the phylogenetic tool,
            secondary structure prediction                     molecular evolutionary genetics analysis (MEGA) X, was
                                                                   [38]
            The basic physicochemical properties, such as theoretical   used . PCNP has no reported paralogs in the biological
            isoelectric point (pI), molecular weight, total number   database (ENSEMBL) or in the available literature.
            of positive and negative residues, extinction coefficient   Orthologs of PCNP were used for tree construction
            instability index, aliphatic index, and grand average   and sequences of eight orthologous candidate proteins
            hydropathy, were computed using the ProtParam tool   were  retrieved  from  the  NCBI  database.  Alignment  of
            of ExPASy (http://web.expasy.org/protparam/) [26-28] . For   multiple sequences was carried out using the MUSCLE
            the secondary structure prediction of PCNP, SOPMA   program implemented within MEGA X. For pairwise
            bioinformatics  tool  (https://npsa-prabi.ibcp.fr/cgi-bin/  sequence identity, Sequence Demarcation Tool Version 1.2
                                                                                                           [39]
            npsa_automat.pl?page=/NPSA/npsa_sopma.html) was used   (SDTv1.2; http://web.cbio.uct.ac.za/~brejnev/) was used .
            with its default query parameters . This tool uses the self-  The best-fit substitution model was estimated in MEGA
                                      [29]
            optimized prediction method and a neural network method   and selection was done based on the Neighbor-Joining
            (PHD) to predict the secondary structure of proteins [29,30] .  Method. The rate of variation among sites was modeled
                                                               with a gamma distribution (shape parameter = 1). All
            2.5. Coexpression and protein-protein interaction   positions that contained alignment gaps and missing data
            (PPI) analysis                                     were eliminated from the analysis. A phylogenetic tree was
            The coexpression analysis of the PCNP was performed   constructed using the unweighted pair group method with
            by the GeneMANIA server (https://genemania.org/) by   arithmetic mean, and the validity of the tree generated was
                                                                                                         [40,41]
            setting  Homo sapiens as the species organism and the   tested by bootstrap analysis of 1000 pseudoreplicates  .
            maximum number of resultant genes and attributes as 10.   3. Results and discussion
            It is a web interface that has a high-accuracy prediction
            algorithm and a large database for gene functional analysis,   3.1. Structure prediction
            including coexpression, genetic interaction, colocalization,   Four tools were utilized to generate the 3D structure of
            physical interaction, pathway interaction, and shared   the PCNP protein. From the results of all these tools, we
            protein domains of the submitted query . Furthermore,   selected the best score-generating model and subjected it
                                            [31]
            the PPI of PCNP was evaluated by accessing the search   to further validation through the ERRAT and PROCHECK
            tool for the retrieval of interacting genes (STRING)   servers.  In  addition,  the  quality  model  QMEAN  score
            protein database (https://string-db.org/). The PPI network   was evaluated for the best scoring models. The graphical
            was constructed based on the data from tandem affinity   illustration of these four best models is presented in
            purification assay, affinity chromatography technology   Figure 1. Following the 3D model analysis, it was discovered
            assay, and coimmunoprecipitation assay. The STRING   that the I-TASSER generated model had the highest quality
            database query terms were cutoff at 0.4 and the maximum   factor score among the four best-scoring models, as shown
            additional interaction was 10 . Cytoscape software   in Figure 1D and Table 2. Moreover, the I-TASSER model
                                      [32]
            version  3.8.2 (https://cytoscape.org/) was employed to   structure was subjected to MDS for profound validation
            construct the coexpression and PPI networks .      analysis.
                                                [33]
            2.6. Gene ontology (GO) and pathway enrichment     3.2. Molecular dynamics simulation
            analysis                                           An MDS was conducted for the analysis of the stability
            To assimilate the biological data of GO, including biological   and flexibility of the I-TASSER-generated PCNP protein
            processes, molecular functions, and cellular components,   model. The RMSD and RMSF of the protein structure


            Volume 1 Issue 1 (2022)                         4                       https://doi.org/10.36922/gpd.v1i1.65
   7   8   9   10   11   12   13   14   15   16   17