Page 94 - MI-2-2
P. 94

Microbes & Immunity                                               Big data and DNN-based DTI model in CHP




                                                           elements such as TFs, genes, miRNAs, and lncRNA is
                                                           deleted by AIC in Equations XXII-XXV, its corresponding
                                                           component in R will be reduced to zero. To employ KEGG
                                                     pathways annotation for the core signaling pathways, real
                                 
             ζ   11  …  ζ 1y  …  ζ 1Y  ρ 11  …  ρ 1y  … ρ 1Y 
                                                GWGEN should be truncated to a core GWGEN with
                                                           6,000 significant components based on their significance
                              



             ζ  …   ζ   …   ζ    ρ   … ρ     … ρ             from the energy perspective. Based on singular value
                                                    
              1 l    ly      lY    m 1   my      m mY      decomposition, the network structure projection method,
                                         
              …                 …                    PNP, can be described with Equation XXLIII:
              ζ L 1  ζ Ly  …  ζ LY     ρ M 1  ρ My  …  ρ MY    R =ΛΣΘ T                        (XLIII)
                                
                                                                         ( KL MN *X YZ) (++  +  +  +  )  (KL MN++  +  ) ((*K LM N+ +  +  ) ,
                                                               where  R             ,  Λ
                                                          Θ (XY Z++  )*(X YZ++  )  , and  Σ is a diagonal matrix (i.e.
             κ 11  …  κ 1 y  …  κ 1 Y 
                                                       Σ  =  diag[σ ,σ ,…,σ ,…,σ X+Y+Z ]), including X+Y+Z non-
                                                                        1
                                                                               i
                                                                          2
                                                             negative  singular values of  R with descending order
                               
             κ   …  κ    …  κ                                 σ ≥ …≥ σ ≥ … ≥ σ     ≥0. In this context, diag(σ  σ )


              n1      ny     nY                               1      I        X+Y+Z                     1,  2
                                                                                               σ 1  0
              …                                           indicates the diagonal matrix of σ  and σ  (i.e.,   0  σ  ).
                                                                                          1
                                                                                                2
              κ N1  κ Ny  …  κ NY                                                                      2
                                                               The eigenvalue expression (network energy) fraction (E )
                                                      (XLI)    can be defined by the normalization in Equation XLIV.  I
              R m−g , R m−m , and R  stand for the matrices associated   Σ σ 2
                                                                      I
                             m−l
                                                                      i=1
            with transcriptional regulatory abilities of miRNAs on   E =  XY Z++  i  2  ≥ 085.         (XLIV)
                                                                I
            genes, miRNAs, and lncRNAs, respectively, as shown in   Σ c=1  σ c
            Equation XLII.                                       We selected the top I singular vectors of Θ, such that the
                                                                I
                                                               Σ σ i  ≥ 0  accounts for at least 85% of the total energy. The
                                                                i=1
                                                           minimal  I was chosen to ensure this proportion. These
                                                           selected singular vectors are then used to construct the
                                                           principal network structure, which captures 85% of the
                                                     network’s energy. Subsequently, the projection of each row
             ω 11  …  ω 1z  …  ω 1Z  λ 11  …  λ 1z  … λ 1Z 
                                                in  R onto the top  I singular vectors is performed. This
                                                           means that all edges of each node (i.e., each protein, gene,




                              
             ω  …   ω    …  ω  λ     … λ     … λ             miRNA, and lncRNA) in real GWGENs should be
                                                    
              1 l     lz     lZ    m 1   mz      m mZ      projected to the top  I  singular vectors as the following
                                         
              …                 …                    equation (Equation XLV):
              ω L 1  ω Lz  …  ω LZ     λ M 1  λ Mz  …  λ MZ    Project (,  = r  ⋅θ T
                                                                     bi)
                                                                   R    b  i
                                                             for b = 1,2,…,K + L + M + N and i = 1,2,…,I   (XLV)
                               
                                                            where  r   represents  the  b-th  row  vector  of  R  and  i
                                                                        b
             τ 11  …  τ  z 1  …  τ 1 Z                       denotes the i-th column vector of Θ, which is the right-
                         
                                                             singular vector of  Θ. We further define the 2-norm

             τ    …  τ  …  τ                                 projection value of each node to the top I right singular
                              
              n1     nz      nZ                              vectors using Equation XLVI.
                          
              …                                                  I            1
              τ N1  τ Nz  …  τ NZ                          D R b ()  =  ∑ ( project R ( , bi) 2   2
                                                                                )
                                                     (XLII)          i=1         
              In the network matrix  R in Equation XXXVIII or   for b = 1,2,…, K + L + M + N            (XLVI)
            real GWGENs of CHP and non-CHP, if an interaction    In conclusion, we can obtain the projection values
            between any two proteins or regulation between any two   of all nodes, including proteins, genes, miRNAs,  and
            Volume 2 Issue 2 (2025)                         86                               doi: 10.36922/mi.4620
   89   90   91   92   93   94   95   96   97   98   99