Page 28 - TD-3-1
P. 28

Tumor Discovery                                                       AI uncovers tumor spatial organization



            1. Introduction                                    neural network (GNN) architectures aimed at reducing
                                                               the dimensionality of ST data and identifying spatial
            Spatiotemporal  molecular   medicine    involves   regions. Notably, SpaGCN  presents a comprehensive
                                                                                     18
            comprehending medicine across  various dimensions,   framework that partitions data based on the aggregation
            layers, perspectives, and dynamics through the integration   of gene expression, histology, and spatial location. In the
            of spatialization and temporalization of clinical   cell clustering for ST data technique, a sequence of graph
            phonemes with spatiotemporal molecular omics.  This   convolutional networks is embedded within the deep
                                                     1
            emerging discipline strives to elucidate the pathogenesis,   graph infomax module to derive embeddings for cell
            epidemiology, historical context, patient symptoms and   nodes.  STAGATE  devises a graph attention autoencoder
                                                                              20
                                                                    19
            signs, clinical measurements, and therapeutic approaches.   framework for spatial domain identification. This
            Spatially resolved transcriptomics plays a crucial role
            in connecting and correlating information between   approach constructs a cell-type-aware spatial neighbor
                                                               network to characterize the spatial similarity at domain
            histological sections and molecular profiles, such as cell-  boundaries. Similarly,  GraphST  utilizes a  graph self-
                                                                                         21
            cell interactions, transcription factor distribution, spatial   supervised contrastive learning mechanism for the unified
            positioning, and mRNA expression within the cell through   analysis of ST data, encompassing spatial clustering,
            artificial intelligence, computerized programming, and
            visualization techniques.  Spatial transcriptomics (ST)   multisample integration, and cell-type  deconvolution.
                                2
            technologies document transcriptomic expression data and   The contrastive learning framework within the spatial
            their respective two-dimensional tissue locations at spatial   clustering module enables the acquisition of informative
                    3
            resolution.  In the past decade, various ST technologies   and discriminative spot representations by minimizing the
            have been introduced to capture mRNA molecules and   embedding distance between spatially adjacent spots. All
                                                               the algorithms mentioned above can be improved in terms
            their locations, each differing in resolution, sensitivity,
            sequencing depth, and scale. Some notable technologies   of clustering performance.
            include seqFISH+,  MERFISH,  omsFISH, 10× Visium,    Motivated by the successful application of diverse
                           4
                                                          7
                                     5
                                               6 
            Slide-seqV2, 10× Xenium,  and Stereo-seq. 10       artificial intelligence algorithms in analyzing ST datasets,
                      8 
                                 9
              Partitioning spatial  domains in  a  tumor’s spatial   this paper introduces a variational  graph autoencoder
            transcriptome data can shed light on the composition   (VGAE) enriched with multiple layers of GNNs for
            of the tumor microenvironment and the localized    identifying spatial domains in tumor tissues (Figure  1).
            development of tumor cells. The primary goal of spatial   Initially, gene expression profiles and spatial location
            clustering is to segment tumor tissues into distinct cell   information are transformed into a gene feature matrix and
            subpopulations, facilitating downstream analyses of   a cell adjacency matrix, respectively. The VGAE consists of
            differential gene expression, biological function grouping,   two primary components: an encoder and a decoder. The
            and cell-cell communication.  Conventional methods   encoder employs GNNs to extract matrix features, reduce
                                     11
            classify cell types based on gene expression profiles using   the feature dimensions, and generate latent embeddings. In
            algorithms such as K-means and Louvain.  Giotto    contrast, the decoder reconstructs two matrices based on
                                                         13
                                                  12
            identifies spatial domains through the construction of a   these latent embeddings. Clustering methods were, then,
            hidden Markov random field model, leveraging both the   applied to latent embeddings to detect spatial domains
            feature matrix and spatial coordinates. stLearn  utilizes a   within the tumor tissue. Various loss functions are utilized
                                                 14
            standard Louvain clustering workflow to handle the cell   to enhance the quality and performance of the latent
            adjacency matrix, subsequently using spatial positions to   embeddings by  minimizing  the  overall  errors,  including
            identify subgroups within broader clusters. BayesSpace    the reconstruction error and Kullback-Leibler (KL)
                                                         15
            employs a fully Bayesian statistical model to enhance   divergence error. The presented spatial clustering method
            resolution and conduct clustering analysis, promoting the   is compared with existing approaches to demonstrate
            categorization of nearby spots into the same group based   its effectiveness. Additional details are provided in the
            on a predefined spatial prior. Despite the potential of spatial   “Methods” section. In addition, we employ this clustering
            location information to enhance clustering accuracy, the   framework on different tumor tissues to investigate their
            aforementioned spatial clustering algorithms have yet to   respective tumor microenvironments.
            achieve optimal performance. 16
                                                               2. Methods
              Graph deep learning (GDL) has emerged as a
            promising approach for integrating gene expression   2.1. VGAE
            profiles and spatial location information to address spatial   The VGAE is a machine learning model that is primarily
            clustering tasks.  Numerous studies have proposed graph   utilized in the domains of graph representation learning
                         17

            Volume 3 Issue 1 (2024)                         2                          https://doi.org/10.36922/td.2049
   23   24   25   26   27   28   29   30   31   32   33