Page 116 - AIH-1-3
P. 116

Artificial Intelligence in Health                                 ISM: A new multi-view space-learning model



            In this approach, sets of views with enough common   and versatility of ILSM, extending far beyond the scope of
            observations are integrated with ISM separately. Using the   multi-view data analysis.
            model parameters, the transformation into the latent ISM
            space can be expanded to all views over all observations   5. Conclusion
            in the set, resulting in much larger transformed views   The  proof-of-concept  analysis  results  provide  strong
            than the original intersection would allow. This expansion   preliminary support for the proposed new method. As a
            process enables the integration of the ISM-transformed   next step, we will perform a comprehensive comparison of
            data from the different view sets, again using the ISM.   ISM  with  state-of-the-art  alternative  methods,  including
            Interestingly, a similar integrated latent space approach   those considered in this article, and report the findings in
            has already been proposed to study the influence of social   a follow-up article.
            networks on human behavior.  After masking a large
                                     47
            number of views, the dataset of UCI Digits dataset was   To further illustrate ISM’s key benefits and broad
            analyzed using this approach. A more detailed description   applicability, we will conclude by presenting some potential
            of the expansion  process (Workflow S1,  Figure S1)   applications currently under evaluation, with results to be
            and preliminary results (Figure S2) can be found in the   published in future articles.
            Supplementary Materials.                             In longitudinal clinical studies, where participants are
              Important issues such as the handling of highly dynamic   followed up later, the ISM model can be trained at baseline
            or rapidly updating datasets have not yet been investigated.   and applied to subsequent data to calculate meta-scores.
            This will be addressed in a future article.        The interpretability of the associated components makes
                                                               ISM meta-scores more appealing to clinicians compared
              It is worth noting that by replacing NMF with NTF in   to the mixed-sign latent factors from other factorization
            the initialization unit of the  ISM workflow, ISM  can be   methods.
            easily extended to multi-view data where the views are
            themselves tensors of order three or higher, provided that   Consider complex multidimensional multi-omics data
            all dimensions except the attribute dimension are shared   from one and the same set of cells (single-cell technology).
            between the  views.  Interesting applications  include the   There is a growing amount of single-cell data corresponding
            analysis of longitudinal multi-view data or the integration   to different molecular layers of the same cell. Data
            of multiple X-ray views. These topics will be the subject of   integration is a challenge as each modality can provide a
            dedicated articles.                                different clustering stemming from a specific biological
                                                               signal. Therefore, data integration and its projection into
              Finally, the extension of ISM to the ILSM approach, as   a space must: (i) preserve the consensus between two
            described in the methods section (Section 2), is achieved   clusterings and (ii) highlight the differences each modality
            by a simple chained matrix multiplication – an example of   may bring. ISM view loadings can address these two key
            ISM inheriting the simplicity and compactness of the NTF   requirements: components with similar contributions from
            model, made possible by embedding views in a 3D array.   each molecular layer highlight a consensus that can be
            This has important advantages:                     inferred from clustering based on the ISM meta-scores of
            i.  Performance                                    such components. In contrast, components with differing
               •   Independent view factorizations can be achieved   contributions from each molecular layer highlight each
                   using parallel computing.
               •   The number of attributes in each transformed   modality’s specificities, which can be inferred from clustering
                                                               based on the ISM meta-scores of such components.
                   view is reduced to its factorization rank, allowing
                   ISM to be performed on a much smaller dataset.  The area of spatial mapping, including spatial
            ii.  Versatility                                   imaging and spatial transcriptomics,  is expanding at an
               •   ILSM  can  be  applied to  compute NMF  on  big   unprecedented pace. An effective method for integrating
                   data in a federated or distributed way. To this   different levels of information, such as gene or protein
                   end, smaller slices are constructed at random,   expression and spatial organization of cell phenotypes, is
                   with each slice considered a particular view   an unmet methodological need. We believe that ISM can
                   that is submitted to ISM. Preliminary results   integrate these different levels of information, as shown
                   indicate significant performance improvements   in the analysis of the UCI Digits data, to capture the
                   (Workflow S2 and example in the Supplementary   constituents that allow spatial patterns to be distinguished
                   Materials).                                 across all levels.
              While ILSM does not claim to outperform all alternative   The identification of new chemotypes with
            approaches in every context, this illustrates the scalability   biological activity similar to that of a known active


            Volume 1 Issue 3 (2024)                        110                               doi: 10.36922/aih.3427
   111   112   113   114   115   116   117   118   119   120   121