Page 82 - AIH-2-4
P. 82

Artificial Intelligence in Health                            Federated learning health stack against pandemics



            1. Introduction                                    model under the coordination of a central server without
                                                               exchanging raw data.
            1.1. General overview: The big picture
                                                                 The FL process typically begins with the central server
            The emergence of the COVID-19 pandemic in 2019 shook   initializing a global model and distributing its parameters
            the foundations of human civilization. While it was not the   to selected clients. Each client independently trains the
            deadliest pandemic in history, it ranks fifth in terms of death   model using its local dataset, ensuring data privacy by
            toll, with an official World Health Organization (WHO)   keeping raw data on the local device. After local training,
                                   1
            estimate of 7 million deaths.  When ranked by death toll,   clients send model updates, such as model gradients, to the
            the following pandemics are the deadliest, in descending   central server. The server then aggregates these updates,
            order: (i) the Spanish flu (17 – 100 million deaths, 1918 –   commonly using methods such as federated averaging
            1920),  (ii) the Plague of Justinian (15 – 100 million deaths,   (FedAvg),  to update the global model. The updated model
                 2
                                                                       11
            541 – 549),  (iii) human immunodeficiency virus/acquired   is then redistributed to the clients for the next round of
                     3
            immunodeficiency syndrome (approximately 43 million   training. This cycle continues iteratively until the model
            deaths, 1981–present),  and (iv) the Black Death (7 – 35   converges or reaches a predefined accuracy threshold.
                              4
            million deaths, 1346 – 1353). 4
                                                               Finally, the trained global model is deployed for use.
              What sets COVID-19 apart from previous pandemics   As discussed, FL eliminates the need to transfer raw
            is that it occurred in a highly globalized, post-internet   data to a central server; instead, only model parameters
            world, where information could spread instantaneously.    are exchanged. This approach significantly reduces the risk
                                                          5
            Combined  with  significant  advancements  in  modern   of data breaches, as sensitive information remains on the
            sciences, particularly in medical and computational   client’s side. FL thus upholds data privacy and security,
            sciences, it was shocking to see how the pandemic exposed   making  it  one  of the  highly accepted protocols  across
            the  limitations  of  human  technological  capabilities.   countries, communities, and agencies. For this reason, it
            As argued in previous studies,  although we had the   has been chosen in this article as the foundational model
                                      5
            components of a modern technological infrastructure to   for addressing future pandemics.
            create a formidable defense, we failed to synergize them
            effectively to prevent the pandemic’s escalation.    We envision that combating future pandemics will
                                                               require the holistic deployment of the entire computing
              Therefore, the central question addressed in     infrastructure, including, but not limited to, the Internet-
            this article is: What would the next-generation    of-Things (IoT) devices, sensors, edge computing, and
            computational infrastructure look like to effectively   cloud infrastructure. While FL offers privacy-preserving
            combat future pandemics? Such infrastructure should   model training, its integration with emerging technologies,
            be deployable within current medical record-keeping   such as edge intelligence and IoT, in healthcare introduces
            systems, which involve heterogeneous data types and   new challenges that must be addressed. Recent studies
            regulations  (mostly  privacy-related).  To  stimulate  this   have highlighted the growing need for edge intelligence
            discussion, we propose a federated learning (FL) -  and distributed learning in health care. 12,13  For example,
                                                        6,7
            based, global machine learning (ML) architecture with   research presents an overview of mobile health systems
            the potential to address this key issue. While we do not   and IoT technologies, focusing on system architecture
            claim this is the ultimate solution (though we believe it   and data integration strategies.  However, centralized
                                                                                         14
            may be), the main purpose of this article is to motivate   or cloud-based models often require raw or partially
            the  journey  towards  that  goal.  For  brevity,  we  will   processed data to be transmitted, raising concerns about
            highlight one or a few methods for each step, although   data security in sensitive applications, such as health
            multiple viable solutions may exist. A  comprehensive   care. In another study,  the authors discussed the role
                                                                                  13
            review of FL applications in smart healthcare is   of artificial intelligence in IoT-enabled smart healthcare
            presented elsewhere; 8-10  hence, we will only summarize   applications,  with  a  primary  focus on lightweight
            key technical preliminaries of FL here.            neural network architectures deployed directly on edge
              In the absence of a global enforcing agency for   devices. Yet, the study does not address the security risks
            administering  critical  pandemic/medical  issues  (as  the   associated with transmitting trained model parameters
            WHO lacks such authority), a decentralized approach   from edge devices. This omission overlooks potential risks
            offers the most promising pathway for global acceptance   such as adversaries intercepting and reverse-engineering
            and implementation. FL is a decentralized ML approach   the model parameters to extract sensitive information,
            in which clients (e.g., organizations, healthcare units,   or manipulating them to induce bias or abnormal
            mobile devices, and sensors) collaboratively train a shared   model  behavior.  In  our  proposed  framework,  we  use  a


            Volume 2 Issue 4 (2025)                         76                          doi: 10.36922/AIH025080013
   77   78   79   80   81   82   83   84   85   86   87