Page 128 - DP-2-3
P. 128

Design+                                                             ML for predicting Alzheimer’s progression



            critical role of cognitive assessments in the early detection   Ultimately,  this  framework  serves  as  a  foundation  for
            and diagnosis of AD.                               developing intelligent, personalized diagnostic support
                                                               systems that prioritize early intervention and optimized
            7. Conclusion and future work                      resource allocation.

            In this study, non-imaging data from the AIBL were   Given  the  time  constraints  of  this  research,  explicit
            analyzed to classify AD into three classes: HC, individuals   handling of outliers, multicollinearity, or distribution
            with MCI, and those diagnosed with AD. Extensive   abnormalities was not performed. However, the selected
            data cleaning and exploration were conducted to reveal   models possess built-in capabilities to address these
            underlying patterns and extract information using various   issues. Involving domain experts to directly address
            data mining techniques and statistical methods, including   these factors could further enhance model performance.
            correlation analysis, feature association analysis, feature   Moreover, the models were fine-tuned using the
            importance analysis, and feature selection.        “RandomizedSearchCV” function; however, a more
                                                                                                       28
              To address the challenge of class imbalance, synthetic   exhaustive approach, such as “GridSearchCV,”  could
            oversampling methods were employed to generate artificial   potentially yield better parameters by exploring a wider
            samples to balance the target classes. Subsequently, the data   range of combinations. Although this study focused solely
            were modeled and evaluated using advanced non-parametric   on direct multi-class classification, alternative approaches
                                                                                                        29
            ML algorithms, such as RF and XGBoost, first with complete   such  as  one-versus-one  and  one-versus-the-rest   may
            feature set and then with selected features obtained through   offer additional insights. Acknowledging these limitations
            the feature selection process. Fine-tuning techniques were   provides a pathway for future research and further
            applied to enhance predictive accuracy. The results from   exploration of the problem.
            these models underwent thorough evaluation to determine
            the most effective algorithm. The tuned RF model emerged   Acknowledgments
            as the top performer, achieving an accuracy of 90%, with   None.
            precision, recall, and F1 scores also reaching 90%.
              Furthermore, to reduce the diagnosis cost of AD and   Funding
            provide valuable insights toward developing a more   None.
            reliable and affordable diagnostic tool, the data were
            segmented into three  main variable  groups:  medical   Conflict of interest
            history, neuropsychological assessment, and blood analysis   The authors declare that they have no competing interests.
            with ApoE genotype variables. Corresponding ML models
            were then developed using the fine-tuned model of the   Author contributions
            best-performing algorithm. The “neuropsychological   Conceptualization: All authors
            assessment” classifier emerged as the most effective,   Formal analysis: Adhinrag Kalarikkal Induchudan
            exhibiting an exceptional accuracy of 90%.
                                                               Investigation: All authors
              Beyond its strong classification performance, this study   Methodology: All authors
            presents a replicable and cost-effective methodology that   Writing–original draft: Adhinrag Kalarikkal Induchudan
            may benefit other research groups, clinical practitioners,   Writing–review & editing: All authors
            and public health systems aiming to improve early detection
            of AD. By leveraging non-imaging data—including widely   Ethics approval and consent to participate
            available neuropsychological assessments—our approach   Not applicable.
            avoids the high costs and limited accessibility associated
            with imaging-based diagnostics. The use of interpretable   Consent for publication
            ML models, combined with robust feature selection and
            data  preprocessing  techniques,  facilitates  deployment  in   Not applicable.
            diverse clinical or research settings without the need for   Availability of data
            extensive computational infrastructure. Moreover, the
            proposed methodology can be adapted to other populations   Data is available from the corresponding author upon
            or datasets, supporting generalizability studies and cross-  reasonable request.
            cohort validation efforts. This makes it particularly relevant
            for low-resource settings or large-scale screening efforts   References
            where  rapid,  accurate,  and  affordable  tools  are  essential.   1.   Alzheimer’s Association. 2023 Alzheimer’s disease facts and


            Volume 2 Issue 3 (2025)                         10                           doi: 10.36922/DP025270031
   123   124   125   126   127   128   129   130   131   132   133