Page 50 - AIH-1-3
P. 50

Artificial Intelligence in Health                                  Predicting mortality in COVID-19 using ML




            A                                                 B

































             Figure 16. SHAP summary plots of the “KNeighborsClassifier” method. (A) Barchart. (B) Beeswarm. Image created using Python’s Matplotlib library

            The highest-ranked LR models processed datasets      In third place were the MLPs models, scoring from
            processed using the “Min–Max” method, with ranges of   93.43% (142   position) to 95.62% (19   position), with
                                                                                               th
                                                                         nd
            0 – 100 and 0 – 1000 (mm_0 – 100 and mm_0 – 1000,   88.9% (48/54) occupying positions between 51  and 113 .
                                                                                                    st
                                                                                                            th
            respectively), and used either 22 or 15 attributes with the   The highest-ranked MLPs models processed datasets using
            default set of (default) hyperparameter.           the “Min–Max” method, with ranges of 0 – 100 and 0 –
            4.2.2. Recall                                      1000 (mm_0 – 100 and mm_0 – 1000, respectively) and
                                                               used either 22 or 10 attributes with either the first or the
            The RF models showed the highest values for recall, ranging   second sets of optimized (opt-01 and opt-02, respectively)
            from 90.83% (271   position) to 96.99% (1   position).   hyperparameter values.
                           th
                                                 st
            The distribution of the model’s rankings demonstrates
            significant dispersion, with 33.3% (18/54) ranking above   In the fourth place are the DTs models, scoring from
                                                                                                 th
                                                                         th
            the 19   position and 44.4% (24/54) occupying positions   83.42% (324  position) up to 93.69% (135  position). The
                 th
            between  119   and  176 .  The  highest-ranked  RF  models   highest scoring DT models handled datasets that were
                               th
                      th
            handled datasets processed with the “Min–Max” method,   either not processed with any normalization method (none)
            using ranges of 0 – 100 and 0 – 1000 (mm_0 – 100 and mm_0   or were processed with the “Min–Max” method, with 0 –
            – 1000, respectively), and used either 22 or 15 attributes with   1000 (mm_0 – 1000) range, used either 22 or 15 attributes
            the second set of optimized (opt-02) hyperparameter values.  and the first set of optimal (opt-01) hyperparameter values.
              Following  the  RF models, the XGBoost models      In  fifth  place  were  the KNN  models,  scoring  from
                                                                         rd
                                                                                                th
            ranked second, with recall values ranging from 93.96%   88.23% (303   position) to 92.95% (157   position). The
                                   th
            (133  position) to 95.61% (20  position), and 63% (34/54)   highest-ranked KNN models used datasets processed
               rd
            ranking above the 70   position. The highest-ranked   with the “StandardScaler” method (std) and used either
                               th
            XGBoost models processed datasets using the “Min–Max”   22 or 15 attributes with the first set of optimized (opt-01)
            method, with ranges of 0 – 100 and 0 – 1000 (mm_0 –   hyperparameter values.
            100 and mm_0 – 1000, respectively), and used either 22   Lastly, the LR models showed recall values ranging
            or 10 attributes with either the default or the first set of   from 90.79% (278  position) to 91.89% (182  position).
                                                                                                    nd
                                                                              th
            optimized (opt-01) hyperparameter values.          The highest-ranked LR models processed datasets using
            Volume 1 Issue 3 (2024)                         44                               doi: 10.36922/aih.2591
   45   46   47   48   49   50   51   52   53   54   55