Page 145 - EJMO-9-1
P. 145

Eurasian Journal of Medicine and
            Oncology
                                                                        Machine learning insights into heart failure outcomes



            Table 3. Performance metrics of various machine learning
            models trained on the selected attributes to predict death
            events among heart failure patients

            Performance   Logistic   Random   Support   Gradient
            metric      regression  forest  vector   boosting
                                          machine  machine
            Accuracy      0.80     0.73     0.58     0.73
            Precision     0.88     0.80     0.00     0.76
            Recall        0.60     0.48     0.00     0.52
            F1-score      0.71     0.60     0.00     0.61
            AUC-ROC       0.83     0.84     0.52     0.88
            Abbreviation: AUC-ROC: Area under the curve of the receiver
            operating characteristic.                          Figure 3. The area under the curve of the receiver operating characteristic
                                                               (AUC-ROC) plot illustrates the performance of different machine
                                                               learning models in predicting death events among heart failure patients.
            performance in predicting death events among HF patients.   Each curve represents the ROC curve for a specific model, with the AUC
            The SVM model performed less favorably, achieving an   score indicating the discriminative ability of the model to distinguish
            accuracy of 58.33%, with precision and recall values of   between positive and negative outcomes.
            0% due to the lack of positive predictions. Consequently,   Abbreviations: AOC: Area under the curve; ROC: Receiver operating
                                                               characteristic
            the F1-score and AUC-ROC metrics did not apply to this
            model. On the other hand, the GBM classifier exhibited an
            accuracy of 73.33%, precision of 76.47%, recall of 52%, and   A            B
            F1-score of 61.90%. The AUC-ROC value for this model
            was 88.23%, indicating strong discriminative ability and
            high predictive performance.

            3.4. Prediction analysis of machine learning models
            The confusion matrix illustrates the classification of actual
            and predicted values into four categories: true positives
            (1 – 1), false positives (0 – 1), true negatives (0 – 0), and
            false negatives (1– 0). Each cell in the matrix represents the   C         D
            count of instances falling into the corresponding category.
              The confusion matrix, depicted in Figure 4A, presents
            the performance of a linear regression model in predicting
            outcomes for HF patients. The model accurately predicted
            10 cases where both the actual and predicted outcomes
            were positive (true positives). However, it incorrectly
            predicted a positive outcome in 15 instances where
            the actual outcome was negative (false positives). The
            model also accurately predicted 33 instances where both
            the  actual  and  predicted  outcomes  were  negative  (true   Figure  4. The confusion matrix illustrates the performance of (A)
            negatives), while it erroneously predicted a negative   logistic regression, (B) random forest, (C) support vector machine, and
            outcome in two instances when the actual outcome was   (D) gradient boosting machine learning models in predicting death
            positive (false negatives). Figure 4B shows the confusion   events among heart failure patients
            matrix representing the performance of a random forest
            model in predicting outcomes for HF patients. The    Figure 4C displays the confusion matrix representing
            random forest model correctly predicted 13 instances   the performance of the SVM model in predicting outcomes
            where the actual and predicted outcomes were positive   for HF patients. The SVM model failed to predict any
            (true positives), with five instances as false positives. It   true positives (0 instances) and did not generate any false
            accurately  predicted 30  true negatives, but misclassified   positives. The model accurately predicted 35 true negatives,
            12 instances as negative when the actual outcome was   but misclassified 25 instances as negative outcomes when
            positive.                                          the actual outcomes were positive (false negatives).


            Volume 9 Issue 1 (2025)                        137                              doi: 10.36922/ejmo.6583
   140   141   142   143   144   145   146   147   148   149   150