Page 72 - GTM-3-1
P. 72

Global Translational Medicine                                       Evaluating ML models for CAD prediction




                           Raw         Simple      Simple      Ordinal     One Hot       Logistic
                           Data        Imputer     Imputer    Encoder      Encoder      Regressor
            Figure 1. Preprocessing pipeline for logistic regressor model.
















            Figure 2. Flowchart demonstrating workflow for Pycaret.

            3. Results

            Overall, 14 ML classification models were assessed with the
            current dataset. The logistic regressor (LR) model had the
            best outcome with the highest overall performance. LR had
            an accuracy of 0.79, recall of 0.80, precision of 0.80, F1 score
            of 0.80, Cohen’s Kappa (κ) of 0.57, and Matthews correlation
            coefficient (MCC) of 0.57. The linear discriminant analysis
            (LDA) had a comparable outcome to the LR model with
            slightly lower metrics. The overall performance metrics of
            all 14 ML models are outlined in Table 3.
              Since the LR had the highest overall performance
            metrics, further analysis was performed on this model
            and its learning process. The confusion matrix (Figure 3)   Figure 3. Confusion matrix for logistic regressor model.
            indicated  a true positive (TP) value  of 139  out  of 315
            record (44.12%), true negative (TN) value of 116 (36.83%),   reasonable balance of sensitivity and specificity, making it a
            false-negative value of  33  (10.48%), and false-positive   potentially viable tool for assisting in the diagnosis of CAD.
            value of 27 (8.57%). Hence, 116 instances were correctly   The ROC curve (Figure  4) is a graphical plot that
            identified as TP, indicating patients with CAD, while 139   illustrates the diagnostic ability of a binary classifier system
            were TN, correctly identifying patients without the disease.   as its discrimination threshold is varied. It is created
            Clinically, these numbers are significant as they ensure that   by plotting the TP rate against the false-positive rate at
            patients with the disease are identified for treatment and   various threshold settings. The AUC represents the degree
            those without are not subjected to unnecessary procedures.   of separability achieved by the model; it tells us how well
            However, the model also produced 27 false positives,   the  model  is  capable  of  distinguishing  between  classes.
            where the disease was incorrectly predicted, potentially   In the context of the study, the ROC curves and their
            leading to undue stress and unwarranted further testing   respective AUC values for class 0 and class 1 being equal to
            for those individuals. Of greater clinical concern are the   0.88 suggest that the logistic regression model has a high
            33 false negatives, where the disease was present but went   level of discrimination for both identifying patients with
            undiagnosed, potentially resulting in delayed treatment   CAD (class 1) and for identifying those without the disease
            with serious health implications. The model’s sensitivity,   (class 0). The same AUC of 0.88 for both the micro-average
            or its ability to correctly identify those with the disease, is   and macro-average ROC curves indicates that the model
            calculated as 116/(116 + 33), which equals approximately   is consistently accurate across both classes. Clinically, this
            0.78, while the specificity, or the ability to correctly rule out   means that the model is very effective at distinguishing
            disease, is 139/(27 + 139), which equals roughly 0.84. These   between patients with and without the disease, which is
            figures suggest that the logistic regression model has a   crucial for a diagnostic tool where the cost of misdiagnosis


            Volume 3 Issue 1 (2024)                         5                        https://doi.org/10.36922/gtm.2669
   67   68   69   70   71   72   73   74   75   76   77