Page 51 - AIH-1-1
P. 51

Artificial Intelligence in Health                                AI model for cardiovascular disease prediction



            the synthetic minority oversampling technique (SMOTE)   3. Materials and methods
            to enrich the data. Regarding model performance in terms
            of discrimination, GBT and RF were the models with the   3.1. Implementation of ANN-GA for CVDs model
            highest receiver operating characteristic curve of 97.8%   In this research, a CVD prediction model was developed
            and 97.7%, respectively. The discriminative powers of ET,   using a combination of techniques based on the evolution
            LR, and DNN were 96.8%, 96.4%, and 95.3%, respectively,   theory – GA and ANN. This ANN-GA model was used
            with the latter exhibiting the least discriminative power. In   for the correlated attribute selection, training, and
            terms of reliability, LR predictions were the least calibrated   optimization of the CVD selected features to achieve
            compared  with  the  other  four  algorithms.  Thus,  despite   better prediction accuracy. The ANN utilized a multilayer
            increasing the training time, SMOTE was proven to further   perceptron to receive the transmitted signal, process the
            improve the model performance of LR, while algorithms
            like GBT and DNN performed well with class-imbalanced   signal (neurons), and compute the output of each neuron
            data .                                             for weighted using back-propagation to adjust the NN
               [43]
                                                               model  parameters  for  a  reduced  mean  squared  error
              Khan  et al. utilized a machine learning algorithm   (MSE). While the GA is an optimization searching agent
            for  the  accurate prediction and  decision-making for   in the evolution theory, that is used to optimize the output
            CVD  patient .  Simple random  sampling  was  used  to   performance of a classifier model using a supervised
                       [44]
            select heart disease patients from the Khyber Teaching   machine learning algorithm (SMLA). The function of GA
            Hospital and Lady Reading Hospital, Pakistan. Some of   evolution initiates the process from a randomly generated
            the machine learning methods involved are DT, RF, and   population for an individual which changes at every
            LR. The performance of the proposed machine learning   iteration in the loops to formulate an offspring called
            algorithm was estimated using numerous conditions to       [45]
            recognize the most suitable machine learning algorithm   generation  The fitness (best chromosome) of every
            in the class of models. The RF algorithm has the highest   individual generated within the population was appraised,
            accuracy of prediction, sensitivity, and recursive operative   and the fittest was chosen (selection of chromosome) by
            characteristic curve of 85.01%, 92.11%, and 87.73%,   computing the probability for being selected from among
            respectively, for CVD prediction. It also has the least   the best chromosome in the population in proportion
            specificity and  misclassification errors  of 43.48% and   to the sum of the fitness among individuals as given
                                                                           [46]
            8.70%, respectively. The overall performance evaluation   in Equation I . Nevertheless, the generation of the
            result showed that the RF algorithm is the most appropriate   individual population can be modified by crossover and
            algorithm  for  CVD  classification  and  prediction .   mutation to form a newly evolved generation candidate
                                                        [44]
            A summary of the authors’ focus, methods, and strengths   as presented in Figure 1. The overview of the ANN-GA
            is presented in Table 1.                           flowchart is presented in Figure 2.

            Table 1. Summary of the related works in terms of the authors’ focus, methods, and strength
            Authors’ focus                  Methods                         Performance accuracy
            Proposed CVD prediction model [35]  Using machine learning algorithms such as   K-neighbor 95.62%, SVM 95.62%, DT 91.05%,
                                            SVM, RF, Naive Bayes classifier, and LR  RF 95.40%, and LR 95.59%
            Performance of data mining classification   Using three popular classification techniques,   KNN 82.96%, SVM 85.18%, and ANN 73.33%
            techniques for heart disease prediction [36]  such as KNN, SVM, and ANN
            Proposed machine learning algorithm for heart   Using PSO and ACO  The average accuracy for PSO and ACO is 99.65%
            disease prediction and classification [38]
            Machine learning approaches for predicting,   SVR, multivariate adaptive regression splines,   ANFIS 96.56%, and SVR 91.95%
            classifying, and improving the diagnosis   M5Tree model neural networks, adaptive
            accuracy of CVDs [42]           neuro-fuzzy, and KNN, and Naive Bayes classifiers
            Nurturing clinicians for accurate prediction of   LR, RF, ET, and GBT  GBT 97.8%, RF 97.7%, ET 96.8%, LR 96.4%,
            mortality among CVD patients [43]
            Accurate prediction and decision-making for   DT, RF, LR        DT 85.01%, RF 92.11%, and LR 87.73%
            CVD classification [44]
            Abbreviations: ACO: Ant colony optimization; ANFIS: Adaptive neuro-fuzzy inference system; ANN: Artificial neural network; CVD: Cardiovascular
            disease; DT: Decision tree; ET: Extra tree; GBT: Gradient boosting tree; KNN: K-nearest neighbor; LR: Logistic regression; PSO: Particle swarm
            optimization; RF: Random forest; SVR: Support vector regression.



            Volume 1 Issue 1 (2024)                         45                        https://doi.org/10.36922/aih.1746
   46   47   48   49   50   51   52   53   54   55   56