Page 54 - AIH-1-1
P. 54

Artificial Intelligence in Health                                AI model for cardiovascular disease prediction



            and two output layers. The effect of the number of hidden
            neurons on the network performance was carried out, and
            five different numbers of hidden neurons (10, 20, 25, and
            30) were investigated. In each case, the network was trained
            several times and the best performance was recorded
            using the Levenberg algorithm. The number of training
            cycles required for proper generalization of outputs
            was determined through the trial-and-error method.
            Furthermore, the threshold function was determined
            through  trial  and  error  (0.5).  The  NN  was  trained  and
            tested using 76 samples, of which 70% samples for training
            and 30% samples were for testing 14, 10, and 8 attributes,
            and the number of datasets varied between 2000, 1500,
            1000, and 500, respectively. The data collected for training
            was inputted into the training network designed. The
            training performance was evaluated, and the MSE was   Figure 4. Graphical user interface for cardiovascular disease prediction
            observed. The ANN and GA were combined to evaluate   system.
            the performance of the training and testing samples. The
            GA was used for attribute selection, and the obtained   4.1. Performance evaluation results of the selected
            attributes were then trained and tested with the number of   number of attributes using the ANN model
            datasets between 2000, 1500, 1000, and 500.
                                                               The performance of the ANN classification technique
            3.3. API for the CVD prediction                    using different numbers of attributes of 14, 10, and 8
                                                               was experimented. The result of ANN performance was
            An  expert  CVD  prediction  model  was  developed  in   evaluated in terms of accuracy, mean square error (MSE),
            MATLAB  for  the  experimentation  and  prediction
            observation of the CDV parameters. The CVD model   sensitivity, specificity, and precision, measuring 82.70%,
                                                               0.1728, 87.50%, 75.80%, and 84.00%, respectively. It is
            interface involved the load data tab, the pre-processing   observed that when the cardiac disease attributes were
            data tab, and the diagnoses patient tab. This model allows   reduced to 10, the results obtained from the training were
            a patient to enter the medical information required in the   80.20%, 0.1975, 91.70%, 63.60%, and 78.60%, for the same
            input data directory. The “Load data” button is ticked to   set of performance parameters (as in the above order). For
            load data for processing in the CVD system model. The   the 8 attributes, the results of the same set of performance
            “Processing data” button is ticked for pre-processing   parameters were 79.0%, 0.2099, 79.20%, 78.80%, and
            (data normalization) and displaying the data on the   84.40%, respectively. Therefore, it is observed that as the
            interface. The “Diagnoses patient” button is ticked to   number of attributes was reduced from 14 to 10 and 8, the
            initiate information processing by the system and generate   accuracy obtained decreased, the MSE value increased,
            results on the message box as either “Normal” or “CVD”.   and the sensitivity, specificity, and precision decreased
            The  graphical user interface developed in MATLAB is   (Figure 5).
            illustrated in Figure 4.
                                                                 The performance evaluation of combined ANN-GA for
            4. API for CVD prediction                          the vascular disease classification was analyzed. GA was

            The CVD dataset was pre-processed, trained, and    used for the subset of selected attribute. The seven best
            predicted using ANN, hybridized ANN-GA, K-means,   attributes correlated with cardiac disease classification are
            KNN, SVM, and DT  classification  techniques.  The   presented in Table 3. ANN was used to train the selected
            results obtained from the various AI techniques for   attributes, and GA was used for the attribute selection and
            CVD with their performance were evaluated using    classification. The results obtained showed 86.4%, 0.1358,
            the cardiac disease attributes and varying the number   91.70%, 78.80%, and 86.3% for accuracy, MSE, sensitivity,
            of datasets: 2000, 1500, 1000, and 500. This number of   specificity, and precision, respectively, as illustrated in
            selected attributes of the dataset varied from 14, 10, and   Figure 6.
            8, and their performance on the different classifiers of the   The K-means algorithm was used in the training of
            machine learning model was observed and presented in   different selected attributes (14, 10, and 5). The results
            Section 4.1 and Section 4.2.                       obtained showed 59.60%, 0.4030, 64.50%, 51.90%, and


            Volume 1 Issue 1 (2024)                         48                        https://doi.org/10.36922/aih.1746
   49   50   51   52   53   54   55   56   57   58   59