Page 53 - AIH-1-1
P. 53

Artificial Intelligence in Health                                AI model for cardiovascular disease prediction



            number of major vessels colored by fluoroscopy, and defect   values of the original image, respectively. The normalization
            type (Thal) (Table 2). The workflow of the machine learning   considered the input values for each attribute obtained
            model for vascular disease prediction and classification is   using the min-max normalization technique.
            given in Figure 3.
                                                                 The normalization ensures that the values stay in the
              However, machine learning is a method of analyzing data   range of 0 – 1 to prevent a surge to higher values. Min-
            samples and drawing main conclusions using mathematical   max  normalization  technique  was  used  because  of  the
            and statistical approaches, which allows machines to learn   NN activation function with a recommended range of
            without having to be programmed. The collected data   0.1 – 0.9 to avoid saturation. The data pre-processed was
            were pre-processed using the data imputation of the KNN   then divided into two: a training set of 70% and a testing
            method to fill in the missing values and data normalization   set of 30% ratio. The training allows the NN to develop
            before the classification process. The attribute average   a relationship between the input data and the target. The
            (mean) for a column with a missing value was calculated   testing set was used to test how efficiently and accurately
            and this was used to fill in the missing values. This technique   the NN can predict the target. The number of attributes
            is direct and simple to implement using Equation II.  was reduced from 14, 10, and 8, and also, GA was used for

               X −1  =  x − min( )x                    (II)    data selection. Furthermore, the number of datasets varied
                                                               from 2000, 1500, 1000, and 500, and their performance
                    max ( ) − min( )x  x                       was evaluated.
              where x  is the normalized size, x is the original value,   A multilayer feed-forward back propagation NN was
                     1
            and min(x) and max(x) are the minimum and maximum   used with 14, 10, and 8 input layers, various hidden layers,

            Table 2. CVD dataset attributes
            Key attribute                                 Patient attribute ID
            Age                                           Year
            Sex                                           Value 1: Male; value 0: Female
            Chest pain type                               Value 1: Typical type 1 angina; value 2: Typical type angina;
                                                          value 3: Non-angina pain; value 4: Asymptomatic
            Fasting blood sugar                           Value 1: >120 mg/dl; value 0: <120 mg/dl
            Resting electrographic results (Restecg)      Value 0: Normal; value 1: Having ST-T wave abnormality;
                                                          value 2: Showing probable or definite left ventricular hypertrophy
            Exercise-induced angina (Exang)               Value 1: Yes; value 0: No
            Slope of the peak exercise ST segment (Slope)  Value 1: Unsloping; value 2: Flat; value 3: Down-sloping
            Number of major vessels colored by fluoroscopy (CA)  Value 0–3
            Defect type (Thal)                            Value 3: Normal; value 6: Fixed defect; value 7: Reversible defect
            Resting blood pressure                        The blood pressure of patients admitted to the hospital was measured in mmHg
            Serum cholesterol                             Measured in mg/dl
            Thalach                                       Maximum heart rate achieved
            Oldpeak – ST                                  ST depression induced by exercise
            Abbreviation: CVD: Cardiovascular disease.














            Figure 3. The overview of the machine learning model for vascular disease prediction.


            Volume 1 Issue 1 (2024)                         47                        https://doi.org/10.36922/aih.1746
   48   49   50   51   52   53   54   55   56   57   58