Page 129 - MI-2-3
P. 129

Microbes & Immunity                                                  Statistical modeling of COVID-19 trends


































            Figure 6. Comparison between ARIMA and ARIMAX models for the first forecast period
            Abbreviations:  ARIMA:  Autoregressive  integrated  moving  average;  ARIMAX:  Autoregressive  integrated  moving  average  with  exogenous  variables;
            CI: Confidence interval; USA: United States of America.

            Table 5. Comparison between ARIMA and ARIMAX models   may capture nonlinear relationships between GDP per
            for the first forecast period                      capita and infection rate. Collectively, these correlation
                                                               measures suggest that higher GDP per capita is associated
            Model        AIC         RMSE          MAE
                                                               with increased infection rates, although other factors likely
            ARIMA       1,919.556   4,082,257    3,063,789     contribute to the remaining unexplained variance.
            ARIMAX      1,919.935   4,011,124    3,004,951
            Abbreviations: AIC: Akaike information criterion;  ARIMA:   The relatively low infection rates observed among low-
            autoregressive integrated moving average; ARIMAX: autoregressive   GDP per capita countries may reflect underreporting due
            integrated moving average with exogenous variables; MAE: Mean   to limited testing capacity rather than true differences in
            absolute error; RMSE: Root mean squared error.     transmission. Testing data is sparse and inconsistent across
                                                               countries, especially in low-income regions, making it
            Figure 7. The analysis (Table 6) demonstrates a statistically   difficult to correct this effect quantitatively. Nonetheless,
            significant positive relationship between GDP per capita   this under-detection is a plausible contributor to the
            and infection rate, with the regression coefficient for GDP   observed pattern.
            per capita being positive and highly significant (p<2×10 ).
                                                       −16
                                                                                       2
            This suggests that countries with higher GDP per capita   Given the relatively low  R , it is evident that factors
            tend to have higher reported infection rates. However, the   beyond GDP per capita may influence infection rates.
            model yields a relatively low R  value of 0.4763, suggesting   Therefore, the model was expanded to incorporate
                                    2
            that GDP per capita accounts for approximately 47.63% of   additional variables that could plausibly affect infection
            the variance in infection rates.                   rates, including HDI, Gini coefficient, health expenditure
                                                               per capita, the number of hospital beds per 1,000 people,
              In addition to the regression analysis, three correlation   and population density. The resulting multivariate
            metrics were calculated to further assess the relationship   regression model incorporated both main effects and
            between GDP per capita and COVID-19 infection rate.   interaction terms among these variables.
            Pearson’s correlation coefficient yields a value of 0.6902,
            which  indicates  a moderately  strong positive linear   The analysis reveals a more complex relationship
            relationship between the two variables. Spearman’s rank   between the predictors and the infection rate. While GDP
            correlation  coefficient  is  higher,  at  0.8593,  suggesting  a   per capita remains a significant factor (p=0.0065), other
            strong monotonic relationship. Additionally, the MIC   variables  like  health expenditure  and certain  interaction
            yields a value of 0.7256, reflecting a strong association that   terms also emerge as significant predictors.


            Volume 2 Issue 3 (2025)                        121                           doi: 10.36922/MI025040007
   124   125   126   127   128   129   130   131   132   133   134