Page 147 - GHES-3-2
P. 147

Global Health Economics and
            Sustainability
                                                                             Vaccine hesitancy in the US, India, and China


            (i)  What is the level of vaccine hesitancy among the three   BLR. The theory behind MLR includes the following key
               largest countries by population size: the US, India, and   components:
               China?
            (ii)  What factors are associated with vaccine hesitancy in   3.1.1. Logistic regression
               these countries?                                The MLR builds on the principles of BLR. The BLR models
            (iii) How can cross-sectional and longitudinal data be   the relationship between a set of predictor variables and
               utilized to study vaccine hesitancy?            a binary outcome variable. It uses the logistic function
            (iv)  Which subgroups require further research on vaccine   to estimate the probability of the outcome being in one
               hesitancy?                                      category versus the other. In MLR, this concept is extended

              To address these questions, we analyze two datasets   to multiple outcome categories.
            on vaccine hesitancy. The first dataset is extracted from   3.1.2. Categorical outcome variable
            the ICPSR COVID-19 database (https://doi.org/10.3886/
            E130422V1) and includes cross-sectional survey data   In  MLR,  the  outcome  variable  is categorical, with  three
            assessing the prevalence of vaccine hesitancy in the US,   or more unordered categories. Each category represents a
            India, and China. The second dataset is derived from the   distinct and mutually exclusive outcome. Examples could
            HPS data.                                          include predicting the choice of a political party (e.g.,
                                                               democrat, republican, and independent) or predicting the
              For the ICPSR dataset, we report proportions and   type of vehicle chosen (e.g., car, truck, SUV).
            summary statistics to give an overview of the vaccine
            hesitancy global picture. The HPS dataset was analyzed   3.1.3. Logits and log odds
            using multinomial and binary logistic regression. When   The probabilities of each category are modeled as the log
            the response or outcome can be categorized into two   odds (logit) of being in that category (Equations I and II).
            classes, such as “Hesitant” and “Not hesitant,” and with   The log odds represent the natural logarithm of the ratio
            several explanatory variables, then a BLR is commonly   of  the  probability  of  being  in a  specific  category  to  the
            used. If there are more than two categories, then a natural   probability of being in a reference category. The reference
            extension of BLR is MLR. Individual Chi-square tests   category is one of the categories used as the baseline for
            of independence between vaccine hesitancy and health   comparison.
            categories and exploratory data analysis supplemented
            and helped in our understanding of the causal factors   3.1.4. Parameter estimation
            influencing vaccine  hesitancy.  Rstudio (RStudio|Open
            Source & Professional Software for Data Science Teams-  The estimates of the parameters of an MLR model are
            RStudio, n.d.) and Microsoft Excel were utilized for the   obtained using maximum likelihood estimation. The goal
            analysis. Figure 1 provides a flow diagram from the HPS   is to find the set of parameter values that maximize the
            dataset to develop a logistic model for the analysis.  likelihood of observing the observed data.
            3.1. Model 1                                       3.1.5. Model equations
            MLR is a statistical regression model used to predict   The MLR uses multiple sets of equations to describe
            categorical  outcomes  in  the  form  of  probability  with   the relationship between the predictor variables and the
            more than two unordered categories. It is an extension of   outcome categories. Each equation compares the log odds of
                                                               membership in one category to the log odds of being in the
                                                               reference category. The parameters (coefficients) estimated
                                                               for each predictor represent the change in log odds associated
                                                               with a one-unit change in the predictor variable.
                                                               3.1.6. Model assumptions
                                                               The MLR assumes that the relationship between the
                                                               predictors and the outcome categories is linear on the logit
                                                               scale. It also assumes that the error terms are independent
                                                               and follow a multinomial distribution.

                                                               3.1.7. Model interpretation
                                                               The coefficients in MLR indicate the change in log odds
            Figure 1. Flowchart for vaccine hesitancy data analysis  of being in a specific category as the predictor variables


            Volume 3 Issue 2 (2025)                        139                       https://doi.org/10.36922/ghes.2958
   142   143   144   145   146   147   148   149   150   151   152