Page 285 - GHES-3-3
P. 285

Global Health Economics and
            Sustainability
                                                                             Carbon footprint of smartphones in healthcare


            make decisions that align with responsible management of   LLMs are trained on public data, reinforcement learning,
            the environment. As healthcare systems around the world   and the internet, they should have answers to our queries.
            strive for sustainability, it is essential to comprehend all   We then compare the actual data from the CSR reports to
            the elements that affect the environment, including the   the LLM answers.
            digital tools that physicians and patients use on a daily   The LLM query is: “What is the average carbon footprint
            basis. This study is intended for physicians and healthcare   of the entire lifecycle of (smartphone model)?”  These
            professionals, aiming to contribute to the growing body of   queries were conducted for flagship devices, including the
            knowledge about smartphones and their carbon footprint.
                                                               Apple iPhone 14, 15, and 16 series, as well as the Samsung
            2. Methods                                         Galaxy S24 series, Z Flip6, and Z Fold6.

            We chose Apple and Samsung smartphones since they    In addition, we asked the same queries to smart home
            are the most popular smartphones in terms of sales and   devices to determine if they produced any emissions as
            shipment market share in the United States (Counterpoint   individual household assistants. We conducted the same test
                                                                                    nd
            Research, 2025). Carbon emissions data for flagship   described above using the 2  generation of Apple HomePod
            smartphones from Apple and Samsung were obtained   and the latest versions of Google Home and Echo.
            from publicly available sources, including CSR reports,   The LLM responses were compared directly to the
            official  corporate  websites,  and  product  environmental   emissions data reported by Apple and Samsung. If the LLM
            reports (Apple Inc., 2024; Samsung Electronics, 2024).  model failed to provide an answer or explained that the
              The designation of “actual emissions data” was based   data were publicly available, it was noted as a missing data
            on the corporate materials of each company. For Apple   point. To quantify the accuracy of each model, the percent
            devices, we obtained the carbon footprint values directly   error was calculated using Equation II.
            from Apple’s official product environmental reports (Apple
            Inc., 2025), which provided detailed lifecycle emissions   Percenterror=  AIvalue-actualvalue  ×100  (II)
            in kg CO  emitted for each model that we used to test              Actualvalue
                    2
            our LLMs. Similarly, for Samsung devices, we referred
            to the official Galaxy Environmental Reports (Samsung   Error rates were analyzed for each LLM across all
            Electronics, 2025), which documented carbon emissions   queried devices, with a particular focus on patterns of
            across the whole product lifecycle.                overestimation or underestimation. This approach allowed
                                                               us  to  assess  the  reliability  and  consistency  of  LLMs  in
              To evaluate the accuracy of AI-generated data from
            the chatbots, we posed the same question to four LLMs:   estimating smartphone carbon footprints.
            ChatGPT-4.0, Gemini 1.5 Flash, Claude.ai, and Meta AI.   3. Results
            ChatGPT-4.0 was chosen because it has the most daily
            queries (570,000), while Gemini was chosen because it is   The analysis revealed variabilities in the accuracy of LLMs’
            linked to Google with 8.5 billion daily searches. Claude.  carbon emissions data compared to manufacturer-reported
            ai was chosen because it uses “constitutional AI,” meaning   values.  Responses  from  ChatGPT-4.0,  Gemini,  and
            that once humans set the algorithm, the AI trains itself.   Claude.ai frequently align closely with actual emissions for
            Meta AI uses LLM Meta AI, which is a different AI   Apple models (Table 1), such as the the iPhone 14 (61 kg
            algorithm from ChatGPT or Gemini. These models were   CO ) reported by the three AI-LLMs vs. 61 kg CO₂ actual.
                                                                  2
            built using distinct development pipelines, including   Answers given by ChatGPT and Gemini for the Samsung
            different architectures, data, and training methodologies,   models were mostly accurate, with only minor errors.
            which resulted in each LLM being unique on its own. We   However, Claude.ai’s response was less reliable and showed
            omitted Co-pilot since OpenAI designed both ChatGPT   higher percent errors.
            and Co-Pilot, and they use the same AI architecture.  For queries to Meta AI, the LLM often failed to provide
              We examined Apple HomePod, Google Home, and      definitive answers for iPhone and Samsung devices
            Echo, but only Apple provided CSR literature on its website   (Tables 1-3), citing that the information was “not publicly
            or 10-K statements. The latest versions of Google Home   disclosed” or recommending referral to official product
            and Echo did not have any current CSR data available to   environmental  reports.  For  Table  1,  there  is very little
            the public. We used the same query for the smart home   information available on the company’s 10-K statements
            devices.                                           or CSR report for any more information.
              We focused on smartphones to highlight the companies’   Overall, the average percent errors varied significantly
            transparency in revealing their products’ emissions. Since   among models. ChatGPT-4.0 demonstrated lower average


            Volume 3 Issue 3 (2025)                        277                       https://doi.org/10.36922/ghes.8359
   280   281   282   283   284   285   286   287   288   289   290