Page 35 - JCTR-9-4
P. 35

Shuster | Journal of Clinical and Translational Research 2023; 9(4): 246-252    251
          Shuster [1] reported on a small sample of 32 highly cited past   is a very common occurrence. Note that if Assumption A4 is false,
        meta-analyses that used mainstream methods for relative risk and   every weighted combination is estimating a different overall target
        found major disparities in eight (25%). It is fortunate that this is   population  mean,  and  the  mainstream  analysis  of  the  sequence
        not higher, but this is not good enough for trust in mainstream   will push the overall  estimate  toward targeting the unweighted
        methods.                                                mean in the urn. The key question when looking at the difference
          An analysis of the 31 of these studies reported in Shuster et al. [6]   between the mainstream point estimate and the unweighted point
        and Shuster and Walker [7] (the 32  study’s reanalysis had a few   estimate is whether it is simply sampling error or is it bias induced
                                    nd
        studies added but trended as the 31 we analyzed) also dispel the one   by failure of Assumption A5 (that the presumption that weights
        remaining scientific as opposed to traditional reasons for using the   and effect sizes are uncorrelated is false). There is no way to be
        mainstream: The mainstream might produce on average narrower   sufficiently certain, as there is no adequately powered statistical
        confidence intervals. If you analyze the natural logs of the ratios of   test that can prove the lack of such a correlation.
        the lengths of the confidence intervals (the traditional way to analyze   Note that the phenomenon of seeing two sets of data with the
        non-negative ratios) and treat the studies as a random sample of   same signals but noise level of the second reduced by a common
        highly cited meta-analyses, we obtain an estimate of the population   factor  and  turning  the  result  from  significant  to  not  significant
        ratio of widths (Mainstream: Shuster) of 1.10 (Mainstream wider)   cannot occur in the common statistical methods: t-tests, analysis of
        with 95% confidence interval from 0.93 to 1.29. The mainstream   variance or covariance, regression, logistic regression, frequency
        plausible mean in the total population of such potential reanalysis   tables, or Cox regression (survival analysis).
        ranges from slightly shorter to substantially longer.   11.4. Recommendations for meta-analysis of clinical trials with
          Further, due  to  mainstream  proponents’ concerns  about  its   tabular data
        normal  approximation,  newer versions of CMA have  added  a
        t-option  (degrees of freedom  number of studies less one) that   (1) Use random effects rather than fixed effects; (2) With fewer
        can  be  used  instead  of  the  normal  approximation.  When  we   than five studies, do a Systematic Review, not a formal meta-analysis,
        replaced the normal with the t, the new mainstream methods were   since the large sample distribution of the estimates should not be
        significantly  less  accurate  than  the  survey-based  method.  For   trusted; (3) with 5-20 studies, issue a limitation that the number of
        our sample of 31 meta-analyses, in the log scale, the mainstream   studies is small with a caveat on successful vetting for relative risk;
        averaged 30% wider than the survey-based methods, with 95%   (4) use Shuster [1] until new methods become available; (5) if you
        confidence  interval  from  9%  wider  to  54%  wider.  This  is  yet   have individual patient data, note that the off-label implications for
        another strong reason not to use the mainstream.        the mainstream for tabular data may or may not apply to individual
          Note  that  diagnostic  test  information,  such  as  Cochran’s Q,   patient data. Shuster [1] can still work if any of Assumptions A2,
        I , and Egger’s test for selection bias, as described in Borenstein   A4, and A5 are used in the individual patient analysis, potentially
        2
        et al. [8] is not relevant to the validity of Shuster [1].  endangering its evidence base; (6) a biostatistics group with access
          Shuster  et  al. [6], with a substantial  number  of simulations,   to supercomputers should conduct large simulations along the lines
        found that  when the  target  population  relative  risks in the  two   of Shuster et al. [6] for the robustness of the T-approximation of the
        universes were the same (ratio and equal weighting), the ratio   survey sampling method for differences in means or proportions; (7)
        method  had  an  average  confidence  interval  length  reduction  of   a parallel width of confidence interval comparison on mainstream
        about 10% compared to equal weighting.                  versus  survey  sampling  should  be  done,  and  (8)  if  you  have
                                                                published a meta-analysis that had a substantive influence on public
        11.3. The mainstream’s moving target                    health policy, consider conducting an equal weighted analysis on

          Suppose we have a sequence of meta-analyses  where the   the log of the relative risks or differences in means or proportions to
        urns described for obtaining the true study-specific effect sizes   see if this new analysis supports your original conclusions. If they
        (Assumption A1) are identical, but each member of the sequence   do not, consider writing a report, using the recommended methods
        has  within-study  variances  of  90%  of  the  previous  member  of   of Shuster [1].
        the sequence. Under the mainstream model, all of these meta-  12. Conclusion
        analyses have the same true effect sizes, namely the unweighted
        mean effect size in the urn. The true mainstream variance of the   Based on a reasonable fear gleaned from examples 1 and 2, the
        global effect size estimate is the sum of its between-study variance   continued use of the mainstream methods is threatening to public
        (Same for all members in the urn) and the within-study variance   health interests. In example 1, despite the published mainstream
        (which will shrink toward zero as you get later into the sequence).   inference,  there is no evidence that  a  widely  used invasive
        Thus, the mainstream estimates will become closer and closer to   intervention  is  effective.  Example  2  had  the  survey  sampling
        the unweighted estimator as we get further into the sequence. You   method been available and utilized, and the use of rosiglitazone
        therefore cannot rule out an artifact of where you might be in the   in Type II diabetes would likely have been eliminated far earlier
        sequence for any meta-analysis where the qualitative conclusions   than what occurred, saving a substantial number of cardiac events
        of the mainstream differ from the unweighted (one significant and   from happening. Other similar misjudgments stemming from the
        one not). The Neto et al. [4] example is one case of this, but this   mainstream  methods are all but certain  to occur in the future.
                                           DOI: http://dx.doi.org/10.18053/jctres.09.202304.22-00019
   30   31   32   33   34   35   36   37   38   39   40