Comparisons of various estimates of the I2 statistic for quantifying between-study heterogeneity in meta-analysis
Yipeng Wang, Natalie DelRocco, Lifeng Lin- Health Information Management
- Statistics and Probability
- Epidemiology
Assessing heterogeneity between studies is a critical step in determining whether studies can be combined and whether the synthesized results are reliable. The [Formula: see text] statistic has been a popular measure for quantifying heterogeneity, but its usage has been challenged from various perspectives in recent years. In particular, it should not be considered an absolute measure of heterogeneity, and it could be subject to large uncertainties. As such, when using [Formula: see text] to interpret the extent of heterogeneity, it is essential to account for its interval estimate. Various point and interval estimators exist for [Formula: see text]. This article summarizes these estimators. In addition, we performed a simulation study under different scenarios to investigate preferable point and interval estimates of [Formula: see text]. We found that the Sidik–Jonkman method gave precise point estimates for [Formula: see text] when the between-study variance was large, while in other cases, the DerSimonian–Laird method was suggested to estimate [Formula: see text]. When the effect measure was the mean difference or the standardized mean difference, the [Formula: see text]-profile method, the Biggerstaff–Jackson method, or the Jackson method was suggested to calculate the interval estimate for [Formula: see text] due to reasonable interval length and more reliable coverage probabilities than various alternatives. For the same reason, the Kulinskaya–Dollinger method was recommended to calculate the interval estimate for [Formula: see text] when the effect measure was the log odds ratio.