Stat Methods Med Res, 2020 · DOI: 10.1177/0962280220928572 · Published: November 1, 2020
This paper addresses the problem of analyzing clustered data where the size of the cluster is related to the outcome of interest, a situation called informative cluster size (ICS). Classical tests don't work well in this situation. The authors focus on categorical data, where outcomes fall into categories. The authors develop new statistical tests that adjust for ICS when dealing with categorical data. These tests are based on reweighting the data to account for the bias introduced by ICS. They also compare different ways of estimating the variance of these tests. Through simulations, the authors show that the choice of variance estimation method significantly impacts the performance of these tests. They find that variance estimators constructed under the null hypothesis perform best.
The developed tests provide more accurate statistical inference for clustered categorical data when informative cluster size is present, reducing potential bias.
The study highlights the importance of variance estimation techniques, suggesting that estimators constructed under the null hypothesis are preferable for cluster-weighted tests.
The methods can be applied to various biomedical research areas where clustered categorical data with informative cluster size is common, such as dental studies and longitudinal patient data.