The STROBE reporting guideline for writing up observational studies in epidemiology

Erik Elm; Douglas G. Altman; Matthias Egger; Stuart J. Pocock; Peter C. Gøtzsche; Jan P. Vandenbroucke

doi:10.1234/equator/1010101

12a. Statistical methods

What to write

Describe all statistical methods, including those used to control for confounding.

Explanation

In general, there is no one correct statistical analysis but, rather, several possibilities that may address the same question, but make different assumptions. Regardless, investigators should pre-determine analyses at least for the primary study objectives in a study protocol. Often additional analyses are needed, either instead of, or as well as, those originally envisaged, and these may sometimes be motivated by the data. When a study is reported, authors should tell readers whether particular analyses were suggested by data inspection. Even though the distinction between pre-specified and exploratory analyses may sometimes be blurred, authors should clarify reasons for particular analyses.

If groups being compared are not similar with regard to some characteristics, adjustment should be made for possible confounding variables by stratification or by multivariable regression (see 12a. Statistical methods)¹. Often, the study design determines which type of regression analysis is chosen. For instance, Cox proportional hazard regression is commonly used in cohort studies². whereas logistic regression is often the method of choice in case-control studies^3,4. Analysts should fully describe specific procedures for variable selection and not only present results from the final model^5,6. If model comparisons are made to narrow down a list of potential confounders for inclusion in a final model, this process should be described. It is helpful to tell readers if one or two covariates are responsible for a great deal of the apparent confounding in a data analysis. Other statistical analyses such as imputation procedures, data transformation, and calculations of attributable risks should also be described. Non-standard or novel approaches should be referenced and the statistical software used reported. As a guiding principle, we advise statistical methods be described “with enough detail to enable a knowledgeable reader with access to the original data to verify the reported results”⁷.

In an empirical study, only 93 of 169 articles (55%) reporting adjustment for confounding clearly stated how continuous and multi-category variables were entered into the statistical model⁸. Another study found that among 67 articles in which statistical analyses were adjusted for confounders, it was mostly unclear how confounders were chosen⁹.

Confounding

Confounding literally means confusion of effects. A study might seem to show either an association or no association between an exposure and the risk of a disease. In reality, the seeming association or lack of association is due to another factor that determines the occurrence of the disease but that is also associated with the exposure. The other factor is called the confounding factor or confounder. Confounding thus gives a wrong assessment of the potential ‘causal’ association of an exposure. For example, if women who approach middle age and develop elevated blood pressure are less often prescribed oral contraceptives, a simple comparison of the frequency of cardiovascular disease between those who use contraceptives and those who do not, might give the wrong impression that contraceptives protect against heart disease.

Investigators should think beforehand about potential confounding factors. This will inform the study design and allow proper data collection by identifying the confounders for which detailed information should be sought. Restriction or matching may be used. In the example above, the study might be restricted to women who do not have the confounder, elevated blood pressure. Matching on blood pressure might also be possible, though not necessarily desirable (see 6b. Matching criteria). In the analysis phase, investigators may use stratification or multivariable analysis to reduce the effect of confounders. Stratification consists of dividing the data in strata for the confounder (e.g., strata of blood pressure), assessing estimates of association within each stratum, and calculating the combined estimate of association as a weighted average over all strata. Multivariable analysis achieves the same result but permits one to take more variables into account simultaneously. It is more flexible but may involve additional assumptions about the mathematical form of the relationship between exposure and disease.

Taking confounders into account is crucial in observational studies, but readers should not assume that analyses adjusted for confounders establish the ‘causal part’ of an association. Results may still be distorted by residual confounding (the confounding that remains after unsuccessful attempts to control for it¹⁰), random sampling error, selection bias and information bias (see 9. Bias ).

Examples

“The adjusted relative risk was calculated using the Mantel-Haenszel technique, when evaluating if confounding by age or gender was present in the groups compared. The 95% confidence interval (CI) was computed around the adjusted relative risk, using the variance according to Greenland and Robins and Robins et al.”¹¹.

Training

The UK EQUATOR Centre runs training on how to write using reporting guidelines.

Discuss this item

Visit this items’ discussion page to ask questions and give feedback.

References

1.

Controlling for continuous confounding factors: Non- and semiparametric approaches. 2005;53.

2.

Greenland s1998 introduction to regression modelling (chapter 21). In:rothman KJgreenland smodern epidemiology 2nd ed lippincott raven 401 432.

3.

Thompson WD. Statistical analysis of case-control studies. Epidemiologic Reviews. 1994;16(1):33-50. doi:10.1093/oxfordjournals.epirev.a036143

4.

Schlesselman JJ1982 logistic regression for case-control studies (chapter 8.2). Case-control studies design, conduct, analysis new york, oxford oxford university press 235 241.

5.

Clayton dhills m1993 choice and interpretation of models (chapter 27). Statistical models in epidemiology oxford oxford university press 271 281.

6.

Altman DG, Gore SM, Gardner MJ, Pocock SJ. Statistical guidelines for contributors to medical journals. BMJ. 1983;286(6376):1489-1493. doi:10.1136/bmj.286.6376.1489

7.

New England Journal of Medicine. 1997;336(4):309-316. doi:10.1056/nejm199701233360422

8.

Müllner M, Matthews H, Altman DG. Reporting on statistical methods to adjust for confounding: A cross-sectional survey. Annals of Internal Medicine. 2002;136(2):122-126. doi:10.7326/0003-4819-136-2-200201150-00009

9.

Pocock SJ, Collier TJ, Dandreo KJ, et al. Issues in the reporting of epidemiological studies: A survey of recent practice. BMJ. 2004;329(7471):883. doi:10.1136/bmj.38250.571088.55

10.

Olsen J, Basso O. RE: RESIDUAL CONFOUNDING. American Journal of Epidemiology. 1999;149(3):290-290. doi:10.1093/oxfordjournals.aje.a009805

11.

Berglund A, Alfredsson L, Cassidy JD, Jensen I, Nygren Å. The association between exposure to a rear-end collision and future neck or shoulder pain: Journal of Clinical Epidemiology. 2000;53(11):1089-1094. doi:10.1016/s0895-4356(00)00225-0

Citation

For attribution, please cite this work as:

von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP. The STROBE reporting guideline for writing up observational studies in epidemiology. The EQUATOR Network guideline dissemination platform. doi:10.1234/equator/1010101

12a. Statistical methods

What to write

Explanation

Confounding

Examples

Training

Discuss this item

References

Citation

Reporting Guidelines are recommendations to help describe your work clearly

Who reads research?

Cohort studies

Case-control studies

Cross-sectional studies

Systematic reviews

Systematic review protocols

TODO

Meta analyses of Observational Studies

TODO

Randomised Trials

Randomised Trial Protocols

TODO

Qualitative research

Case Reports

TODO

Diagnostic Test Accuracy Studies

Prediction Models

Animal Research

TODO

Quality Improvement in Healthcare

Economic Evaluations in Healthcare

TODO

Meta Analyses

How Meta-analyses and Systematic Reviews Work Together

Why Don't All Systematic Reviews Use a Meta-Analysis?

Protocol

Cohort_studies

Case_control_studies

Cross-sectional_studies