The STROBE reporting guideline for writing up observational studies in epidemiology

Erik Elm; Douglas G. Altman; Matthias Egger; Stuart J. Pocock; Peter C. Gøtzsche; Jan P. Vandenbroucke

doi:10.1234/equator/1010101

6b. Matching criteria

What to write

Cohort study: For matched studies, give matching criteria and number of exposed and unexposed.

Case-control study: For matched studies, give matching criteria and the number of controls per case.

Explanation

Matching is much more common in case-control studies, but occasionally, investigators use matching in cohort studies to make groups comparable at the start of follow-up. Matching in cohort studies makes groups directly comparable for potential confounders and presents fewer intricacies than with case-control studies. For example, it is not necessary to take the matching into account for the estimation of the relative risk¹. Because matching in cohort studies may increase statistical precision investigators might allow for the matching in their analyses and thus obtain narrower confidence intervals.

In case-control studies matching is done to increase a study’s efficiency by ensuring similarity in the distribution of variables between cases and controls, in particular the distribution of potential confounding variables^1,2. Because matching can be done in various ways, with one or more controls per case, the rationale for the choice of matching variables and the details of the method used should be described. Commonly used forms of matching are frequency matching (also called group matching) and individual matching. In frequency matching, investigators choose controls so that the distribution of matching variables becomes identical or similar to that of cases. Individual matching involves matching one or several controls to each case. Although intuitively appealing and sometimes useful, matching in case-control studies has a number of disadvantages, is not always appropriate, and needs to be taken into account in the analysis (see below).

Even apparently simple matching procedures may be poorly reported. For example, authors may state that controls were matched to cases ‘within five years’, or using ‘five year age bands’. Does this mean that, if a case was 54 years old, the respective control needed to be in the five-year age band 50 to 54, or aged 49 to 59, which is within five years of age 54? If a wide (e.g., 10-year) age band is chosen, there is a danger of residual confounding by age (see below), for example because controls may then be younger than cases on average.

Matching in case-control studies

In any case-control study, sensible choices need to be made on whether to use matching of controls to cases, and if so, what variables to match on, the precise method of matching to use, and the appropriate method of statistical analysis. Not to match at all may mean that the distribution of some key potential confounders (e.g., age, sex) is radically different between cases and controls. Although this could be adjusted for in the analysis there could be a major loss in statistical efficiency.

The use of matching in case-control studies and its interpretation are fraught with difficulties, especially if matching is attempted on several risk factors, some of which may be linked to the exposure of prime interest^3,4. For example, in a case-control study of myocardial infarction and oral contraceptives nested in a large pharmaco-epidemiologic data base, with information about thousands of women who are available as potential controls, investigators may be tempted to choose matched controls who had similar levels of risk factors to each case of myocardial infarction. One objective is to adjust for factors that might influence the prescription of oral contraceptives and thus to control for confounding by indication. However, the result will be a control group that is no longer representative of the oral contraceptive use in the source population: controls will be older than the source population because patients with myocardial infarction tend to be older. This has several implications. A crude analysis of the data will produce odds ratios that are usually biased towards unity if the matching factor is associated with the exposure. The solution is to perform a matched or stratified analysis (see 12dii. Statistical methods – matching cases and controls). In addition, because the matched control group ceases to be representative for the population at large, the exposure distribution among the controls can no longer be used to estimate the population attributable fraction (see 16c. Main results – risk#measures-of-association)⁵. Also, the effect of the matching factor can no longer be studied, and the search for well-matched controls can be cumbersome – making a design with a non-matched control group preferable because the non-matched controls will be easier to obtain and the control group can be larger. Overmatching is another problem, which may reduce the efficiency of matched case-control studies, and, in some situations, introduce bias. Information is lost and the power of the study is reduced if the matching variable is closely associated with the exposure. Then many individuals in the same matched sets will tend to have identical or similar levels of exposures and therefore not contribute relevant information. Matching will introduce irremediable bias if the matching variable is not a confounder but in the causal pathway between exposure and disease. For example, in vitro fertilization is associated with an increased risk of perinatal death, due to an increase in multiple births and low birth weight infants⁶. Matching on plurality or birth weight will bias results towards the null, and this cannot be remedied in the analysis.

Matching is intuitively appealing, but the complexities involved have led methodologists to advise against routine matching in case-control studies. They recommend instead a careful and judicious consideration of each potential matching factor, recognizing that it could instead be measured and used as an adjustment variable without matching on it. In response, there has been a reduction in the number of matching factors employed, an increasing use of frequency matching, which avoids some of the problems discussed above, and more case-control studies with no matching at all⁷. Matching remains most desirable, or even necessary, when the distributions of the confounder (e.g., age) might differ radically between the unmatched comparison groups^1,2.

Grouping

There are several reasons why continuous data may be grouped⁸. When collecting data it may be better to use an ordinal variable than to seek an artificially precise continuous measure for an exposure based on recall over several years. Categories may also be helpful for presentation, for example to present all variables in a similar style, or to show a dose-response relationship.

Grouping may also be done to simplify the analysis, for example to avoid an assumption of linearity. However, grouping loses information and may reduce statistical power⁹ especially when dichotomization is used^10–12. If a continuous confounder is grouped, residual confounding may occur, whereby some of the variable’s confounding effect remains unadjusted for (see 12a. Statistical methods)^13,14. Increasing the number of categories can diminish power loss and residual confounding, and is especially appropriate in large studies. Small studies may use few groups because of limited numbers.

Investigators may choose cut-points for groupings based on commonly used values that are relevant for diagnosis or prognosis, for practicality, or on statistical grounds. They may choose equal numbers of individuals in each group using quantiles¹⁵. On the other hand, one may gain more insight into the association with the outcome by choosing more extreme outer groups and having the middle group(s) larger than the outer groups¹⁶. In case-control studies, deriving a distribution from the control group is preferred since it is intended to reflect the source population. Readers should be informed if cut-points are selected post hoc from several alternatives. In particular, if the cut-points were chosen to minimise a P value the true strength of an association will be exaggerated¹⁷.

When analysing grouped variables, it is important to recognise their underlying continuous nature. For instance, a possible trend in risk across ordered groups can be investigated. A common approach is to model the rank of the groups as a continuous variable. Such linearity across group scores will approximate an actual linear relation if groups are equally spaced (e.g., 10 year age groups) but not otherwise. Il’yasova et al¹⁸. recommend publication of both the categorical and the continuous estimates of effect, with their standard errors, in order to facilitate meta-analysis, as well as providing intrinsically valuable information on dose-response. One analysis may inform the other and neither is assumption-free. Authors often ignore the ordering and consider the estimates (and P values) separately for each category compared to the reference category. This may be useful for description, but may fail to detect a real trend in risk across groups. If a trend is observed, a confidence interval for a slope might indicate the strength of the observation.

Examples

Cohort study

“For each patient who initially received a statin, we used propensity-based matching to identify one control who did not receive a statin according to the following protocol. First, propensity scores were calculated for each patient in the entire cohort on the basis of an extensive list of factors potentially related to the use of statins or the risk of sepsis. Second, each statin user was matched to a smaller pool of non-statin-users by sex, age (plus or minus 1 year), and index date (plus or minus 3 months). Third, we selected the control with the closest propensity score (within 0.2 SD) to each statin user in a 1:1 fashion and discarded the remaining controls.”¹⁹.

Case-control study

“We aimed to select five controls for every case from among individuals in the study population who had no diagnosis of autism or other pervasive developmental disorders (PDD) recorded in their general practice record and who were alive and registered with a participating practice on the date of the PDD diagnosis in the case. Controls were individually matched to cases by year of birth (up to 1 year older or younger), sex, and general practice. For each of 300 cases, five controls could be identified who met all the matching criteria. For the remaining 994, one or more controls was excluded...”²⁰.

Training

The UK EQUATOR Centre runs training on how to write using reporting guidelines.

Discuss this item

Visit this items’ discussion page to ask questions and give feedback.

References

1.

Costanza MC. Matching. Preventive Medicine. 1995;24(5):425-433. doi:10.1006/pmed.1995.1069

2.

Sturmer T. Flexible matching strategies to increase power and efficiency to detect and estimate gene-environment interactions in case-control studies. American Journal of Epidemiology. 2002;155(7):593-602. doi:10.1093/aje/155.7.593

3.

Rothman KJgreenland s1998 matching. In:rothman KJgreenland s2nd ed. Modern epidemiology lippincott raven 147 161.

4.

Szklo MFnieto j2000 epidemiology, beyond the basics sudbury (MA) jones and bartlett 40 51.

5.

Attributable risk percent in case-control studies. 1971;25.

6.

Gissler M, Hemminki E. The danger of overmatching in studies of the perinatal mortality and birthweight of infants born after assisted conception. European Journal of Obstetrics & Gynecology and Reproductive Biology. 1996;69(2):73-75. doi:10.1016/0301-2115(95)02517-0

7.

Gefeller O, Pfahlberg A, Brenner H, Windeler J. European Journal of Epidemiology. 1998;14(4):321-325. doi:10.1023/a:1007497104800

8.

Altman DG. Categorizing continuous variables. Encyclopedia of Biostatistics. Published online February 2005. doi:10.1002/0470011815.b2a10012

9.

Cohen J. The cost of dichotomization. Applied Psychological Measurement. 1983;7(3):249-253. doi:10.1177/014662168300700301

10.

Royston P, Altman DG, Sauerbrei W. Dichotomizing continuous predictors in multiple regression: A bad idea. Statistics in Medicine. 2005;25(1):127-141. doi:10.1002/sim.2331

11.

MacCallum RC, Zhang S, Preacher KJ, Rucker DD. On the practice of dichotomization of quantitative variables. Psychological Methods. 2002;7(1):19-40. doi:10.1037/1082-989x.7.1.19

12.

Zhao LP, Kolonel LN. Efficiency loss from categorizing quantitative exposures into qualitative exposures in case-control studies. American Journal of Epidemiology. 1992;136(4):464-474. doi:10.1093/oxfordjournals.aje.a116520

13.

Becher H. The concept of residual confounding in regression models and some applications. Statistics in Medicine. 1992;11(13):1747-1758. doi:10.1002/sim.4780111308

14.

Cochran WG. The effectiveness of adjustment by subclassification in removing bias in observational studies. Biometrics. 1968;24(2):295. doi:10.2307/2528036

15.

Clayton dhills m1993 models for dose-response (chapter 25). Statistical models in epidemiology oxford oxford university press 249 260.

16.

Cox DR. Note on grouping. Journal of the American Statistical Association. 1957;52(280):543-547. doi:10.1080/01621459.1957.10501411

17.

Altman DG, Lausen B, Sauerbrei W, Schumacher M. Dangers of using “optimal” cutpoints in the evaluation of prognostic factors. JNCI Journal of the National Cancer Institute. 1994;86(11):829-835. doi:10.1093/jnci/86.11.829

18.

Il’yasova D, Hertz-Picciotto I, Peters U, Berlin JA, Poole C. Choice of exposure scores for categorical regression in meta-analysis: A case study of a common problem. Cancer Causes & Control. 2005;16(4):383-388. doi:10.1007/s10552-004-5025-x

19.

Hackam DG, Mamdani M, Li P, Redelmeier DA. Statins and sepsis in patients with cardiovascular disease: A population-based cohort analysis. The Lancet. 2006;367(9508):413-418. doi:10.1016/s0140-6736(06)68041-0

20.

Smeeth L, Cook C, Fombonne E, et al. MMR vaccination and pervasive developmental disorders: A case-control study. The Lancet. 2004;364(9438):963-969. doi:10.1016/s0140-6736(04)17020-7

Citation

For attribution, please cite this work as:

von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP. The STROBE reporting guideline for writing up observational studies in epidemiology. The EQUATOR Network guideline dissemination platform. doi:10.1234/equator/1010101

What to write

Explanation

Matching in case-control studies

Grouping

Examples

Cohort study

Case-control study

Training

Discuss this item

References

Citation

Reporting Guidelines are recommendations to help describe your work clearly

Who reads research?

Cohort studies

Case-control studies

Cross-sectional studies

Systematic reviews

Systematic review protocols

TODO

Meta analyses of Observational Studies

TODO

Randomised Trials

Randomised Trial Protocols

TODO

Qualitative research

Case Reports

TODO

Diagnostic Test Accuracy Studies

Prediction Models

Animal Research

TODO

Quality Improvement in Healthcare

Economic Evaluations in Healthcare

TODO

Meta Analyses

How Meta-analyses and Systematic Reviews Work Together

Why Don't All Systematic Reviews Use a Meta-Analysis?

Protocol

Cohort_studies

Case_control_studies

Cross-sectional_studies