Draft:Odds ratio for a matched case-control study

  • Comment: Not enough significant, independent coverage. WikiOriginal-9 (talk) 22:08, 5 November 2023 (UTC)

Odds Ratio for a Matched Case-Control Study edit

A case-control study involves selecting representative samples of cases and controls who do, and do not, have some disease, respectively. These samples are usually independent of each other. The prior prevalence of exposure to some risk factor is observed in subjects from both samples. This permits the estimation of the odds ratio for disease in exposed vs. unexposed people.[1]  Sometimes, however, it makes sense to match cases to controls on one or more confounding variables.[2] In this case, the prior exposure of interest is determined for each case and her/his matched control. The data can be summarized in the following table.

Matched 2x2 Table edit

Case-control pairs Control exposed Control unexposed
Case exposed
Case unexposed

This table gives the exposure status of the matched pairs of subjects. There are pairs where both both the case and her/his matched control were exposed, pairs where the case patient was exposed but the control subject was not, pairs where the control subject was exposed but the case patient was not, and pairs were neither subject was exposed. The exposure of matched case and control pairs is correlated due to the similar values of their shared confounding variables.

We consider each pair as belonging to a stratum with identical values of the confounding variables. Conditioned on belonging to the same stratum, the exposure status of cases and controls are independent of each other. For any case-control pair within the same stratum let

be the probability that a case patient is exposed,

be the probability that a control patient is exposed,

be the probability that a case patient is not exposed, and

be the probability that a control patient is not exposed.

Then the probability that a case is exposed and a control is not is , and the probability that a control is exposed and a case in not is . The within-stratum odds ratio for exposure in cases relative to controls is

We assume that is constant across strata.[2]

Now concordant pairs in which either both the case and the control are exposed, or neither are exposed tell us nothing about the odds of exposure in cases relative to the odds of exposure among controls. The probability that the case is exposed and the control is not given that the pair is discordant is

The distribution of given the number of discordant pairs is binomial  ~  B and the maximum likelihood estimate of is

Multiplying both sides of this equation by and subtracting gives

and hence

Under the null hypothesis that .

Hence, we can test the null hypothesis that by testing the null hypothesis that . This is done using McNemar's test.

There are a number of ways to calculate a confidence interval for . Let and denote the lower and upper bound of a confidence interval for , respectively. Since , the corresponding confidence interval for is

.

Example edit

McEvoy et al. [3] studied the use of cell phones by drivers as a risk factor for automobile crashes in a case-crossover study.[1] All study subjects were involved in an automobile crash requiring hospital attendance. Each driver's cell phone use at the time of her/his crash was compared to her/his cell phone use in a control interval at the same time of day one week earlier. We would expect that a person's cell phone use at the time of the crash would be correlated with his/her use one week earlier. Comparing usage during the crash and control intervals adjusts for driver's characteristics and the time of day and day of the week. The data can be summarized in the following table.

Case-control pairs Phone used in control interval Phone not used in control interval
Phoned used in crash interval 5 27
Phone not used in crash interval 6 288

There were 5 drivers who used their phones in both intervals, 27 who used them in the crash but not the control interval, 6 who used them in the control but not the crash interval, and 288 who did not use them in either interval. The odds ratio for crashing while using their phone relative to driving when not using their phone was

 .

Testing the null hypothesis that   is the same as testing the null hypothesis that   given 27 out of 33 discordant pairs in which the driver was using her/his phone at the time of his crash. McNemar's  . This statistic has one degree of freedom and yields a P value of 0.0003. This allows us to reject the hypothesis that cell phone use has no effect on the risk of automobile crashes ( ) with a high level of statistical significance.

Using Wilson's method, a 95% confidence interval for   is (0.6561, 0.9139). Hence, a 95% confidence interval for   is

 

(McEvoy et al.[3] analyzed their data using conditional logistic regression and obtained almost identical results to those given here. See the last row of Table 3 in their paper.)

References edit

  1. ^ a b Celentano DD, Szklo M, Gordis L (2019). Gordis Epidemiology, Sixth Edition. Philadelphia, PA: Elsevier. p. 149-177.
  2. ^ a b Breslow, NE, Day, NE (1980). Statistical Methods in Cancer Research: Vol. 1 - The Analysis of Case-Control Studies. Lyon, France: IARC Scientific Publications. p. 162-189.
  3. ^ a b McEvoy SP, Stevenson MR, McCartt AT, Woodward M, Haworth C, Palamara P, et al. (2005). "Role of mobile phones in motor vehicle crashes resulting in hospital attendance: a case-crossover study". BMJ. 331 (7514): 428. doi:10.1136/bmj.38537.397512.55. PMC 1188107. PMID 16012176.