Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker

Am J Epidemiol. 2004 May 1;159(9):882-90. doi: 10.1093/aje/kwh101.

Abstract

A marker strongly associated with outcome (or disease) is often assumed to be effective for classifying persons according to their current or future outcome. However, for this assumption to be true, the associated odds ratio must be of a magnitude rarely seen in epidemiologic studies. In this paper, an illustration of the relation between odds ratios and receiver operating characteristic curves shows, for example, that a marker with an odds ratio of as high as 3 is in fact a very poor classification tool. If a marker identifies 10% of controls as positive (false positives) and has an odds ratio of 3, then it will correctly identify only 25% of cases as positive (true positives). The authors illustrate that a single measure of association such as an odds ratio does not meaningfully describe a marker's ability to classify subjects. Appropriate statistical methods for assessing and reporting the classification power of a marker are described. In addition, the serious pitfalls of using more traditional methods based on parameters in logistic regression models are illustrated.

Publication types

  • Research Support, U.S. Gov't, P.H.S.
  • Review

MeSH terms

  • Biomarkers*
  • Case-Control Studies
  • Classification
  • Data Interpretation, Statistical*
  • Discriminant Analysis
  • Epidemiologic Methods*
  • False Positive Reactions
  • Humans
  • Logistic Models
  • Mass Screening / methods
  • Mass Screening / standards*
  • Odds Ratio*
  • Predictive Value of Tests
  • Prognosis*
  • ROC Curve
  • Reproducibility of Results
  • Risk Factors

Substances

  • Biomarkers