150 likes | 173 Views
Understand bias in Kaplan-Meier curves due to dependent censoring in prostate cancer trials. Learn why using the ASTRO definition for biochemical recurrence poses dangers. Explore cases of administrative censoring and its implications in medical studies.
E N D
University of Wisconsin – Madison Department of Biostatistics and Medical Informatics HOW TO BIAS A KAPLAN-MEIER CURVE AND WHY MANY PROSTATE CANCER TRIALS ARE IN DANGER Rick Chappell, Ph.D. Department of Biostatistics and Medical Informatics Depart of Statistics University of Wisconsin Madison
Outline • I. Obviously Dependent Censoring • “Retiring to Arizona” or “Going back to the farm” • Dependent Administrative Censoring • Induced by time trends • The ASTRO Definition of Biochemical Recurrence in Prostate Cancer
I. Obviously Dependent Censoring Consider a simple situation without censoring: x x x x Time
The K-M curve is the empirical CDF: 1 x x x x 0 Time
If the healthiest in terms of remaining life are selectively censored (negatively dependent censoring) then the K-M curve is biased downward: 1 x x O x O x 0 Time
If the sickest are selectively censored (positively dependent censoring) then the K-M curve is biased upward: 1 o x x x o x 0 Time
II. Dependent Administrative Censoring • Even when the sole source of censoring is administrative (event hasn’t yet occurred at the time of analysis), it can be dependent with failure time. • Pointed out by Kaplan & Meier (1958), credited to Sartwell and Merrell (1952), Am. J. Pub. Health42, “Influence of the dynamic character of chronic disease on the interpretation of morbidity rates”.
“For example, in a study of survival after an operation, a change in surgical technique five years before the data are analyzed will affect the survival times only of those with observation limit less than five years [p. 470].” Consider an extreme example: 1990 200 accrued 50% failure in 1991 1993 2000 accrued 50% failure in 1996 The rest are cured. An analysis is performed in 1995.
1 .5 0 K-M estimate for 1990 cohort of 200 analyzed in 1995 + 0 years 5 1 .5 0 K-M estimate for 1993 cohort of 2000 analyzed in 1995 0 years 5 = 1 .5 0 K-M estimate for combined sample of 2200 analyzed in 1995 0 years 5
Thus, even though the long-term failure rate in both cohorts is 50%, the K-M curve remains near 100%. • This is not a sample-size issue: the confidence intervals for the previous example are narrow (and can be made arbitrarily narrower by choosing higher sample sizes). • Note that censoring is solely administrative.
III. The ASTRO Definition of Biochemical Failure (BF) in Prostate Cancer The American Society for Therapeutic Radiology and Oncology consensus statement on guidelines for PSA following radiation therapy (1997): “Three consecutive rises in prostate-specific antigen (PSA) after reaching the PSA nadir constitute BF. The date of failure is the midpoint between the nadir and the first of the three consecutive rises in PSA.”
A hypothetical PSA curveafter radiation treatment PSA assay times PSA level observed nadir backdated BF treatment “at call” BF Time
Why Backdating is a Problem • Problems with definition quickly noticed by Vicini et al., attributed to inadequate followup. • They examined a series of prostate cancer patients treated with radiation and followed for up to 12 years. • They artificially censored patients at a range of followups, recalculated backdated BF times, and plotted K-M curves.
Vicini, F.A., Kestin, L.L., and Martinez, A.A. The importance of adequate follow-up in defining treatment success after external beam irradiation for prostate cancer. IJROBP 1999; 45:553-561.
Their conclusion: need more followup – at least 5, preferably 10 years. • Vicini and others recommended that most or all patients be followed “at least beyond the time point at which actuarial results are examined”. • This is problematic considering the lengthy progress of the disease and the frail patient population.