Published on 22/12/2025
Kaplan-Meier Curves and Estimating Median Survival in Clinical Trials
Survival analysis is crucial in clinical research, particularly when evaluating time-dependent outcomes like disease progression, recurrence, or death. Among its core techniques, Kaplan-Meier (KM) curves are the most widely used method to estimate survival probability over time. These curves allow researchers and regulators to visualize survival distributions and determine key metrics like the median survival time.
This tutorial offers a step-by-step guide to Kaplan-Meier curve construction, interpretation, and the estimation of median survival in the context of clinical trials. It is designed for pharma and clinical professionals seeking to strengthen their grasp of time-to-event analysis while ensuring compliance with statistical and regulatory guidelines such as those outlined by the USFDA.
What Is a Kaplan-Meier Curve?
A Kaplan-Meier curve is a step-function graph that estimates the survival function from time-to-event data. It shows the probability of surviving beyond certain time points in the presence of censored data.
The KM method allows for real-time survival analysis even when participants drop out or the trial ends before an event occurs. This flexibility makes it indispensable for studies where not all subjects reach an endpoint during the
Components of a Kaplan-Meier Curve
- X-axis: Time since the start of the study (e.g., days, weeks, months)
- Y-axis: Estimated survival probability
- Steps: Represent event occurrences (e.g., death, progression)
- Tick marks: Indicate censored data points
- Risk table: Number of patients at risk at different time points (often included below the graph)
Key Concepts for Estimation
1. Survival Probability (S(t))
The probability that a patient survives longer than a specific time t. This is recalculated at each time point when an event occurs.
2. Censoring
Occurs when a participant exits the trial (lost to follow-up, study end) before experiencing the event. Kaplan-Meier accommodates right censoring without introducing bias.
3. Median Survival Time
The time at which 50% of the study population is expected to have experienced the event. This is found by identifying the point where the survival curve drops below 0.5 on the Y-axis.
Constructing a Kaplan-Meier Curve: Step-by-Step
- Sort data: Order participants by the time to event or censoring.
- Calculate risk set: Number of patients still at risk at each time point.
- Calculate survival probability: Use the formula
S(t) = S(t−1) × (1 − d/n)where d = events, n = individuals at risk. - Plot curve: Each event causes a downward step in the curve.
- Mark censored observations: Use tick marks on the curve to show censored data.
Example Application: Oncology Trial
In a Phase III oncology trial comparing Drug A vs. placebo, survival data showed that the median overall survival (OS) for Drug A was 12.4 months compared to 9.8 months for placebo. Kaplan-Meier curves visually represented the survival advantage, and the log-rank test confirmed statistical significance.
This visualization allowed regulatory agencies to easily interpret survival benefit and contributed to the eventual approval of Drug A for this indication.
Interpreting Kaplan-Meier Curves
Proper interpretation of KM curves includes:
- Vertical drops: Represent event occurrences.
- Plateaus: Periods without events.
- Censored tick marks: Subjects no longer contributing to risk.
- Median survival: Time at which the curve crosses 0.5.
- Confidence intervals: Visualize uncertainty around estimates (often shaded areas or dashed lines).
Statistical Comparison Between Groups
To compare Kaplan-Meier curves between treatment groups:
1. Log-Rank Test
- Tests the null hypothesis that there’s no difference between groups.
- Assumes proportional hazards over time.
2. Cox Proportional Hazards Model
- Provides hazard ratios (HR) with 95% confidence intervals.
- Adjusts for covariates (age, sex, disease severity).
Best Practices in Kaplan-Meier Analysis
- Define event and censoring criteria clearly in the protocol and SAP.
- Ensure consistent time origin (e.g., date of randomization).
- Use software like R (
survivalpackage), SAS (PROC LIFETEST), or SPSS for accurate estimation. - Always include confidence intervals and risk tables in reports.
- Align plotting and reporting standards with regulatory expectations from CDSCO and StabilityStudies.in.
Software Tools for Kaplan-Meier Estimation
- R:
survivalandsurvminerfor estimation and visualization - SAS: PROC LIFETEST and PROC PHREG
- STATA, Python: Lifelines and other libraries
- SPSS: Kaplan-Meier Estimation module
Regulatory Expectations for KM Plots
Agencies like the EMA expect KM curves to be:
- Accompanied by a full SAP explanation
- Displayed in CSR (Clinical Study Report)
- Provided with digital source data for reproducibility
- Used in both interim and final analyses with consistency
Common Pitfalls to Avoid
- Failing to properly mark censored data
- Over-interpreting differences without statistical testing
- Incorrect time origin assignment
- Plotting survival beyond the last event time
Conclusion: Kaplan-Meier Curves Empower Clinical Decision-Making
Kaplan-Meier analysis provides a powerful visualization of survival trends in clinical trials. From estimating median survival to comparing treatment arms, KM curves offer actionable insights when executed correctly. Pharma professionals, statisticians, and regulatory experts must master the generation and interpretation of these curves to support successful trial design, execution, and submission.
