Survival curves are graphical representations that depict the survival experience of a population over time, offering an intuitive way to track the proportion of individuals who remain event-free at each time point. These curves are widely used in fields such as medicine, public health, and reliability engineering to visualize and compare survival probabilities across different groups or conditions.
The Kaplan-Meier estimator is the most common method for constructing survival curves. This non-parametric approach generates a stepwise function, where the curve drops at each time an event (such as death, disease recurrence, or mechanical failure) occurs. The horizontal segments between drops indicate periods of stability, during which no events occur. The x-axis of the curve represents time, while the y-axis shows the survival probability, ranging from 0 to 1. Survival curves provide several key insights:
For example, in a clinical trial comparing two cancer therapies, survival curves can reveal which treatment offers better survival outcomes. A curve that declines more gradually indicates a group with better survival probabilities. Similarly, in reliability engineering, survival curves are employed to estimate the lifespan of components or systems, enabling effective maintenance planning and failure analysis.
By providing a clear and accessible visual representation of complex time-to-event data, survival curves play a crucial role in data analysis. Their ability to summarize survival probabilities, identify key metrics like median survival time, and facilitate group comparisons makes them indispensable across a range of applications.
Consider a graph of the cumulative probability of death plotted as age on the X axis versus the proportion dead on the Y axis for a specific year.
This can be expressed as an equation, where the cumulative distribution function F(t) is the ratio of the number of people dead by time t to the total number observed.
As all population members are not observed until death, this curve cannot estimate survival.
So, the survivorship function or survival curve — S(t) — is the proportion or percent of people living till time t or beyond. It is expressed as follows.
The survival curve is then plotted using age and the percentage of people surviving.
There are various types of survival models. The exponential survival model characterizes a constant hazard over time, meaning the risk of the event occurring is independent of time.
The Weibull survival model can be used in various situations where the hazard rate monotonically increases or decreases over time.
The Log-normal and log-logistic models can be used in scenarios when the hazard rate is not monotonic.