The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is small or the data is complex.
Bootstrapping, also known as bootstrap resampling, simulates the sampling process by drawing multiple random samples, with replacement, from an existing dataset. Here, the original sample acts as a stand-in "population," and each resample is treated as an independent sample drawn from this "population." The underlying assumption is that the original sample is a good representation of the broader population. This approach is especially valuable when sample sizes are limited, as in studies with rare fossils, ancient genomic samples, tissues from rare diseases, endangered species studies, and unique experiments that cannot easily be repeated.
The basic process of bootstrapping includes the following steps:
Since resampling is with replacement, each new sample may include repeated values from the original data, reflecting the randomness in the resampling process. Bootstrapping typically requires a high number of resamples (often over 1,000) to achieve stable estimates, which can then be used to calculate statistics like the mean, variance, standard error, or confidence intervals for population parameters.
Bootstrapping is both cost-effective and accessible, offering a straightforward way to make inferences without needing additional data. However, it relies heavily on the original sample, meaning that any biases or errors in the original data will be present in the bootstrapped results as well.
Bootstrapping is a resampling method that uses samples randomly drawn from the already collected sample with replacement.
Imagine a paleontologist trying to determine the mean wing length of a prehistoric insect species with only five fossil specimens.
A higher sample size is desirable to make better inferences, but there is no way to obtain more fossils. In such cases, the bootstrap resampling method is beneficial.
These data from five specimens give a mean length of 10.7 cm.
To begin bootstrapping, randomly draw samples from the original sample set.
Notice that this sample has an identical sample size to the original one, but some values are repeated. This occurs because the bootstrap resampling is entirely random.
Several such bootstrap samples are drawn to estimate the mean wing length distribution. This way, confidence intervals can also be obtained to estimate the population mean more accurately.
Bootstrapping is easy and cost-effective, but it relies on a limited sample. If such a sample is biased or collected erroneously, the bootstrap resampling will remain as biased or erroneous as the original sample.