- frequentist statistics - makes predictions based on an estimate of 𝜃
- bayesian statistics - makes predictions based on all possible values of 𝜃
|
Based On | ||
|
Perspective |
|
|
|
What’s Computed |
|
|
|
What is Needed |
|
|
|
Methods |
| |
|
What |
|
|
|
How Uncertainty is Handled |
addresses uncertainty of point-estimate 𝜃ˆ by evaluating its variance (the variance of the estimator is an assessment of how the estimate might change with alternative samplings of the observed data) |
addresses uncertainty of point-estimator posterior-distribution 𝐏(𝜃|𝑛𝑒𝑤-𝑑𝑎𝑡𝑎) by simply integrating over it |
|
Other |
Example Parameter Estimation (Frequentist vs Bayesian)
say we want to estimate the average height of adult females. assume that:
- height has a normal distribution
- standard deviation is available and we don’t need to estimate it
Therefore, the only thing we need to estimate is the mean of the normal distribution
|
Frequentist Approach |
Bayesian Approach |
|---|---|
|
frequentist reasoning
The value mentioned in the end is known as the maximum likelihood estimate. It depends on the distribution of the data and I won’t go into details on its calculation. However, for normally distributed data, it’s quite straightforward: the maximum likelihood estimate of the population mean is equal to the sample mean |
bayesian reasoning
In a Bayesian setting, the newly collected data makes the probability distribution over the parameter narrower. More specifically, narrower around the parameter’s true (unknown) value. You do the updating process by applying Bayes’ theorem The way to update the entire probability distribution is by applying Bayes’ theorem to each possible value of the parameter |
Frequentists’ main objection to the Bayesian approach is the use of prior probabilities. Their criticism is that there is always a subjective element in assigning them. Paradoxically, Bayesians consider not using prior probabilities one of the biggest weaknesses of the frequentist approach