General Formula
Click here to expand...
Link to originalCI Definition
An interval [𝐴, 𝐵] is a (1 − 𝛼)100% confidence interval for the parameter 𝜃 if it contains the parameter with probability (1 − 𝛼):
- 𝐏{𝐴 ≤ 𝜃 ≤ 𝐵} = 1 − 𝛼
where:
- 𝛼 - significance level
- (1 − 𝛼) - confidence level or coverage probability
CI Formula Intuition
Click here to expand...
Given a sample of data and a desired confidence level (1 − 𝛼), how can we construct a confidence interval [𝐴, 𝐵] that will satisfy the coverage condition
- 𝐏{𝐴 ≤ 𝜃 ≤ 𝐵} = 1 − 𝛼
first we need to estimate 𝜃
choose an unbiased estimator with normal distribution (e.g. MLE)
use the estimator to take the sample data and estimate 𝜃 point estimate 𝜃ˆ
next we standardize 𝜃ˆ to get a standard normal variable 𝑧:
- 𝑧 = [𝜃ˆ - 𝐄(𝜃ˆ)] / 𝜎(𝜃ˆ)
since 𝜃ˆ was estimated with an unbiased estimator: 𝐄(𝜃ˆ) = 𝜃
- 𝑧 = (𝜃ˆ - 𝜃) / 𝜎(𝜃ˆ)
this variable 𝑧 falls between the standard normal quantiles 𝑞𝛼/2and 𝑞1−𝛼/2, denoted by
- -𝑧𝛼/2= 𝑞𝛼/2
- 𝑧𝛼/2= 𝑞1−𝛼/2
with probability (1 - 𝛼), then:
- 𝐏{-𝑧𝛼/2 ≤ (𝜃ˆ - 𝜃) / 𝜎(𝜃ˆ) ≤ 𝑧𝛼/2} = 1 - 𝛼
now rearrange for 𝜃:
- 𝐏{𝜃ˆ - 𝑧𝛼/2·𝜎(𝜃ˆ) ≤ 𝜃 ≤ 𝜃ˆ + 𝑧𝛼/2·𝜎(𝜃ˆ)} = 1 - 𝛼
we have obtained two numbers:
- 𝐴 = 𝜃ˆ - 𝑧𝛼/2·𝜎(𝜃ˆ)
- 𝐵 = 𝜃ˆ + 𝑧𝛼/2·𝜎(𝜃ˆ)
such that
- 𝐏{𝐴 ≤ 𝜃 ≤ 𝐵} = 1 − 𝛼
CI Formulas
Large Sample Size (𝑛)
Normal Population
𝑆𝐸(𝜃ˆ) Known
Confidence Interval
FALSE
FALSE
EITHER
FALSE
TRUE
FALSE
𝜃ˆ ± 𝑡𝛼/2·𝑆𝐸ˆ(𝜃ˆ)
FALSE
TRUE
TRUE
𝜃ˆ ± 𝑧𝛼/2·𝑆𝐸(𝜃ˆ)
TRUE
EITHER
FALSE
𝜃ˆ ± 𝑧𝛼/2·𝑆𝐸ˆ(𝜃ˆ)
TRUE
EITHER
TRUE
𝜃ˆ ± 𝑧𝛼/2·𝑆𝐸(𝜃ˆ)
where:
- 𝜃ˆ - point estimate/statistic or center of the interval
- 𝑧 - z-score a type of confidence multiplier
- 𝑡 - t-score a type of confidence multiplier
- 𝑆𝐸(𝜃ˆ) or 𝜎(𝜃ˆ) or 𝑆𝑡𝑑(𝜃ˆ) - standard error of the point estimator/statistic
- 𝑆𝐸ˆ(𝜃ˆ) or 𝑠(𝜃ˆ) or 𝑆𝑡𝑑ˆ(𝜃ˆ) - estimated standard error of the point estimator/statistic
CI Annotated
CI Diagram
CI For Sample Proportion
Click here to expand...
the general formula states the confidence interval is:
- 𝜃ˆ ± 𝑧*·𝑆𝐸(𝜃ˆ)
computing CI for population proportion, we substitute:
- 𝜃ˆ = 𝑝̂
- 𝑆𝐸(𝜃ˆ) = √(𝑝(1−𝑝)/𝑛)
therefore:
- 𝑝̂ ± 𝑧*·√(𝑝(1−𝑝)/𝑛)
where:
- 𝑝̂ - is the point estimate/statistic: sample proportion
- √(𝑝(1−𝑝)/𝑛) - is the standard error of sample proportion
CIs For Sample Proportion
|
Large Sample Size (𝑛) |
Normal Population |
𝑝 / 𝑆𝐸(𝑝̂) Known |
Confidence Interval |
|---|---|---|---|
|
FALSE |
FALSE |
EITHER | |
|
FALSE |
TRUE |
FALSE |
𝑝̂ ± 𝑡𝛼/2,𝑛-1·√(𝑝̂(1−𝑝̂)/𝑛) |
|
FALSE |
TRUE |
TRUE |
𝑝̂ ± 𝑧𝛼/2·√(𝑝(1−𝑝)/𝑛) |
|
TRUE |
EITHER |
FALSE |
𝑝̂ ± 𝑧𝛼/2·√(𝑝̂(1−𝑝̂)/𝑛) |
|
TRUE |
EITHER |
TRUE |
𝑝̂ ± 𝑧𝛼/2·√(𝑝(1−𝑝)/𝑛) |
/ci---formula-for-unbiased-estimator-with-normal-distribution/../../../../../../../../mathematics/probability---statistics---information-theory---econometrics/statistics/inferential-statistics/interval-estimation/confidence-interval-(ci)/ci---formula-for-unbiased-estimator-with-normal-distribution/confidence-interval-structure-general.png)
/ci---formula-for-unbiased-estimator-with-normal-distribution/../../../../../../../../mathematics/probability---statistics---information-theory---econometrics/statistics/inferential-statistics/interval-estimation/confidence-interval-(ci)/ci---formula-for-unbiased-estimator-with-normal-distribution/confidence-interval-structure-line.png)