- Sample Variance = Adjusted Sample Variance
- Sample Standard Deviation = Adjusted Sample Standard Deviation
Sample Variance - Intuition
Click here to expand...
read: Sample Mean
Let’s say we want the variance of the height of 1 trillion people.
There are 2 methods:
- population variance - measure the height of all 1 trillion people, acquire population mean, use population variance formula
- sample variance - get a sample of 100 people, acquire sample mean, use sample variance formula
The first method will get you the actual variance of height as it is the definition of variance over a population. However, it is impractical to measure 1 trillion people.
The second method is much easier, we only need to measure the height of 100 people. However, this sample variance may not accurately reflect the population variance. How well the sample variance reflects to population variance is through calculating sample variance’s:
- expected value - denoted as 𝐄(sample variance) = ?
- variance - denoted as 𝐕𝐚𝐫(sample variance) = ?
we want:
- 𝐄(sample variance) = population variance
- 𝐕𝐚𝐫(sample variance) = 0
Not surprisingly, the more we samples we take from population to calculate to sample variance:
- the closer 𝐄(sample variance) becomes the population variance
- the closer 𝐕𝐚𝐫(sample variance) becomes 0
below goes through the mathematics on why this is the case
Sample Variance - Definition / Formula
- sample variance (𝑠2) = [ 𝛴1≤𝑖≤𝑛 (𝑋𝑖- 𝑋̅)2] / (𝑛 - 1)
where:
- each 𝑋𝑖is a random sample drawn from a population
- 𝑛 is the sample size
- 𝑋̅ - is the sample mean
Sample Standard Deviation - Definition / Formula
- sample standard deviation (𝑠) = 𝑟𝑜𝑜𝑡(𝑠2)
where:
- each 𝑋𝑖is a random sample drawn from a population
- 𝑛 is the sample size
- 𝑋̅ - is the sample mean
Sample Variance - Expected Value
𝐄(sample variance 𝑠2) = 𝜎2
proof
- 𝐄(𝑠2) = 𝐄([ 𝛴1≤𝑖≤𝑛 (𝑋𝑖- 𝑋̅)2] / (𝑛 - 1)) # substitution of sample variance formula
- (𝑛 - 1)𝐄(𝑠2) = 𝐄[𝛴1≤𝑖≤𝑛 (𝑋𝑖2- 2𝑋𝑖𝑋̅ + 𝑋̅2)]
- (𝑛 - 1)𝐄(𝑠2) = 𝐄[𝑋𝑖2·𝛴1≤𝑖≤𝑛1] - 𝐄[2𝑋̅·𝛴1≤𝑖≤𝑛𝑋𝑖] + 𝐄[𝑋̅2·𝛴1≤𝑖≤𝑛1]
- (𝑛 - 1)𝐄(𝑠2) = 𝐄[𝑛𝑋𝑖2] - 𝐄[2𝑋̅·𝑛·𝑋̅] + 𝐄[𝑋̅2·𝑛] # 𝛴1≤𝑖≤𝑛𝑋𝑖 = 𝑛·𝑋̅
- (𝑛 - 1)𝐄(𝑠2) = 𝑛·𝐄[𝑋𝑖2] - 2𝑛·𝐄[𝑋̅2] + 𝑛·𝐄[𝑋̅2]
- (𝑛 - 1)𝐄(𝑠2) = 𝑛·𝐄[𝑋𝑖2] - 𝑛·𝐄[𝑋̅2]
- [(𝑛 - 1)/𝑛]·𝐄(𝑠2) = 𝐄[𝑋𝑖2] - 𝐄[𝑋̅2]
- 𝐄[𝑋̅2] = 𝑉𝑎𝑟[𝑋̅] + 𝐄[𝑋̅]2
- 𝐄[𝑋̅2] = 𝑉𝑎𝑟[(1/𝑛)𝛴1≤𝑖≤𝑛𝑋𝑖] + 𝜇2
- 𝐄[𝑋̅2] = (1/𝑛2)𝑉𝑎𝑟[𝛴1≤𝑖≤𝑛𝑋𝑖] + 𝜇2
- 𝐄[𝑋̅2] = (1/𝑛2)[𝛴1≤𝑖≤𝑛𝑉𝑎𝑟(𝑋𝑖)] + 𝜇2
- 𝐄[𝑋̅2] = (1/𝑛2)[𝛴1≤𝑖≤𝑛𝜎2] + 𝜇2
- 𝐄[𝑋̅2] = (1/𝑛2)[𝑛𝜎2] + 𝜇2
- 𝐄[𝑋̅2] = (1/𝑛)[𝜎2] + 𝜇2
- [(𝑛 - 1)/𝑛]·𝐄(𝑠2) = 𝐄[𝑋𝑖2] - [(1/𝑛)[𝜎2] + 𝜇2]
- 𝐄[𝑋𝑖2] = 𝑉𝑎𝑟[𝑋𝑖] + 𝐄[𝑋𝑖]2
- 𝐄[𝑋̅2] = 𝜎2 + 𝜇2
- [(𝑛 - 1)/𝑛]·𝐄(𝑠2) = 𝜎2 + 𝜇2 - [(1/𝑛)[𝜎2] + 𝜇2]
- [(𝑛 - 1)/𝑛]·𝐄(𝑠2) = 𝜎2 - (1/𝑛)[𝜎2]
- (𝑛 - 1)·𝐄(𝑠2) = 𝑛𝜎2 - 𝜎2
- (𝑛 - 1)·𝐄(𝑠2) = (𝑛 - 1)𝜎2
- 𝐄(𝑠2) = 𝜎2
the expected value / mean of the sample variance 𝑠2 is population variance 𝜎2
Sample Standard Deviation - Expected Value
𝐄(sample standard deviation 𝑠) = 𝜎
proof
- 𝐄(𝑠) = 𝐄(√( [ 𝛴1≤𝑖≤𝑛 (𝑋𝑖- 𝑋̅)2] / (𝑛 - 1) )) # substitution of sample standard deviation formula
- 𝐄(𝑠) = √𝐄( [ 𝛴1≤𝑖≤𝑛(𝑋𝑖- 𝑋̅)2] / (𝑛 - 1) )
- 𝐄(𝑠) = √𝐄(𝑠2)
- 𝐄(𝑠) = √𝜎2# see Expected Value of Sample Variance
- 𝐄(𝑠) = 𝜎
the expected value / mean of the sample standard deviation 𝑠 is population standard deviation 𝜎
Sample Variance - Variance
𝐕𝐚𝐫(sample variance 𝑠2) = ?
proof
- 𝐕𝐚𝐫(𝑠2) = 𝐕𝐚𝐫([ 𝛴1≤𝑖≤𝑛 (𝑋𝑖- 𝑋̅)2] / (𝑛 - 1)) # substitution of sample variance formula
- (𝑛 - 1)2𝐕𝐚𝐫(𝑠2) = 𝐕𝐚𝐫[𝛴1≤𝑖≤𝑛(𝑋𝑖2- 2𝑋𝑖𝑋̅ + 𝑋̅2)]
- (𝑛 - 1)2𝐕𝐚𝐫(𝑠2) = 𝐕𝐚𝐫[𝛴1≤𝑖≤𝑛(𝑋𝑖2) - 𝛴1≤𝑖≤𝑛(2𝑋𝑖𝑋̅) + 𝛴1≤𝑖≤𝑛(𝑋̅2)]
- (𝑛 - 1)2𝐕𝐚𝐫(𝑠2) = 𝐕𝐚𝐫[𝛴1≤𝑖≤𝑛(𝑋𝑖2) - 2𝑋̅𝛴1≤𝑖≤𝑛(𝑋𝑖) + 𝑛𝑋̅2]
- (𝑛 - 1)2𝐕𝐚𝐫(𝑠2) = 𝐕𝐚𝐫[𝛴1≤𝑖≤𝑛(𝑋𝑖2) - 2𝑋̅𝑛·𝑋̅ + 𝑛𝑋̅2] # 𝛴1≤𝑖≤𝑛𝑋𝑖 = 𝑛·𝑋̅
- (𝑛 - 1)2𝐕𝐚𝐫(𝑠2) = 𝐕𝐚𝐫[𝛴1≤𝑖≤𝑛𝑋𝑖2] - 𝑛2𝐕𝐚𝐫[𝑋̅2]
- TODO
- 𝐕𝐚𝐫(𝑠2) = ?
Sample Standard Deviation - Variance
𝐕𝐚𝐫(sample standard deviation 𝑠) = ?
proof
- 𝐕𝐚𝐫(𝑠) = 𝐕𝐚𝐫(√( [ 𝛴1≤𝑖≤𝑛 (𝑋𝑖- 𝑋̅)2] / (𝑛 - 1) )) # substitution of sample standard deviation formula
- TODO
- 𝐕𝐚𝐫(𝑠) = ?
Sample Variance - Standard Deviation / Standard Error
𝐒𝐄(𝑠2) = 𝐒𝐭𝐝(𝑠2) = ?
Sample Standard Deviation - Standard Deviation / Standard Error
𝐒𝐄(𝑠) = 𝐒𝐭𝐝(𝑠) = ?