2-sample problems - comparison of 2 samples, and making inferences of corresponding populations

  • Population 1: 𝑋 ∼ 𝑓𝑋(𝑥), 𝐄(𝑋) = 𝜇𝑋
  • Population 2: 𝑌 ∼ 𝑓𝑌(𝑦), 𝐄(𝑌) = 𝜇𝑌

CI with 2 Independent Samples - both 𝑋 and 𝑌 samples come from 2 different subjects (i.e. independent observations)

sample size of 𝑋 and 𝑌 may be different

𝑋

𝑌

𝑋1

𝑌1

𝑋2

𝑌2

𝑋𝑛

𝑌𝑚

CI General Formula

CI Formula For 2 Independent Samples of Sample Mean

the general formula states the confidence interval is:

  • 𝜃ˆ ± 𝑧*·𝑆𝐸(𝜃ˆ)

computing CI for population mean, we substitute:

  • 𝜃ˆ = 𝑋̅-𝑌̅
  • 𝑆𝐸(𝜃ˆ) = 𝑆𝐸(𝑋̅-𝑌̅) = 𝑟𝑜𝑜𝑡[(𝜎𝑋2/𝑛𝑋) + (𝜎𝑌2/𝑛𝑌)]

computation of 𝑆𝐸(𝑋̅-𝑌̅):

if population standard deviation 𝜎𝑋 and 𝜎𝑌are UNKNOWN and assumed to be:

  • EQUAL (use pooled standard deviation)
  • NOT EQUAL (use Satterthwaite’s Approximation)

CI Formulas For 2 Independent Samples of Sample Mean

Large Sample Sizes
(𝑛𝑋& 𝑛𝑌)

Normal Population
(𝑋 and 𝑌)

(𝜎𝑋and 𝜎𝑌)
Known

𝜎𝑋 = 𝜎𝑌Assumed

Confidence Interval

FALSE

FALSE

EITHER

EITHER

Bootstrap Method

FALSE

TRUE

FALSE

FALSE

(𝑋̅-𝑌̅) ± 𝑡𝛼/2,𝑣·𝑟𝑜𝑜𝑡(𝑠𝑋2/𝑛𝑋 + 𝑠𝑌2/𝑛𝑌)

FALSE

TRUE

FALSE

TRUE

(𝑋̅-𝑌̅) ± 𝑡𝛼/2,𝑛𝑋+𝑛𝑌-2·𝑟𝑜𝑜𝑡(𝑠𝑝2/𝑛𝑋 + 𝑠𝑝2/𝑛𝑌)

FALSE

TRUE

TRUE

EITHER

(𝑋̅-𝑌̅) ± 𝑧𝛼/2·𝑟𝑜𝑜𝑡(𝜎𝑋2/𝑛𝑋 + 𝜎𝑌2/𝑛𝑌)

TRUE

EITHER

FALSE

FALSE

(𝑋̅-𝑌̅) ± 𝑧𝛼/2·𝑟𝑜𝑜𝑡(𝑠𝑋2/𝑛𝑋 + 𝑠𝑌2/𝑛𝑌)

TRUE

EITHER

FALSE

TRUE

(𝑋̅-𝑌̅) ± 𝑧𝛼/2·𝑟𝑜𝑜𝑡(𝑠𝑝2/𝑛𝑋 + 𝑠𝑝2/𝑛𝑌)

TRUE

EITHER

TRUE

EITHER

(𝑋̅-𝑌̅) ± 𝑧𝛼/2·𝑟𝑜𝑜𝑡(𝜎𝑋2/𝑛𝑋 + 𝜎𝑌2/𝑛𝑌)