Hypergeometric Distribution

Probability Mass Function

𝑃(𝑋=𝑘) is the probability of obtaining exactly 𝑥 successes

where:

  • 𝑁 is the size of the population
  • 𝐾 is the number of “successes” in the population
  • 𝑛 is the size of the sample
  • 𝑘 is the number of “successes” in the sample
  • 𝑖𝐶𝑗 is the number of combinations without replacement of taking 𝑗 things from a set of 𝑖 things

Expectation

𝐄(𝑋) = 𝑛 (𝐾/𝑁)

Variance

𝑉𝑎𝑟(𝑋) = 𝑛 𝐾(𝑁-𝐾) (𝑁-𝑛) / [𝑁2(𝑁-1)]

𝑉𝑎𝑟(𝑋) = 𝑛 𝑝(1-𝑝) [1 - (𝑛-1)/(𝑁-1)]

where:

  • 𝑝 = (𝐾/𝑁)
  • [1 - (𝑛-1)/(𝑁-1)] is the “finite population correction factor”

Subpages

Hypergeometric Distribution Example

Suppose you first randomly sample one card from a deck of 52. Then, without putting the card back in the deck you sample a second and then (again without replacing cards) a third. Given this sampling procedure, what is the probability that exactly two of the sampled cards will be aces (4 of the 52 cards in the deck are aces). You can calculate this probability using the following formula based on the hypergeometric distribution:

In this example:

  • 𝑁 = 52 because there are 52 cards in a deck
  • 𝐾 = 4 because there are four aces in the deck
  • 𝑛 = 3 because 3 cards were sampled
  • 𝑘 = 2 because the problem asks about the probability of getting two aces

Therefore,