Inferential Statistics or Inductive Statistics or Statistical Inference is the process of inferring something about the population based on what is measured in the sample. Inferential statistics are used to determine if observed data we obtain from a sample (i.e., data we collect) are different from what one would expect by chance alone

Statistics - Introduction & Terminology

Some may argue that statisticians are not really interested in generalizing from a sample to a specified population but to an idealized super­population spanning space and time

best course on statistics: https://bolt.mph.ufl.edu/6050-6052/

Link to original

Inferential Statistics - Paradigms

  • Circular transclusion detected: mathematics/probability---statistics---information-theory---econometrics/statistics/inferential-statistics/index

Link to original

Inferential Statistics - Forms/Methods

Each form/method represents a different way of using the information obtained in the sample to draw conclusions about the population

Inferential statistics uses probabilistic approximate inference algorithms to infer probabilities of the global population

Inferential Statistics - Process

  1. A random sample is taken from the population
  2. In order to estimate a population parameter, a sample statistic is calculated from the sample (e.g. sample mean, sample proportion, etc.)
  3. We then learn about the sample statistic’s sampling distribution
  4. Using this sampling distribution we can make inferences about our population parameter based on our sample statistic

Inferential Statistics - Goals

  • Parameter Estimation / Interval Estimation / Hypothesis Testing
    • the parameters/properties of a population distribution are called population parameters and they are often an unknown constant. These parameters need to be estimated in such a way that the resulting distribution model best explains the observed data
    • e.g. the parameters of a normal distribution are its mean and standard deviation. So, if you know that the data resembles the model of a normal distribution, parameter estimation would amount to trying to learn the true values of its mean and standard deviation
  • Structure Estimation - Distribution Model Comparison
    • the distribution of a population is often unknown
    • we propose a set of possible distribution models, have each model parameter estimated, and then use model comparison to select the model that best explains the observed data
  • Data Prediction
    • for this goal, you usually have a distribution model produced from the first 2. Then you use them to predict future data.
    • e.g. after measuring the heights of females in a sample, you can estimate the mean and standard deviation of the distribution for all adult females. Then you can use these values to predict the probability of a randomly chosen female having a height within a certain range of values