Chi-Squared Goodness of Fit and Independence

Reminder of Inference and Inferential Tools

We use statistical inference to make or test claims about population parameters which we cannot measure directly

We make claims by constructing confidence intervals
We test claims by conducting hypothesis tests

Confidence intervals provide a range of plausible values for a population parameter

They are centered at the point estimate (sample statistic)
They open up some “wiggle room” called a margin of error, which is influenced by the critical value and the standard error

\[\left(\begin{array}{c}\text{point}\\ \text{estimate}\end{array}\right) \pm \left(\begin{array}{c}\text{critical}\\ \text{value}\end{array}\right)\left(\begin{array}{c}\text{standard}\\ \text{error}\end{array}\right)\]

Reminder of Inference and Inferential Tools

We use statistical inference to make or test claims about population parameters which we cannot measure directly

We make claims by constructing confidence intervals
We test claims by conducting hypothesis tests

Confidence intervals provide a range of plausible values for a population parameter

They are centered at the point estimate (sample statistic)
They open up some “wiggle room” called a margin of error, which is influenced by the critical value and the standard error

\[\left(\begin{array}{c}\text{point}\\ \text{estimate}\end{array}\right) \pm \boxed{\left(\begin{array}{c}\text{critical}\\ \text{value}\end{array}\right)\left(\begin{array}{c}\text{standard}\\ \text{error}\end{array}\right)}\]

Reminder of Inference and Inferential Tools

We use statistical inference to make or test claims about population parameters which we cannot measure directly

We make claims by constructing confidence intervals
We test claims by conducting hypothesis tests

Confidence intervals provide a range of plausible values for a population parameter

They are centered at the point estimate (sample statistic)
They open up some “wiggle room” called a margin of error, which is influenced by the critical value and the standard error

\[\left(\begin{array}{c}\text{point}\\ \text{estimate}\end{array}\right) \pm \left(\begin{array}{c}\text{critical}\\ \text{value}\end{array}\right)\left(\begin{array}{c}\text{standard}\\ \text{error}\end{array}\right)\]

Inferential Tools (Continued)

Hypothesis tests provide a framework for testing claims about a population parameter

Assume the claim is false, in reality (null hypothesis)
Measure the probability of observing a sample at least as extreme as ours in that reality (this is called the \(p\)-value)
- If our observed data is “unlikely” (\(p\)-value is lower than the level of significance, \(\alpha\)), then what we’ve observed is incompatible with our assumed reality; we declare evidence that the null hypothesis is false and accept the alternative hypothesis instead
- If our observed data is not “unlikely” (\(p\)-value at least as large as the level of significance), then our observed data is compatible with our assumed reality – we don’t reject the null hypothesis

Where We Are; Where We’re Going…

Inference On...	Covered
One Binary Categorical Variable	✔
Association Between Two Binary Categorical Variables	✔

Where We Are; Where We’re Going…

Inference On...	Covered
One Binary Categorical Variable	✔
Association Between Two Binary Categorical Variables	✔
One MultiClass Categorical Variable	Today
Associations Between Two MultiClass Categorical Variables	Today

Where We Are; Where We’re Going…

Inference On...	Covered
One Binary Categorical Variable	✔
Association Between Two Binary Categorical Variables	✔
One MultiClass Categorical Variable	Today
Associations Between Two MultiClass Categorical Variables	Today
One Numerical Variable
Association Between a Numerical Variable and a Binary Categorical Variable
Association Between a Numerical Variable and a MultiClass Categorical Variable
Association Between a Numerical Variable and a Single Other Numerical Variable
Association Between a Numerical Variable and Many Other Variables

Where We Are; Where We’re Going…

Inference On...	Covered
One Binary Categorical Variable	✔
Association Between Two Binary Categorical Variables	✔
One MultiClass Categorical Variable	Today
Associations Between Two MultiClass Categorical Variables	Today
One Numerical Variable
Association Between a Numerical Variable and a Binary Categorical Variable
Association Between a Numerical Variable and a MultiClass Categorical Variable
Association Between a Numerical Variable and a Single Other Numerical Variable
Association Between a Numerical Variable and Many Other Variables
Association Between a Categorical Variable and Many Other Variables	✘

Reminder: Inference on a Single Categorical Variable

We’ve been focused on binary (two-class) categorical variables

The single-variable questions we’ve asked are of the form:

Can we estimate the population proportion?
- For example, with 95% confidence, what is the proportion of likely voters in New Hampshire who are planning to vote in favor of Amendment 1?
Is the population proportion greater/less/different than some proposed value?
- For example, is the proportion of likely voters in New Hampshire who favor Amendment 1 at least 66.67%?

But what if we were interested in categorical variables that have more than just two levels?

Are ideological alignments of voting-aged citizens in the US uniformly distributed across the categories very liberal, liberal, moderate, conservative, and very conservative?

Reminder: Inference on Associations Between Two Categorical Variables

Our multivariable questions have been of the form:

Can we estimate the difference in population proportions between Group A and Group B?
- For example, Find a 90% confidence interval for the difference in the proportion of students who feel a sense of belonging at their university between first-year students and seniors.
Is the population proportion in Group A greater/less/different than the population proportion in Group B?
- For example, Is the proportion of students who feel a sense of belonging at their university greater for seniors than for first-year students?

What about associations between categorical variables where at least one has three or more levels?

Is there an association between and individual’s ideology and their perception of the state of their finances (better off, worse off, or about the same) relative to four years ago?

Highlights

Analysing the form of a test statistic
The need for a different test statistic
The need for a new probability distribution
Chi-Squared Tests for Goodness of Fit (inference on a single, multiclass categorical variable)
- A Completed Example
Chi-Squared Tests for Independence (inference on associations between two potentially multiclass categorical variables)
- A Probability Review
- A Completed Example
Additional Examples

A Closer Look at a Test Statistic

So far, I’ve told you that a test statistic takes the form:

\[\begin{array}{c}\text{test}\\ \text{statistic}\end{array} = \frac{\left(\begin{array}{c}\text{point}\\ \text{estimate}\end{array}\right) - \left(\begin{array}{c}\text{null}\\ \text{value}\end{array}\right)}{S_E}\]

The point estimate comes from our sample data – it is our observed value
The null value comes from our null hypothesis – it is our expected outcome

Another way to phrase the test statistic formula then is

\[\begin{array}{c}\text{test}\\ \text{statistic}\end{array} = \frac{\text{observed} - \text{expected}}{S_E}\]

For example, if our null hypothesis assumed that 50% of individuals have characteristic “A” and a sample of 200 people included 95 that did, then our observed proportion is 0.475 and our expected proportion was 0.50, so our sample was about 2.5 “percentage points” away from the expected sample.