Inference: Two-Way Tables

Chapter 18
Math 215

Government Spending

Do people that identify as belonging to different U.S. political parties have different views about government spending?
We will explore the relationship between party affiliation and opinions on government spending on both national defense and space exploration

Data

gss2016 ¹ dataset
Subset of General Social Survey (GSS) data from 2016
party (Dem, Ind, or Rep)
natarms opinion on current level of government spending on national defense
natspac opinion on current level of government spending on space exploration
149 respondents

Party	TOO LITTLE	ABOUT RIGHT	TOO MUCH	Total
Dem	17	14	12	43
Ind	20	28	24	72
Rep	24	8	2	34
Total	61	50	38	149

Party	TOO LITTLE	ABOUT RIGHT	TOO MUCH	Total
Dem	8	22	13	43
Ind	13	37	22	72
Rep	9	17	8	34
Total	30	76	43	149

With more than 2 groups, we can’t use a single difference in proportions to compare groups
Since there are more than two values for each categorical variable there are no “successes” or “failures”
We will use the \(\chi^2\) statistic (chi-squared) to measure the difference between groups

Military Spending

Hypotheses stated in terms of an association
- \(H_0\): There is no association between opinions on government spending on national defense and political affiliations.
- \(H_A\): There is an association between opinions on government spending on national defense and political affiliations.
Hypotheses stated in terms of differences
- \(H_0\): There is no difference in opinions on government spending on national defense between people with different political affiliations.
- \(H_A\): There is some difference in opinions on government spending on national defense between people with different political affiliations.

Expected Counts

Party	TOO LITTLE	ABOUT RIGHT	TOO MUCH	Total
Dem	17 (17.60)	14 (14.43)	12 (10.97)	43
Ind	20 (29.48)	28 (24.16)	24 (18.36)	72
Rep	24 (13.92)	8 (11.41)	2 (8.67)	34
Total	61	50	38	149

First compute the expected cell counts, assuming \(H_0\)
Overall proportion of people that said “too little” spending = \(61/149\) = \(0.4094\)
If no association between party and opinion, we expect the same proportion of Democrats, Independents and Republicans to have this opinion
- Expected count for dems with opinion “too little” = \(0.4094\cdot 43 = 17.60\)
- Expected count for inds with opinion “too little” = \(0.4094\cdot 72 = 29.48\)
- Expected count for reps with opinion “too little” = \(0.4094\cdot 34 = 13.92\)
Easy way to remember how to compute each expected cell count is \[\frac{row\_total \cdot column\_total}{table\_total}\]

Party	TOO LITTLE	ABOUT RIGHT	TOO MUCH
Dem	\(\frac{(17 -17.60)^2}{17.60}=0.02\)	\(\frac{(14-14.43)^2}{14.43}=0.01\)	\(\frac{(12-10.97)^2}{10.97}=0.10\)
Ind	\(\frac{(20-29.48)^2}{29.48}=3.05\)	\(\frac{(28-24.16)^2}{24.16}=0.61\)	\(\frac{(24-18.36)^2}{18.36}=1.73\)
Rep	\(\frac{(24-13.92)^2}{13.92}=7.30\)	\(\frac{(8-11.41)^2}{11.41}=1.02\)	\(\frac{(2-8.67)^2}{8.67}=5.13\)

Compute \(\frac{(observed\,count-expected\,count)^2}{expected\,count}\) for each cell
Add values to obtain \(\chi^2\) statistic, \[\begin{array}{rcl}\chi^2 &=& 0.02+0.01+0.10 \\ & + & 3.05 + 0.61 + 1.73 \\ & + & 7.30 + 1.02 + 5.13 \\ & =& 18.97\end{array}\]

Or just ask R…

library(infer)
X2_arm <- gss2016 |> 
  observe(natarms ~ party, stat = "Chisq") |> 
  pull()
X2_arm

X-squared 
 18.96998

Randomization Test for Independence

We can randomly permute the response (opinion) to simulate the null hypothesis being true
For each permuted sample, we calculate value of the \(\chi^2\) statistic
Let’s construct a null distribution for the military spending question

set.seed(8675309)

arm_perm <- gss2016 |>
  specify(natarms ~ party) |>
  hypothesise(null = "independence") |>
  generate(reps = 1000, type = "permute") |>
  calculate(stat = "Chisq")

Here is the original GSS data with 5 random permutations.

Here is the resulting histogram

Histogram of \(X^2\) statistics for 1,000 random permutations. Observed value (\(18.97\)) indicated by dashed vertical line.

Note that the shape of the histogram is neither symmetric nor bell-shaped. In fact, it only uses non-negative values

There were no values of \(\chi^2\) that were as extreme as the observed value
The p-value is approximately 0

arm_perm |>
  summarize(extreme_count = sum(stat >= X2_arm), pval = mean(stat >= X2_arm))

We can conclude that there is strong evidence of an association between opinions on military spending and political party (the two variables are not independent)

Test for Independence Using a Mathematical Model

Chi-squared test for assessing independence between categorical variables

When the null-hypothesis is true and the following conditions are met, \(X^2\) has a Chi-squared distribution with \(df=(r-1)\times(c-1)\) degrees of freedom:

Independent observations
Large samples: at least 5 expected counts in each cell

\(r\) is the number of rows and \(c\) is the number of columns in the two-way table (no totals)

Both two-way tables satisfy the large samples condition (at least 5 expected counts in each cell)
In both cases there are 3 rows and 3 columns in the table, so \(df=(r-1)\times(c-1)=(3-1)\times(3-1)=4\)

\(\chi^2(4)\) distribution
Overlay \(\chi^2(4)\) to histogram

Chi-squared disribution with \(df=4\). Purple line shows observed value for space question. Red line shows observed value for military question.

The p-value is the area under the curve that is beyond the observed \(X^2\) value
The pchisq function computes the area up to the specified cutoff, subtract value from 1 to find the p-value
Here the the p-value for the hypothesis test on military spending

Chi-squared disribution with \(df=4\). Red line shows test statistic for military question.

1 - pchisq(X2_arm, df = 4)

   X-squared 
0.0007966907

Test Results

As with the randomization-based test, the p-value is very small (<0.001) for the military spending question
Conclusion
- We reject null hypothesis in this case
- There is strong evidence that opinion on military spending and political affiliation are associated
- We can generalize these results to a larger population since it was a representative sample
- We cannot draw cause-and-effect conclusion since it was an observational study

Spending on Space Exploration

Hypotheses stated in terms of an association
- \(H_0\): There is no association between opinions on government spending on space exploration and political affiliations.
- \(H_A\): There is an association between opinions on government spending on space exploration and political affiliations.
Hypotheses stated in terms of differences
- \(H_0\): There is no difference in opinions on government spending on space exploration between people with different political affiliations.
- \(H_A\): There is some difference in opinions on government spending on space exploration between people with different political affiliations.

Test of Significance

Counts
Expected Counts
\(\chi^2\) Statistic
P-Value

Party	TOO LITTLE	ABOUT RIGHT	TOO MUCH	Total
Dem	8	22	13	43
Ind	13	37	22	72
Rep	9	17	8	34
Total	30	76	43	149

Party	TOO LITTLE	ABOUT RIGHT	TOO MUCH	Total
Dem	8 (8.66)	22 (21.93)	13 (12.41)	43
Ind	13 (14.50)	37 (36.72)	22 (20.78)	72
Rep	9 (6.85)	17 (17.34)	8 (9.81)	34
Total	30	76	43	149

This time we will go straight to R to calculate \(\chi^2\) statistic

X2_spac <- gss2016 |> 
  observe(natspac ~ party, stat = "Chisq") |> 
  pull()
X2_spac

X-squared 
  1.32606

Chi-squared disribution with \(df=4\). Red line shows test statistic for space exploration question.

1 - pchisq(X2_spac, df = 4)

X-squared 
0.8569388

Test Results

The p-value is quite large (\(p=0.857\)) for the space exploration question
Conclusion
- We failed to reject null hypothesis in this case
- There is no significant evidence that opinion on government spending on space exploration and political affiliation are associated. It is plausible that these two variables are independent
- We can generalize these results to a larger population since it was a representative sample
- We cannot draw cause-and-effect conclusion since it was an observational study

\(X^2\) distributions for different \(df\)

Chi-squared disributions with different degrees of freedom.

Chi-squared distribution is more peaked for lower \(df\)
Thicker tail for higher \(df\)