gss 1 datasetwordsum is score on Wordsum testclass is self-identified social class| class | n | mean | sd |
|---|---|---|---|
| LOWER | 41 | 5.07 | 2.24 |
| MIDDLE | 331 | 6.76 | 1.89 |
| UPPER | 16 | 6.19 | 2.34 |
| WORKING | 407 | 5.75 | 1.87 |
\[\begin{array}{rcl}SSG &=& n_L(\bar{x}_L-\bar{x})^2 +n_M(\bar{x}_M-\bar{x})^2 + n_U(\bar{x}_U-\bar{x})^2+ n_W(\bar{x}_W-\bar{x})^2 \\ &=& 41\cdot(5.07-6.14 )^2+ 331\cdot(6.76-6.14)^2+ 16 \cdot (6.19-6.14)^2 + 407\cdot (5.75-6.14)^2\\ &=& 236.56\end{array}\]
\[\begin{array}{rcl}SSE &=& (n_L-1)s_L^2+(n_M-1)s_M^2+(n_U-1)s_U^2+(n_W-1)s_W^2\\ &=& (41-1)\cdot2.24^2+(331-1)\cdot1.89^2+(16-1)\cdot2.34^2+(407-1)\cdot1.87^2\\ &=& 2869.80\end{array}\]
ANOVA table
Key for understanding the ANOVA table
| term | df | sumsq | meansq | statistic |
|---|---|---|---|---|
| grouping variable | \(df_G=k-1\) | \(SSG\) | \(MSG=SSG/df_G\) | \(F=MSG/MSE\) |
| Residuals (error) | \(df_E=n-k\) | \(SSE\) | \(MSE=SSE/df_E\) |
Histogram of F scores (null distribution) for 1,000 random permutations of word scores. Dashed vertical line indicates observed F score.
F-distribution
When the null hypothesis is true and the following conditions are met, the \(F\) statistic has an \(F\)-distribution with \(df_1=k-1\) and \(df_2=n-k\) degrees of freedom.
The \(F\)-distribution with \(df_G=3\) and \(df_E=791\) degrees of freedom. The observed \(F\)-statistic is 21.73. The p-value is the total area to the right of 21.73 (too small to see).