10.6 - Video Example: Exam Grade by Professor
This example uses the following dataset:
Download this Minitab dataset to follow along.
Three professors were each teaching one section of a course. They all gave the same final exam and they want to know if there are any differences between their sections’ scores.
\(H_0:\mu_1=\mu_2=\mu_3\)
\(H_a: Not\;all\;\mu\;are\;equal\)
Instructor | N | Mean | StDev | 95% CI |
---|---|---|---|---|
Dr. Al | 60 | 68.367 | 17.719 | (63.977, 72.756) |
Dr. Oh | 87 | 71.448 | 16.702 | (67.803, 75.094) |
Dr. Pa | 98 | 67.939 | 17.465 | (64.504, 71.373) |
Pooled StDev = 17.2609
The standard deviations for all three classes are all similar.
Using Minitab Express for Mac or PC: Statistics > ANOVA > One-Way ANOVA
The result is the following ANOVA source table:
Source | DF | Adj SS | Adj MS | F-Value | P-Value |
---|---|---|---|---|---|
Instructor | 2 | 635.3 | 317.671 | 1.07 | 0.3459 |
Error | 242 | 72101.1 | 297.938 | ||
Total | 244 | 72736.4 |
F (2, 242) = 1.07
From our ANOVA source table, p = .3459
Because \(p > \alpha\), we fail to reject the null hypothesis.
There is NOT evidence that the mean scores from the three different professors’ sections are different.
There is some debate as to whether pairwise comparisons are appropriate when the overall one-way ANOVA is not statistically significant. Some argue that if the overall ANOVA is not significant then pairwise comparisons are not necessary. Others argue that if the pairwise comparisons were planned before the ANOVA was conducted (i.e., "a priori") then they are appropriate.
The results of our Tukey pairwise comparisons were as follows:
Instructor | N | Mean | Grouping |
---|---|---|---|
Dr. Oh | 87 | 71.448 | A |
Dr. Al | 60 | 68.367 | A |
Dr. Pa | 98 | 67.939 | A |
Means that do not share a letter are significantly different.
Difference of Levels | Difference of Means | SE of Difference | 95% CI | T-Value | Adjusted P-Value |
---|---|---|---|---|---|
Dr. Oh-Dr. Al | 3.082 | 2.897 | (-3.698, 9.861) | 1.06 | 0.5366 |
Dr. Pa-Dr. Al | -0.428 | 2.829 | (-7.050, 6.195) | -0.15 | 0.9875 |
Dr. Pa-Dr. Oh | -3.510 | 2.543 | (-9.460, 2.441) | -1.38 | 0.3512 |
Individual confidence level = 97.99%
Looking at the first table, all three instructors are in group A. Means that share a less are not significantly different from one another (i.e., they are in the same group). Because all three instructors share the letter A, there are no significantly different pairs of instructors.
We could also look at the second table which gives us the t test statistic and adjusted p-value for each possible pairwise comparison. This p-value is adjusted to take into account that multiple tests are being conducted. You can compare these p-values to the standard alpha level of .05. All p-value are greater than .05, therefore no pairs are significantly different from one another.