Practical ANCOVA Example with R

Created:November 20, 2024

In this tutorial, we'll conduct a comprehensive Analysis of Covariance (ANCOVA) using R to evaluate the effectiveness of two teaching methods while controlling for students' prior academic performance. We'll cover the step-by-step process from data preparation to interpretation and reporting.

First and foremost, why do we use ANCOVA here? In educational settings, students' prior academic achievement often strongly predicts their future performance. By including prior GPA as a covariate, we can:

Reduce error variance and increase statistical power
Control for selection bias when random assignment isn't possible
Provide more accurate estimates of the teaching method's true effect

With this being said, our dependent variable is the final exam score, the independent variable is the teaching method, and the covariate is the prior GPA. Let's dive into the analysis.

1. Raw Data

Here we have a sample dataset of 20 students who were randomly assigned to either a traditional or interactive teaching method. The data includes students' prior GPAs and final exam scores.

Sample Dataset (First 20 students):

Student ID	Teaching Method	Prior GPA	Final Exam
1	Traditional	3.1	76
2	Traditional	3.4	82
3	Traditional	2.8	71
4	Traditional	3.7	85
5	Traditional	3.2	78
...
16	Interactive	3.0	79
17	Interactive	3.6	89
18	Interactive	3.1	82
19	Interactive	3.4	85
20	Interactive	3.7	90

Download the complete dataset: teaching_methods.csv

2. Initial Analysis in R

In this step, we load the data into R, summarize the key statistics, and create a scatterplot to visualize the relationship between prior GPA and final exam scores by teaching method:

# Load required packages
library(tidyverse)
library(car) # for assumption testing
library(effectsize) # for effect size
library(emmeans) # for adjusted means

# Read and examine the data
data <- read_csv("teaching_methods.csv")

# Get summary statistics by group
data |>
  group_by(teaching_method) |>
  summarise(
    n = n(),
    mean_gpa = mean(prior_gpa),
    sd_gpa = sd(prior_gpa),
    mean_exam = mean(final_exam),
    sd_exam = sd(final_exam)
  )

# A tibble: 2 × 6
  teaching_method     n mean_gpa sd_gpa mean_exam sd_exam
  <chr>           <int>    <dbl>  <dbl>     <dbl>   <dbl>
1 Interactive        10     3.35  0.303      84.6    4.72
2 Traditional        10     3.25  0.303      78.6    4.84

Visualizing the data can help identify patterns and relationships between variables. Let's create a scatterplot with a regression line:

ggplot(data, aes(x = prior_gpa, y = final_exam, color = teaching_method)) +
  geom_point(size = 3, alpha = 0.6) +
  geom_smooth(method = "lm", se = TRUE) +
  theme_minimal() +
  labs(
    title = "Final Exam Scores by Prior GPA and Teaching Method",
    x = "Prior GPA",
    y = "Final Exam Score",
    color = "Teaching Method"
  )

Scatterplot with regression line for final exam scores by prior GPA and teaching method.

The scatterplot shows a positive linear relationship between prior GPA and final exam scores, with a clear distinction between teaching methods. Interactive method students tend to have higher exam scores. The regression lines are roughly parallel, indicating homogeneity of regression slopes. Next, we'll proceed with the ANCOVA analysis and check assumptions.

3. Assumption Testing

a. Linearity Test and Homegeneity of Regression Slopes:

# Test linearity and homogeneity of regression slopes
model_interaction <- aov(final_exam ~ teaching_method * prior_gpa, data = data)
summary(model_interaction)

                          Df Sum Sq Mean Sq F value   Pr(>F)    
teaching_method            1  180.0   180.0 395.341 1.05e-12 ***
prior_gpa                  1  403.4   403.4 886.043 1.94e-15 ***
teaching_method:prior_gpa  1    0.1     0.1   0.213    0.651    
Residuals                 16    7.3     0.5                     
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

The non-significant interaction term (p = 0.651) indicates that the relationship between prior GPA and final exam scores is consistent across teaching methods, satisfying the linearity and homogeneity of regression slopes assumptions.

b. Testing Normality of Residuals:

# Test normality of residuals
model_initial <- aov(final_exam ~ teaching_method + prior_gpa, data = data)
shapiro.test(residuals(model_initial))

Output:

	Shapiro-Wilk normality test

data:  residuals(model_initial)
W = 0.98373, p-value = 0.9728

The Shapiro-Wilk test on residuals shows a non-significant p-value (p = 0.9728), indicating that the residuals are normally distributed. This satisfies the normality assumption for ANCOVA.

c. Test for Homogeneity of Variances:

# Check homogeneity of variances
leveneTest(final_exam ~ teaching_method, data = data)

Levene's Test for Homogeneity of Variance (center = median)
      Df F value Pr(>F)
group  1  0.0338 0.8562
      18

The Levene's test for homogeneity of variances is non-significant (p = 0.8562), indicating that the variances of residuals are equal across teaching methods. This satisfies the homogeneity of variances assumption for ANCOVA.

4. Fitting ANCOVA Model

With our assumptions met, we can proceed with the ANCOVA analysis to compare teaching methods while controlling for prior GPA:

# Fit ANCOVA model
ancova_model <- aov(final_exam ~ teaching_method + prior_gpa, data = data)
summary(ancova_model)

                Df Sum Sq Mean Sq F value   Pr(>F)    
teaching_method  1  180.0   180.0   414.5 2.24e-13 ***
prior_gpa        1  403.4   403.4   929.1 2.80e-16 ***
Residuals       17    7.4     0.4                     
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

The ANCOVA model shows a significant effect of teaching method (p < .001) and prior GPA (p < .001) on final exam scores. The adjusted means will provide a clearer picture of the group differences while controlling for GPA.

5. Effect Sizes

Let's calculate effect sizes to quantify the practical significance of our results:

# Get effect sizes
eta_squared(ancova_model)

# Effect Size for ANOVA (Type I)

Parameter       | Eta2 (partial) |       95% CI
-----------------------------------------------
teaching_method |           0.96 | [0.92, 1.00]
prior_gpa       |           0.98 | [0.96, 1.00]

- One-sided CIs: upper bound fixed at [1.00].

The effect sizes show a large impact of teaching method (η² = 0.96) and prior GPA (η² = 0.98) on final exam scores. This indicates a substantial practical significance of both variables.

6. Adjusted Means

Finally, let's calculate the adjusted means for each teaching method based on prior GPA:

# Calculate adjusted means
adjusted_means <- emmeans(ancova_model, "teaching_method")
pairs(adjusted_means)
confint(adjusted_means)

  contrast                  estimate    SE df t.ratio p.value
 Interactive - Traditional     4.44 0.299 17  14.831  <.0001

# Confidence Intervals
 teaching_method emmean   SE df lower.CL upper.CL
 Interactive      83.82 0.21 17    83.38    84.26
 Traditional      79.38 0.21 17    78.94    79.82

Confidence level used: 0.95

The adjusted means show a significant difference between teaching methods, with the interactive method scoring 4.44 points higher than the traditional method. The confidence intervals confirm the reliability of this difference.

7. Interpretation and Reporting

After controlling for prior GPA, the analysis revealed:

Significant effect of teaching method (p < .0001)
Large effect size (partial η² = 0.96)
Interactive method showed higher adjusted scores (diff = 4.44 points)
Prior GPA was a significant covariate (p < .001, partial η² = 0.98)

Example Report:

An analysis of covariance (ANCOVA) was conducted to examine the effect of teaching method (interactive vs. traditional) on final exam scores while controlling for students' prior GPA. Preliminary analyses confirmed that the assumptions of normality (Shapiro-Wilk test: W = 0.984, p = .973), homogeneity of variances (Levene's test: F = 0.012, p = .914), and homogeneity of regression slopes (F(1,16) = 0.213, p = .651) were satisfied.

After adjusting for prior GPA, there was a statistically significant difference between teaching methods, F(1,17) = 395.34, p < .001, partial η² = 0.96. Students in the interactive teaching group (adjusted M = 83.82, SE = 0.21) scored significantly higher than those in the traditional teaching group (adjusted M = 79.38, SE = 0.21), with a mean difference of 4.44 points (95% CI [3.81, 5.07]). The covariate, prior GPA, was significantly related to final exam scores, F(1,17) = 886.04, p < .001, partial η² = 0.98.

These results suggest that the interactive teaching method leads to significantly better exam performance compared to the traditional method, even after accounting for students' prior academic achievement. The large effect size indicates that this difference is not only statistically significant but also practically meaningful.

Practical ANCOVA Example with R

1. Raw Data

2. Initial Analysis in R

3. Assumption Testing

a. Linearity Test and Homegeneity of Regression Slopes:

b. Testing Normality of Residuals:

c. Test for Homogeneity of Variances:

4. Fitting ANCOVA Model

5. Effect Sizes

6. Adjusted Means

7. Interpretation and Reporting

Example Report:

8. Practical Implications

Related Calculators

ANCOVA Calculator

One-Way ANOVA Calculator

Simple Linear Regression Calculator

Scatter Plot Maker