Created:November 15, 2024
Last Updated:March 30, 2025
This Simple Linear Regression Calculator helps you analyze the relationship between two variables. It provides comprehensive analysis including model summary statistics, coefficient estimates, confidence intervals, and diagnostic tests. The calculator also generates a regression plot with the fitted line and confidence bands. To learn about the data format required and test this calculator, click here to populate the sample data.
Calculator
1. Load Your Data
2. Select Columns & Options
Learn More
Simple Linear Regression
Definition
Simple Linear Regression models the relationship between a predictor variable (X) and a response variable (Y) using a linear equation. It finds the line that minimizes the sum of squared residuals.
Key Formulas
Regression Line:
Slope:
Intercept:
R-squared:
Key Assumptions
Linearity: Relationship between X and Y is linear
Independence: Observations are independent
Homoscedasticity: Constant variance of residuals
Normality: Residuals are normally distributed
Practical Example
Step 1: Data
1 | 2.1 | -2 | -3.82 | 4 | 7.64 |
2 | 3.8 | -1 | -2.12 | 1 | 2.12 |
3 | 6.2 | 0 | 0.28 | 0 | 0 |
4 | 7.8 | 1 | 1.88 | 1 | 1.88 |
5 | 9.3 | 2 | 3.38 | 4 | 6.76 |
Means: ,
Step 2: Calculate Slope ()
Step 3: Calculate Intercept ()
Step 4: Regression Equation
Step 5: Calculate
(98.6% of variation in Y explained by X)
Code Examples
R
library(tidyverse)
data <- tibble(x = c(1, 2, 3, 4, 5),
y = c(2.1, 3.8, 6.2, 7.8, 9.3))
model <- lm(y ~ x, data=data)
summary(model)
ggplot(data, aes(x = x, y = y)) +
geom_point() +
geom_smooth(method = "lm", se = FALSE) +
theme_minimal()
par(mfrow = c(2, 2))
plot(model)
Python
import numpy as np
import pandas as pd
import statsmodels.api as sm
X = [1, 2, 3, 4, 5]
y = [2.1, 3.8, 6.2, 7.8, 9.3]
X = sm.add_constant(X)
model = sm.OLS(y, X).fit()
print(model.summary())