This One-Sample Z-Test Calculator helps you compare a sample mean to a known population mean when the population standard deviation is known. For example, you could test if the average weight of products from a manufacturing line differs significantly from the target weight, given known production variability. The calculator performs comprehensive statistical analysis including descriptive statistics and hypothesis testing. It also generates publication-ready APA format reports. To learn about the data format required and test this calculator, click here to populate the sample data.
Calculator
1. Load Your Data
2. Select Columns & Options
Learn More
One-Sample Z-Test
Definition
One-Sample Z-Test is a statistical test used to determine whether a sample mean significantly differs from a known population mean when the population standard deviation is known. It's particularly useful for large samples and when working with known population parameters.
Formula
Test Statistic:
Where:
- = sample mean
- = hypothesized population mean
- = known population standard deviation
- = sample size
Confidence Interval:
Key Assumptions
Practical Example
A manufacturing process is known to produce bolts with a population mean of 10mm and standard deviation of 0.2mm. To test if the process has shifted:
- Sample of 100 bolts measured
- Sample mean = 10.05mm
- Known = 0.2mm
- mm
- mm
Calculating z-statistic:
Constructing confidence interval
We are 95% confident that the true mean lies between and .
Effect Size
Cohen's d for one-sample z-test:
Interpretation guidelines:
- Small effect:
- Medium effect:
- Large effect:
Power Analysis
Required sample size for desired power (1-β):
Where:
- = anticipated population mean
- = hypothesized mean
- = population standard deviation
- = significance level
- = probability of Type II error
Decision Rules
Reject if:
- Two-sided test:
- Left-tailed test:
- Right-tailed test:
- Or if
Reporting Results
Standard format:
Code Examples
library(tidyverse)
set.seed(42)
sample_data <- tibble(
value = rnorm(50, mean = 150, sd = 10) # 50 observations, mean=150, sd=10
)
# Known parameters
pop_mean <- 145
pop_sd <- 10 # Known population SD (this is key for z-test)
# Basic summary statistics
summary_stats <- sample_data %>%
summarise(
n = n(),
mean = mean(value),
sd = sd(value),
se = pop_sd/sqrt(n) # Note: using pop_sd for z-test
)
# Calculate z-statistic
z_stat <- with(summary_stats, (mean - pop_mean)/(pop_sd/sqrt(n)))
p_value <- 2 * pnorm(-abs(z_stat)) # two-sided
# Effect size (Cohen's d)
cohens_d <- with(summary_stats, (mean - pop_mean)/pop_sd)
# Calculate confidence interval (95%)
alpha <- 0.05
ci <- summary_stats %>%
mutate(
lower = mean - qnorm(1-alpha/2) * se,
upper = mean + qnorm(1-alpha/2) * se
) %>%
select(lower, upper)
# Print results
print(str_glue("Z-statistic: {round(z_stat, 3)}"))
print(str_glue("P-value: {round(p_value, 4)}"))
print(str_glue("Effect size (Cohen's d): {round(cohens_d, 3)}"))
print(str_glue("95% CI: [{round(ci$lower, 2)}, {round(ci$upper, 2)}]"))
# Visualize the data
ggplot(sample_data, aes(x = value)) +
geom_histogram(aes(y = after_stat(density)), bins = 10, fill = "lightblue", color = "black") +
geom_density(color = "darkblue") +
geom_vline(xintercept = pop_mean, color = "red", linetype = "dashed") +
geom_vline(xintercept = summary_stats$mean, color = "blue") +
theme_minimal() +
labs(
title = "Sample Distribution with Population Mean",
subtitle = "Blue: Sample Mean, Red dashed: Population Mean",
x = "Value",
y = "Density"
)
import numpy as np
from scipy import stats
import matplotlib.pyplot as plt
import seaborn as sns
# Generate sample data
np.random.seed(42)
n = 50
pop_mean = 145
pop_sd = 10
sample_data = np.random.normal(150, pop_sd, n)
# Calculate z-statistic
z_stat = (np.mean(sample_data) - pop_mean)/(pop_sd/np.sqrt(n))
# Calculate p-value (two-sided)
p_value = 2 * (1 - stats.norm.cdf(abs(z_stat)))
# Calculate confidence interval
alpha = 0.05
ci = stats.norm.interval(1-alpha,
loc=np.mean(sample_data),
scale=pop_sd/np.sqrt(n))
# Calculate effect size
cohens_d = abs(np.mean(sample_data) - pop_mean)/pop_sd
# Visualization
plt.figure(figsize=(10, 6))
sns.histplot(sample_data, stat='density')
plt.axvline(pop_mean, color='red', linestyle='--', label='Population Mean')
plt.axvline(np.mean(sample_data), color='blue', label='Sample Mean')
plt.legend()
plt.show()
# Print results
print(f"Z-statistic: {z_stat:.4f}")
print(f"P-value: {p_value:.4f}")
print(f"95% CI: [{ci[0]:.2f}, {ci[1]:.2f}]")
print(f"Cohen's d: {cohens_d:.4f}")