Understanding A/B Test Sample Size
A/B testing, also known as split testing, is a method of comparing two versions of a webpage or app against each other to determine which one performs better. The goal is to understand how a variation affects a user's behavior, typically measured by a key performance indicator like conversion rate. A crucial aspect of running a statistically sound A/B test is determining the appropriate sample size.
Running an A/B test without adequate sample size can lead to unreliable results. You might miss a real, albeit small, improvement (a Type II error) or incorrectly conclude that a change had an effect when it didn't (a Type I error). This calculator helps you determine the minimum number of users you need to expose to each variation of your test to achieve statistically significant results.
Key Concepts:
Baseline Conversion Rate: This is the current conversion rate of your control (original) version. It's a historical performance metric that forms the foundation for your sample size calculation. For a fixed relative MDE, a higher baseline conversion rate generally requires a smaller sample size, because the absolute difference you are trying to detect is larger.
Minimum Detectable Effect (MDE): This is the smallest improvement in conversion rate that you want to be able to confidently detect. It's often expressed as a relative percentage (e.g., a 20% increase over the baseline) or an absolute percentage point difference (e.g., a 1% increase from 10% to 11%). A smaller MDE requires a larger sample size.
Statistical Power (1 – Beta): This represents the probability of correctly detecting a true effect if it exists. A common standard is 80% or 90% power. Higher power means a lower chance of a Type II error (failing to detect a real effect). Achieving higher power requires a larger sample size.
Significance Level (Alpha): This is the probability of a Type I error – concluding that there is a difference when there isn't one (a false positive). Common values are 5% (0.05) or 1% (0.01). A lower significance level (higher confidence) requires a larger sample size.
Test Duration: While not directly used in the core statistical formula for sample size per variation, it's a practical consideration. Knowing your required sample size per variation and your expected daily/weekly traffic allows you to estimate how long your test will need to run to gather sufficient data.
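For instance, here is a minimal sketch of that duration estimate; the helper name and the weekly traffic figure are illustrative assumptions, not inputs of the calculator below:

// Estimate how long a test must run, given the required sample size per
// variation and the site's weekly eligible traffic (hypothetical figure).
function estimateTestDurationWeeks(sampleSizePerVariation, numVariations, weeklyTraffic) {
  // Total users needed across all variations of the test
  var totalNeeded = sampleSizePerVariation * numVariations;
  // Round up: a partial week still has to be run in full
  return Math.ceil(totalNeeded / weeklyTraffic);
}
// Example: 3,842 users per variation, 2 variations, 5,000 eligible users per week
estimateTestDurationWeeks(3842, 2, 5000); // => 2 (weeks)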
The Math Behind the Calculator
The calculation of sample size for A/B tests often relies on formulas derived from statistical principles for comparing two proportions. A common approach involves using the normal approximation to the binomial distribution.
Let:
\(p_1\) be the baseline conversion rate (control group)
\(p_2\) be the expected conversion rate for the variation (i.e., \(p_1 \times (1 + \text{MDE relative})\) or \(p_1 + \text{MDE absolute}\))
\(P = (p_1 + p_2) / 2\) be the pooled proportion
\(Z_{\alpha/2}\) be the Z-score corresponding to the significance level (e.g., for alpha=0.05, \(Z_{0.025} \approx 1.96\))
\(Z_{\beta}\) be the Z-score corresponding to the statistical power (e.g., for 80% power, beta=0.20, \(Z_{0.20} \approx 0.84\); for 90% power, beta=0.10, \(Z_{0.10} \approx 1.28\))
The formula for the sample size per variation \(n\) is approximately:
\[
n = \frac{\left( Z_{\alpha/2}\sqrt{2P(1-P)} + Z_{\beta}\sqrt{p_1(1-p_1) + p_2(1-p_2)} \right)^2}{(p_1 - p_2)^2}
\]
This calculator simplifies the input for MDE by allowing either an absolute or relative percentage. The calculation then determines \(p_2\) based on the provided MDE and \(p_1\). The final result is the number of users needed for *each* variation (control and treatment).
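To make the formula concrete, here it is worked through once in plain JavaScript with illustrative inputs (a 10% baseline, a 20% relative MDE, alpha = 0.05, and 80% power); these numbers are assumptions chosen for demonstration only:

// Worked example of the formula above, with illustrative inputs
var p1 = 0.10;            // 10% baseline conversion rate
var p2 = p1 * (1 + 0.20); // 20% relative MDE => 0.12
var P = (p1 + p2) / 2;    // pooled proportion = 0.11
var zAlphaHalf = 1.96;    // two-sided alpha = 0.05
var zBeta = 0.842;        // 80% power (beta = 0.20)
var numerator = Math.pow(
  zAlphaHalf * Math.sqrt(2 * P * (1 - P)) +
  zBeta * Math.sqrt(p1 * (1 - p1) + p2 * (1 - p2)),
  2
);
var n = numerator / Math.pow(p1 - p2, 2);
console.log(Math.ceil(n)); // ~3842 users per variation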
When to Use This Calculator:
This calculator is invaluable for anyone planning an A/B test for:
Website landing pages
Product page designs
Call-to-action buttons
Email subject lines
App features
Marketing campaigns
Any scenario where you want to test variations of a user interface or experience to optimize conversion rates or other key metrics.
By ensuring you have adequate sample size, you increase the confidence in your A/B test results and make more informed decisions.
function calculateSampleSize() {
var baselineCR = parseFloat(document.getElementById("baselineConversionRate").value);
var mdeInput = parseFloat(document.getElementById("minimumDetectableEffect").value);
var power = parseFloat(document.getElementById("statisticalPower").value);
var alpha = parseFloat(document.getElementById("significanceLevel").value);
var durationWeeks = parseInt(document.getElementById("testDuration").value); // Not used in core calc, but good for context
// Input validation
if (isNaN(baselineCR) || baselineCR <= 0 || baselineCR >= 100) {
alert("Please enter a valid Baseline Conversion Rate between 0 and 100.");
return;
}
if (isNaN(mdeInput) || mdeInput <= 0) {
alert("Please enter a valid Minimum Detectable Effect (MDE) greater than 0.");
return;
}
if (isNaN(durationWeeks) || durationWeeks <= 0) {
alert("Please enter a valid Test Duration in weeks (greater than 0).");
return;
}
// Convert percentages to decimals
var p1 = baselineCR / 100;
var power_decimal = power; // power is already in decimal form from select
var alpha_decimal = alpha; // alpha is already in decimal form from select
// Interpret the MDE. This calculator uses a simple heuristic instead of
// asking the user explicitly: values below 5 are treated as absolute
// percentage-point differences, larger values as relative percentages.
// A more robust implementation would let the user choose the interpretation.
var p2;
if (mdeInput < 5) { // Heuristic: if MDE is small (e.g., < 5 percentage points), treat it as absolute.
p2 = p1 + (mdeInput / 100);
} else { // Otherwise, treat it as relative.
p2 = p1 * (1 + (mdeInput / 100));
}
// Ensure p2 does not exceed 1
p2 = Math.min(p2, 1);
// Z-scores for significance level and power
var zAlphaHalf = getZScore(alpha_decimal / 2);
var zBeta = getZScore(1 - power_decimal);
// Pooled proportion
var P = (p1 + p2) / 2;
// Sample size formula (per variation)
var numerator = Math.pow(zAlphaHalf * Math.sqrt(2 * P * (1 - P)) + zBeta * Math.sqrt(p1 * (1 - p1) + p2 * (1 - p2)), 2);
var denominator = Math.pow(p1 - p2, 2);
if (denominator === 0) {
alert("The baseline conversion rate and the target conversion rate are the same. Cannot calculate sample size.");
return;
}
var n = numerator / denominator;
// Round up to the nearest whole number
var requiredSampleSizePerVariation = Math.ceil(n);
// Calculate total sample size
var totalSampleSize = requiredSampleSizePerVariation * 2;
// Display results
document.getElementById("result-value").innerText = requiredSampleSizePerVariation.toLocaleString();
var resultExplanation = `This is the minimum number of users needed for each variation (Control and Treatment).`;
if (mdeInput < 5) {
resultExplanation += ` We assumed the MDE of ${mdeInput}% was an absolute difference.`;
} else {
resultExplanation += ` We assumed the MDE of ${mdeInput}% was a relative difference.`;
}
resultExplanation += ` With a baseline conversion rate of ${baselineCR}%, a significance level of ${alpha * 100}%, and ${power * 100}% statistical power, this is the sample size needed to reliably detect a difference of at least ${mdeInput}% under that interpretation.`;
document.getElementById("result-explanation").innerHTML = resultExplanation;
document.getElementById("result").style.display = "block";
}
// Helper function to get Z-score for common alpha levels.
// This is a simplified lookup. For exact values, a statistical library or inverse CDF function would be used.
function getZScore(probability) {
  // Compare with a small tolerance: 1 - 0.9 and 1 - 0.8 do not yield exactly
  // 0.1 and 0.2 in floating point, so strict equality lookups would fail.
  var eps = 1e-9;
  if (Math.abs(probability - 0.025) < eps) return 1.96;  // for alpha = 0.05
  if (Math.abs(probability - 0.005) < eps) return 2.576; // for alpha = 0.01
  if (Math.abs(probability - 0.10) < eps) return 1.282;  // for power 0.90 (beta = 0.10)
  if (Math.abs(probability - 0.20) < eps) return 0.842;  // for power 0.80 (beta = 0.20)
  if (Math.abs(probability - 0.05) < eps) return 1.645;  // for power 0.95 (beta = 0.05)
  // Fallback for less common values; the cases above cover most A/B testing scenarios.
  console.warn("Using approximation for Z-score for probability: " + probability);
  if (probability > 0.5) {
    // By symmetry, the upper-tail z-score for a probability above 0.5 is negative.
    return -getZScore(1 - probability);
  }
  return 0; // Default fallback
}
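If more precision is needed than the lookup table provides, the fallback could instead use a closed-form approximation of the inverse normal CDF. Below is a minimal sketch using the classic Abramowitz & Stegun rational approximation (formula 26.2.23, accurate to roughly 4.5e-4); approxZScore is an illustrative helper, not part of the calculator above:

// Approximate upper-tail z-score using the Abramowitz & Stegun
// rational approximation (formula 26.2.23), accurate to about 4.5e-4.
function approxZScore(p) {
  if (p <= 0 || p >= 1) return NaN;         // outside the valid range
  if (p > 0.5) return -approxZScore(1 - p); // symmetry for the lower tail
  var t = Math.sqrt(Math.log(1 / (p * p)));
  return t - (2.515517 + 0.802853 * t + 0.010328 * t * t) /
    (1 + 1.432788 * t + 0.189269 * t * t + 0.001308 * t * t * t);
}
// Example: approxZScore(0.025) ≈ 1.9604, versus the exact 1.95996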
// Initialize with default values if available or for first load
document.addEventListener('DOMContentLoaded', function() {
// Set default values if inputs are empty, otherwise preserve user input
if (document.getElementById("baselineConversionRate").value === "") document.getElementById("baselineConversionRate").value = "10.0";
if (document.getElementById("minimumDetectableEffect").value === "") document.getElementById("minimumDetectableEffect").value = "5"; // e.g., 5% relative
if (document.getElementById("statisticalPower").value === "") document.getElementById("statisticalPower").value = "0.9";
if (document.getElementById("significanceLevel").value === "") document.getElementById("significanceLevel").value = "0.05";
if (document.getElementById("testDuration").value === "") document.getElementById("testDuration").value = "2";
});