A/B Test Duration Calculator

Calculate the required sample size and test duration before making a decision.

Inputs

- Baseline conversion rate: 3.0%
- Minimum detectable effect: 10% (the smallest improvement to detect; 10% means 3.0% → 3.30%)
- Traffic allocation: 100% of traffic goes to the test

Results

👀 Per variant: 53,224
👥 Total sample: 106,448
📅 Days needed: 22
🏁 End date: Apr 20, 2026

Test Timeline

Start: Mar 29, 2026
25%: Apr 4, 2026
50%: Apr 9, 2026
75%: Apr 15, 2026
Complete: Apr 20, 2026

Sensitivity Analysis

| MDE | Sample / Variant | Total Sample | Days |
|-----|------------------|--------------|------|
| 5%  | 207,997 | 415,994 | 84 |
| 10% | 53,224  | 106,448 | 22 |
| 15% | 24,198  | 48,396  | 10 |
| 20% | 13,915  | 27,830  | 6  |
| 25% | 9,100   | 18,200  | 4  |
| 30% | 6,454   | 12,908  | 3  |
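The Days column follows directly from the Total Sample column once you fix a daily traffic figure. A minimal sketch, assuming roughly 5,000 visitors per day (a figure inferred from the 22-day example, not stated on this page):

```python
from math import ceil

# Total-sample column from the table above, keyed by MDE
totals = {0.05: 415_994, 0.10: 106_448, 0.15: 48_396,
          0.20: 27_830, 0.25: 18_200, 0.30: 12_908}
DAILY_TRAFFIC = 5_000  # assumed daily visitors; inferred, not stated above

for mde, total in totals.items():
    # Round up: you can't run a fraction of a day
    print(f"MDE {mde:.0%}: {ceil(total / DAILY_TRAFFIC)} days")
```

At 5,000 visitors/day this reproduces every Days value in the table.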

⚠️ Should I Stop Early?

Stopping an A/B test before reaching statistical significance leads to false positives. If you stop at 50% of the required sample, there's a 30–40% chance your “winner” is actually a fluke. Always run the test for the full calculated duration of 22 days before making decisions.
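The inflation from peeking can be seen in a quick Monte Carlo sketch. It models the running z-statistic as a Brownian motion checked at ten evenly spaced looks, a standard approximation for sequential tests; the exact rate depends on how often you peek, so treat the output as illustrative:

```python
import math
import random

random.seed(42)
N_SIMS, PEEKS, Z_CRIT = 20_000, 10, 1.96  # 1.96 ~ two-sided 95% threshold

def peeking_false_positive_rate():
    """Under the null (no real difference), peek at PEEKS evenly spaced
    checkpoints and declare a winner at the first |z| > Z_CRIT.
    The running z-statistic is modeled as W(t) / sqrt(t) for Brownian W."""
    hits = 0
    for _ in range(N_SIMS):
        w = 0.0
        for k in range(1, PEEKS + 1):
            w += random.gauss(0.0, math.sqrt(1 / PEEKS))  # BM increment
            if abs(w / math.sqrt(k / PEEKS)) > Z_CRIT:
                hits += 1
                break
    return hits / N_SIMS

fpr = peeking_false_positive_rate()
print(f"False positive rate with {PEEKS} peeks: {fpr:.1%}")
```

With ten peeks the simulated false positive rate lands far above the nominal 5%, which is exactly the trap the warning above describes.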

How It Works

This calculator uses the two-proportion z-test formula to determine the minimum sample size needed per variant:

n = (Zα/2 + Zβ)² × (p1(1-p1) + p2(1-p2)) / (p2 - p1)²

Where p1 is your baseline conversion rate, p2 is the expected rate after improvement (p1 × (1 + MDE)), Zα/2 is the z-score for your significance level, and Zβ is the z-score for your chosen power.
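The formula translates directly into code; Python's `statistics.NormalDist` supplies the z-scores. This sketch reproduces the example above to within a few dozen visitors (the small gap comes from z-score rounding conventions):

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_variant(p1, mde, alpha=0.05, power=0.80):
    """Minimum visitors per variant for a two-proportion z-test."""
    p2 = p1 * (1 + mde)                            # expected improved rate
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # ~1.96 at 95% significance
    z_beta = NormalDist().inv_cdf(power)           # ~0.84 at 80% power
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2)

n = sample_size_per_variant(0.03, 0.10)
print(f"{n:,} per variant")  # within a few dozen of the 53,224 above
```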

Last updated: March 2026

What Is the A/B Test Duration Calculator?

Find out exactly how long to run your A/B test before making a decision. Enter your conversion rate, traffic, and the minimum improvement you want to detect — get the required sample size and test duration with a visual timeline. The calculator uses the two-proportion z-test formula, the same statistical method used by professional experimentation platforms.

One of the most common mistakes in A/B testing is stopping the test too early. When you end a test before reaching statistical significance, you dramatically increase false positive rates. This calculator tells you exactly how many visitors you need and how many days it will take, so you can plan your testing roadmap with confidence.

How to Use This Calculator

1. Enter your current conversion rate and daily traffic. These are the baseline numbers from your analytics.

2. Set the minimum detectable effect — the smallest improvement worth detecting. A 10% MDE on a 3% conversion rate means you want to detect a change from 3% to 3.3%.

3. Choose your statistical significance and power. 95% significance and 80% power are industry standards. Higher values require larger samples.

4. Review the sensitivity table to see how different MDEs affect your test duration, and use the visual timeline to plan your testing schedule.
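The steps above reduce to a few lines of arithmetic. A sketch using the example figures, where the 5,000 visitors/day and the Mar 29 start date are assumptions consistent with the 22-day timeline shown on this page:

```python
from datetime import date, timedelta
from math import ceil

total_sample = 106_448   # both variants, from the Results above
daily_traffic = 5_000    # assumed; consistent with the 22-day result
allocation = 1.0         # 100% of traffic goes to the test

days = ceil(total_sample / (daily_traffic * allocation))
end = date(2026, 3, 29) + timedelta(days=days)  # assumed start date
print(days, end)  # 22 2026-04-20
```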

Understanding Statistical Significance and Power

Statistical significance (confidence level) is the probability that your result isn't due to random chance. At 95% significance, there's only a 5% chance of a false positive — declaring a winner when there's actually no real difference.

Statistical power is the probability of detecting a real difference when one exists. At 80% power, you have a 20% chance of missing a true effect (false negative). Higher power means you're less likely to miss real improvements, but you'll need a larger sample.
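You can check that a planned sample actually delivers its promised power by inverting the relationship. A sketch using this page's example (3.0% → 3.3%, 53,224 visitors per variant):

```python
from math import sqrt
from statistics import NormalDist

def achieved_power(p1, p2, n, alpha=0.05):
    """Chance of detecting a true shift from p1 to p2 with n per variant."""
    nd = NormalDist()
    se = sqrt((p1 * (1 - p1) + p2 * (1 - p2)) / n)  # std. error of the difference
    return nd.cdf(abs(p2 - p1) / se - nd.inv_cdf(1 - alpha / 2))

print(f"{achieved_power(0.03, 0.033, 53_224):.1%}")  # ~80%, as designed
```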

Frequently Asked Questions

What's a good minimum detectable effect (MDE)?

For most tests, 10-20% relative change is practical. Smaller MDEs require much larger samples. If you have low traffic, aim for 15-20% MDE.

Why shouldn't I stop the test early if one variant is winning?

Early results are unreliable. Statistical significance requires a minimum sample size. Stopping early dramatically increases false positive rates — you might declare a "winner" that's actually just random noise.

What does "statistical power" mean?

Power is the probability of detecting a real difference when one exists. 80% power means there's an 80% chance you'll detect a true improvement. Higher power requires larger samples.

How does traffic allocation affect test duration?

If you allocate only 50% of traffic to the test, it takes twice as long to reach the required sample size. Use 100% allocation when possible to get results faster.
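Since days are rounded up to whole numbers, halving the allocation roughly doubles the duration. A sketch with the example numbers (the 5,000 daily visitors is an assumption, not stated on this page):

```python
from math import ceil

total_sample = 106_448  # from the example above
daily_traffic = 5_000   # assumed daily site traffic

for allocation in (1.0, 0.5, 0.25):
    days = ceil(total_sample / (daily_traffic * allocation))
    print(f"{allocation:.0%} allocation: {days} days")
```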

Should I use 90%, 95%, or 99% significance?

95% is the industry standard for most A/B tests. Use 90% for exploratory tests where false positives are less costly. Use 99% for critical changes like pricing or checkout flow modifications.
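The cost of stricter significance shows up directly in sample size. A sketch for this page's example inputs (3.0% baseline, 10% MDE, 80% power); figures will differ slightly from the sensitivity table because of z-score rounding:

```python
from math import ceil
from statistics import NormalDist

def n_per_variant(p1, mde, alpha, power=0.80):
    """Per-variant sample size at a given significance level."""
    p2 = p1 * (1 + mde)
    z = NormalDist().inv_cdf(1 - alpha / 2) + NormalDist().inv_cdf(power)
    return ceil(z ** 2 * (p1 * (1 - p1) + p2 * (1 - p2)) / (p2 - p1) ** 2)

for conf in (0.90, 0.95, 0.99):
    print(f"{conf:.0%} significance: {n_per_variant(0.03, 0.10, 1 - conf):,} per variant")
```

Moving from 95% to 99% significance costs roughly half again as many visitors per variant.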
