Confidence Level

The long-run proportion of confidence intervals, each calculated the same way from a repeated experiment, that would contain the true population parameter. In A/B testing it is commonly set at 90%, 95%, or 99%.

Also known as: complement of the significance level (1 - alpha), coverage probability
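The "coverage" reading of the definition can be checked with a small simulation. The sketch below is illustrative only: it assumes normally distributed data with a known standard deviation, and the function name `coverage_rate` and all default values are hypothetical choices, not part of any KISSmetrics feature.

```python
import random
from statistics import NormalDist

def coverage_rate(confidence, trials=4000, n=30, mu=10.0, sigma=2.0, seed=0):
    """Fraction of simulated experiments whose interval contains the true mean."""
    rng = random.Random(seed)
    # Two-sided critical value, about 1.96 when confidence is 0.95
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    # Half-width of the interval; sigma is treated as known (an assumption)
    half_width = z * sigma / n ** 0.5
    hits = 0
    for _ in range(trials):
        sample_mean = sum(rng.gauss(mu, sigma) for _ in range(n)) / n
        # The interval sample_mean +/- half_width covers mu exactly when this holds
        if abs(sample_mean - mu) <= half_width:
            hits += 1
    return hits / trials

for level in (0.90, 0.95, 0.99):
    print(f"{level:.0%} intervals covered the true mean in {coverage_rate(level):.1%} of runs")
```

Each printed rate should land close to its nominal level. The confidence level describes how often the procedure succeeds across many experiments, not the certainty of any single interval.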

Why It Matters

The confidence level is a dial you set before running an experiment that controls the tradeoff between certainty and speed. A higher confidence level (99%) means you need more data but are less likely to act on false positives. A lower confidence level (90%) lets you reach conclusions faster but accepts more risk.

Choosing the right confidence level depends on the cost of being wrong. For a pricing change that is difficult to reverse and affects all customers, use 99%. For a minor UI tweak on a secondary page, 90% might be sufficient. The 95% default is a reasonable middle ground, but it is a convention, not a law of nature.

The confidence level directly affects your required sample size and test duration. For a two-sided test at 80% power, moving from 90% to 95% confidence increases the required sample by a little under 30%, and moving from 95% to 99% adds roughly another 50%, so a 99% test needs almost twice the sample of a 90% test. For sites with limited traffic, this difference can mean weeks of additional test runtime.
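To see how the confidence level feeds into sample size, here is a minimal sketch using the standard normal-approximation formula for comparing two conversion rates. The function name `sample_size_per_arm`, the 10% and 12% conversion rates, and the 80% power setting are assumptions chosen for illustration; a real planning tool may use a different formula.

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_arm(baseline, variant, confidence=0.95, power=0.80):
    """Approximate visitors needed per arm for a two-sided two-proportion test."""
    z_alpha = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # set by the confidence level
    z_beta = NormalDist().inv_cdf(power)                      # set by the desired power
    variance = baseline * (1 - baseline) + variant * (1 - variant)
    return ceil((z_alpha + z_beta) ** 2 * variance / (variant - baseline) ** 2)

# Hypothetical test: 10% baseline conversion, hoping to detect a lift to 12%
sizes = {level: sample_size_per_arm(0.10, 0.12, confidence=level)
         for level in (0.90, 0.95, 0.99)}
for level, n in sizes.items():
    print(f"{level:.0%} confidence: {n} visitors per arm")
print(f"95% vs 90%: {sizes[0.95] / sizes[0.90]:.2f}x")
print(f"99% vs 95%: {sizes[0.99] / sizes[0.95]:.2f}x")
```

Under these assumptions the ratios come out near 1.27 and 1.49. They shift somewhat with the power setting, because the confidence level and power enter the formula together.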

Industry Applications

E-commerce

A luxury retailer uses 99% confidence for pricing experiments because incorrect prices damage brand perception and are noticed by customers immediately. For product page layout tests, they use 90% confidence to iterate faster.

SaaS

A fintech company requires 99% confidence for any experiment that affects the transaction flow (because errors mean lost money) but uses 95% confidence for onboarding experiments where the cost of a false positive is lower.

How to Track in KISSmetrics

Set your desired confidence level in KISSmetrics before launching an experiment. The platform will indicate when the result has reached your chosen confidence threshold. Use higher levels for irreversible changes and lower levels for easily reversible experiments.

Common Mistakes

-Using 95% confidence for every test without considering the stakes: some tests deserve 99%, others are fine at 90%
-Changing the confidence level after seeing results to make a borderline test appear significant
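-Confusing confidence level with the probability that your variant is actually better; a 95% confidence level describes the long-run error rate of the procedure, not a 95% chance that this variant wins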
-Not accounting for the impact of confidence level on required test duration when planning experiments
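The second mistake is easier to see with numbers. The sketch below runs a pooled two-proportion z-test on made-up conversion counts and checks the same result against three confidence levels. The helper names `two_sided_p_value` and `is_significant` and the counts are hypothetical, and the z-test is one common choice rather than the method any particular platform uses.

```python
from statistics import NormalDist

def two_sided_p_value(conv_a, n_a, conv_b, n_b):
    """Two-sided p-value from a pooled two-proportion z-test."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = (pooled * (1 - pooled) * (1 / n_a + 1 / n_b)) ** 0.5
    z = (p_b - p_a) / se
    return 2 * (1 - NormalDist().cdf(abs(z)))

def is_significant(p_value, confidence):
    """Significant when the p-value is below alpha = 1 - confidence."""
    return p_value < 1 - confidence

# Made-up data: control converts 500 of 5000 visitors, variant 555 of 5000
p = two_sided_p_value(500, 5000, 555, 5000)
for level in (0.90, 0.95, 0.99):
    print(f"{level:.0%} confidence: significant = {is_significant(p, level)}")
```

With these counts the result clears the 90% threshold but not 95% or 99%. The same data gives a different verdict depending on the level, which is why the level has to be fixed before the results are visible.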

Pro Tips

+Document your confidence level choice and rationale in your experiment plan before launching
+Use 90% confidence for exploratory tests and iteration, 95% for important features, and 99% for pricing or high-revenue-impact changes
+Remember that confidence level and power together determine your sample size requirements - plan both in advance
+If stakeholders push for faster results, explain the tradeoff: a lower confidence level shortens the test only by accepting a higher risk of false positives
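One way to follow the first two tips is to record the plan as an immutable object before launch. This is a minimal sketch: the `ExperimentPlan` class, the tier names, and the example values are hypothetical, and the tier-to-level mapping simply mirrors the guidance above.

```python
from dataclasses import dataclass, FrozenInstanceError

# Hypothetical stake tiers mirroring the tips above
CONFIDENCE_BY_STAKES = {"exploratory": 0.90, "important": 0.95, "high_impact": 0.99}

@dataclass(frozen=True)  # frozen: fields cannot be reassigned after creation
class ExperimentPlan:
    name: str
    stakes: str
    power: float
    rationale: str

    @property
    def confidence(self) -> float:
        return CONFIDENCE_BY_STAKES[self.stakes]

    @property
    def alpha(self) -> float:
        # Significance threshold implied by the chosen confidence level
        return 1 - self.confidence

plan = ExperimentPlan(
    name="annual pricing page",
    stakes="high_impact",
    power=0.80,
    rationale="hard to reverse, affects all customers",
)
print(f"{plan.name}: {plan.confidence:.0%} confidence, {plan.power:.0%} power")

try:
    plan.stakes = "exploratory"  # attempt to relax the level after the fact
except FrozenInstanceError:
    print("plan is locked; create a new plan and record why")
```

Freezing the object does not prevent someone from writing a new plan, but it makes a change of confidence level an explicit, reviewable step rather than a quiet edit.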

Related Terms

Experimentation

Confidence Interval

P-Value

Type I Error

Statistical Power

Hypothesis Testing
