Objective
Describe the characteristics of a normal distribution and apply them to your marketing metrics: conversion rate (CR) and average order value (AOV), with validity checks and actionable implications.
Key characteristics of a normal distribution
- Shape and symmetry: Bell-shaped, perfectly symmetric around the mean; mean = median = mode.
- Defined by two parameters: Mean (μ) sets the center; standard deviation (σ) sets the spread. Variance is σ².
- Empirical rule (68–95–99.7):
- About 68% of values lie within ±1σ of μ.
- About 95% within ±2σ.
- About 99.7% within ±3σ.
- Z-scores: Standardizing (z = (x − μ) / σ) allows probability calculations and anomaly detection.
- Additivity/approximation: Sums/averages of many independent effects tend toward normality (central limit theorem), making normal a practical model for aggregated marketing metrics.
- Tails: The tails are thin; extreme values are rare but possible and quantifiable.
Application to your metrics
Normality checks
- CR normality test p = 0.09; AOV p = 0.15. Both > 0.05 → fail to reject normality; “approximately normal” is reasonable for these data.
- Note: The provided CR variance (0.0016) is inconsistent with σ = 0.4% if σ is in proportion units (0.004), which implies variance 0.000016. I’ll use σ for interval and probability estimates.
Conversion rate (μ = 3.2%, σ = 0.4%, n = 3000)
- One-sigma range (≈68% of observations): 2.8% to 3.6%.
- Two-sigma range (≈95%): 2.4% to 4.0%.
- Three-sigma range (≈99.7%): 2.0% to 4.4%.
- Example probabilities:
- P(CR > 3.9%): z = (3.9 − 3.2) / 0.4 = 1.75 → ≈ 4.0%.
- P(CR < 2.5%): same z magnitude → ≈ 4.0%.
- Precision of the mean (95% CI for μ): SE = 0.4% / √3000 ≈ 0.0073%; CI ≈ 3.2% ± 0.014% → [3.186%, 3.214%]. The mean estimate is very stable.
Average order value (μ = 86, σ = 12, n = 3000)
- One-sigma range (≈68%): 74 to 98.
- Two-sigma range (≈95%): 62 to 110.
- Three-sigma range (≈99.7%): 50 to 122.
- Precision of the mean (95% CI for μ): SE = 12 / √3000 ≈ 0.219; CI ≈ 86 ± 0.429 → [85.571, 86.429].
Anomaly assessment: Thursday AOV = 125
- z = (125 − 86) / 12 = 3.25.
- One-sided tail probability ≈ 0.0006; two-sided ≈ 0.0012. This exceeds the 3σ upper bound (~122), making it a statistically rare event under normality.
- Business implications: Treat as an outlier likely driven by a discrete factor (promotion, campaign targeting, mix shift, data error). Investigate and document; consider robust handling (e.g., winsorize or analyze with/without the point).
Practical notes for marketing analysis
- CR is bounded between 0 and 1 and often arises from binomial processes; at μ = 3.2% with modest σ and large n, the normal approximation is acceptable, but beta/binomial modeling can be considered for finer inference.
- Normal modeling supports:
- Setting alert thresholds (e.g., 2σ bands for monitoring).
- Quantifying the rarity of spikes/drops.
- Building confidence intervals for KPIs to separate signal from noise.
Conclusions
- Both CR and AOV are approximately normal in your sample, enabling reliable use of z-scores, sigma bands, and confidence intervals.
- Expected operational ranges:
- CR: 2.4%–4.0% (95% band); mean tightly estimated at ~3.2%.
- AOV: 62–110 (95% band).
- The Thursday AOV = 125 is a statistically rare outlier (>3σ); investigate cause and treat carefully in reporting and optimization.