Free anomaly detector — paste numbers, get outliers.
Paste a column of numbers. The detector runs a rolling z-score across the series and flags every point that's statistically unusual — with severity (warning at |z| > 2, critical at |z| > 3) and how many standard deviations from the rolling mean it sits at. Your data never leaves your browser.
1Your data
2Detection settings
Ready when you are
Paste a column of numbers and click Detect anomalies. Or hit Load sample to see how it works on a series with 2 obvious outliers.
How does the anomaly detector work?
The detector applies a rolling z-score: for each point, it computes the mean and standard deviation of the previous N points (the rolling window) and asks how many standard deviations the current point sits from the rolling mean. Points beyond a configurable threshold are flagged.
- Z-score formula:
(value − rolling_mean) / rolling_stdev - Default thresholds: |z| ≥ 2 = warning (statistically unusual, ~5% of points in a normal distribution); |z| ≥ 3 = critical (~0.3% — almost certainly worth investigating).
- Default window: 12 (one year of monthly data, or one quarter of weekly). Shorter windows are more reactive; longer windows are more stable.
- Burn-in: the first N points have no rolling stats yet, so they're labelled "n/a" rather than scored.
When to use this
- Variance review — finance / FP&A: anomalies in line items often map directly to variances vs budget.
- Audit prep — flagged transactions become candidates for the audit work programme.
- Reconciliation — differences between systems often surface as outliers in one of them.
- Fraud screening — a starting filter for human review, not a verdict.
- Data quality — outliers often reveal export bugs or data-entry errors.
Limits of the rolling z-score
- Strong seasonality can produce false positives — a Q4 spike isn't an anomaly if Q4 always spikes. For seasonal data, deseasonalise first or use a more sophisticated model (Holt-Winters residual scoring on the platform).
- Structural breaks (a pivot, a merger, a regulation change) shift the underlying mean, generating short-term false positives until the rolling window adapts.
- Skewed distributions (where data isn't roughly symmetric around the mean) inflate the right tail's z-scores. Consider log-transform first, or use an interquartile-range-based detector.
- Multivariate anomalies (revenue and cost moving in opposite directions when they should track) need a correlation-residual approach. Available on the full DataHub Pro platform.
Want this on every numeric column of your spreadsheet?
This calculator runs one column at a time. DataHub Pro automatically scans every numeric column on upload, ranks anomalies by severity and value impact, ties them back to source rows, and includes them in your branded Word/PowerPoint reports. from $14.99/mo — 7-day free trial.
Run anomaly detection on every column with one click.
Upload any spreadsheet — DataHub Pro scans every numeric column and surfaces top anomalies in your KPI dashboard and exported reports.
