Feature · Conversational Analytics

Ask Your Data. Get answers backed by real maths.

Q: How is this different from ChatGPT's Advanced Data Analysis?

ChatGPT's ADA mode runs Python in a sandboxed container and can do real maths. The differences: (1) ADA generates code in the open ended chat — there's no curated tool catalogue, so a wrong column name silently produces a wrong answer; (2) the audit trail is the chat history, not a structured trace; (3) session timeouts and lost context are common; (4) there's no native export to Word/PowerPoint, white-labelling, or dashboard pinning. Ask Your Data is purpose-built for analytics workflows that end in a deliverable.

Q: Does the AI ever hallucinate numbers?

Numerical answers come from deterministic tools running real pandas on your file. The model's role is to pick the tool, not to invent the number. We've not seen a confirmed hallucinated number in production, but the audit trail makes any error visible — wrong column picked, wrong filter applied — so you'd see it before acting on it.

Q: Can it answer questions across multiple sheets or files?

Within a single workbook (multiple sheets), yes — pick a primary sheet on upload and the AI can join across sheets when asked. Across multiple files, the workflow is to combine them first using Pipelines (the platform's join/transform builder) and then chat over the combined output. We're working on multi-file native support.

Q: Does it remember previous sessions?

Within a single conversation, yes — context, prior answers, and tool outputs are reused for follow-ups. Across sessions, no — each new conversation starts clean for data-isolation reasons. Saved insights and pinned charts persist across sessions in the dashboard.

Q: How long do answers take?

Most answers return in 5–8 seconds on files under 1M rows. Multi-step questions (e.g. "top 5 customers by revenue minus refunds, plotted monthly") can take 15–25 seconds because each tool call is a round-trip. Latency on very large files (10M+) isn't supported on the spreadsheet pathway — those workflows belong on a warehouse.

Q: Can I see what data the AI looked at?

Yes. Each tool call shows the rows it touched and the columns it used. For aggregations, you can expand the trace to see the underlying detail. For filters, the output row count is visible — so "the AI summed 50 rows" is never confused with "the AI summed 5,000 rows".

Q: Is there a free tier?

Yes. The free tier includes Ask Your Data with a daily question limit. Pro removes the limit and adds export, dashboard pinning, scheduled questions and audit logs. See pricing for details.

Q: Can I use Ask Your Data on Shopify, Stripe, or HubSpot exports?

Yes — any of those tools' CSV exports work directly. We also have native Shopify and Google Sheets connectors that pull data on a schedule, so the file in Ask Your Data is always fresh.

Q: Does the chat support charts?

Yes. Ask "plot revenue by month" and the AI picks an appropriate chart type, renders it inline, and offers options to pin it to a dashboard or include it in Auto Report. Chart type can be overridden if you want a specific format.

Q: How is data handled in regulated industries?

UK/EU data residency by default. SOC 2 Type II in progress. Signed DPA available on request. Audit logs of every AI conversation, tool call and export. We're a fit for finance, professional services, healthcare back-office and regulated SMEs; we're not yet certified for HIPAA-PHI workloads.

100%deterministic — pandas operations, not free-form generation

Every answerships with its operation trace

Multi-stepfollow-up questions remember context

Charts + textappropriate chart type generated automatically

Dr Waqas Rafique · Founder & CTO
LinkedIn · About

Ask Your Data in DataHub Pro is a tool-use AI: it picks the right pandas operation, runs it on your file, and returns the answer with the call trace attached. No more "the AI said the total was £4.2m, but it didn't actually compute the total". Every answer ships with the rows used, the filters applied, and the function called — so reviewing the AI's work is faster than redoing it.

Updated 7 May 2026

Start free trial → Watch 2-min demo

✓ No credit card ✓ 14-day full-access ✓ Cancel any time

20+

Tools the AI can call

load_file_data, filter_by_value, aggregate_by_field, correlate, etc.

100%

Of answers come with a call trace

Audit any number in seconds, not minutes

5–8s

Median answer latency

On files under 1M rows

Why most "chat with your data" products miss

The promise of a conversational analyst is genuinely compelling: type a question, get an answer. The reality with most current implementations falls into one of three traps.

Trap 1 · Free-form text generation

The model reads your data as text and predicts plausible answers. Sometimes those answers are right. Sometimes they're confidently wrong. You can't tell which without recomputing — defeating the point of the AI.

Trap 2 · Pre-built dashboards in disguise

Some tools route every "question" to a pre-built tile. Ask anything outside the tile and you get "I can't answer that". It's a glorified search bar, not a chat with your data.

Trap 3 · No audit trail

Even when the AI does compute, you don't see the working. For finance, audit, regulated industries — "trust me" isn't a deliverable.

Ask Your Data is built on the opposite premise: the AI exists only to orchestrate tools. Every number it returns was produced by code on your data, and the code is shown.

How tool-use AI works under the hood

A loop, not a single LLM call. The model proposes a tool to call; the tool runs against your file; the result feeds back; the loop continues until the answer is grounded.

Tool catalogue, not free generation

The AI can only act through 20+ deterministic tools — load file, head, describe, filter by value/date/regex, aggregate by group, sort, correlate, top-N, pivot, forecast, segment. Each tool returns numbers, not prose.

Visible reasoning trace

Each AI turn shows: thought (one sentence on what to do next), tool name, arguments, result. Click any answer to expand the trace. The same question gives the same trace and the same answer every time.

Multi-step workflows

Questions like "top 5 customers by revenue in Q3 minus refunds" require multiple tool calls — load → filter Q3 → aggregate by customer → subtract refunds → top 5. The loop handles the orchestration.

Conversation memory

Follow-ups work naturally: "why did customer X drop in October?", "break that down by product", "forecast for the next quarter". The model carries context across turns and re-uses prior tool outputs where it can.

Charts on demand

Ask for a chart in natural language and the AI picks an appropriate type (bar, line, scatter, histogram) and renders it inline. Charts can be pinned to a custom dashboard or sent into Auto Report.

Suggested follow-ups

After every answer, the platform proposes 2-3 high-leverage follow-up questions based on what's interesting in your data. They're not generic prompts — they're contextual to the file you uploaded.

What this looks like in practice

Three example sessions, each starting from a single CSV upload.

Session 1 · Sales analysis

You: "What's our biggest customer this year?"
Trace: filter_by_date(2026-01-01, today) → aggregate_by_field(customer, sum, revenue) → sort(desc) → top_n(1)
Answer: "Acme Ltd at £487,200 across 18 invoices." — with the underlying rows shown.
You: "Plot their monthly revenue."
→ chart rendered inline, pinnable.

Session 2 · Variance investigation

You: "Why was October so far over budget?"
Trace: filter_by_date(2025-10) → group_by(line_item) → variance(actual, budget) → sort(desc, abs)
Answer: "Two line items drove 86% of the variance: Marketing £42,100 over (campaign launch), and Cost of Sales £28,400 under (delayed supplier invoices)."

Session 3 · Customer cohort

You: "Which clients haven't ordered for 90 days?"
Trace: aggregate_by_field(customer, max, order_date) → filter_by_date(< today - 90)
Answer: 47 clients, total prior LTV £312,000. CSV download offered.

Each of these would have taken 5–20 minutes in Excel. Each took under 30 seconds in Ask Your Data, with a complete audit trail of what was computed.

Who Ask Your Data is built for

Analysts who get interrupted with ad-hoc questions

The classic "just one quick number" from a stakeholder that takes 20 minutes to compute. Hand them a Ask Your Data session on the underlying file and they answer their own questions.

Finance, audit and regulated teams

The audit trail is the unlock. Any answer returned is reproducible from the trace, so internal review and external audit both move faster.

Founders and operators without an analyst

If you don't have a data team, Ask Your Data is the closest thing to having one on demand. Drop in your Stripe export or your CRM CSV and start asking questions.

When Ask Your Data is the wrong tool

If you need true real-time analytics over a transactional database (sub-second latency, freshness measured in seconds), this isn't that. Ask Your Data works on the file you upload at upload time. For live data, use a warehouse-connected BI tool — and bring the snapshot here when you need to write a report.

FAQs

How is this different from ChatGPT's Advanced Data Analysis?

ChatGPT's ADA mode runs Python in a sandboxed container and can do real maths. The differences: (1) ADA generates code in the open ended chat — there's no curated tool catalogue, so a wrong column name silently produces a wrong answer; (2) the audit trail is the chat history, not a structured trace; (3) session timeouts and lost context are common; (4) there's no native export to Word/PowerPoint, white-labelling, or dashboard pinning. Ask Your Data is purpose-built for analytics workflows that end in a deliverable.

Does the AI ever hallucinate numbers?

Numerical answers come from deterministic tools running real pandas on your file. The model's role is to pick the tool, not to invent the number. We've not seen a confirmed hallucinated number in production, but the audit trail makes any error visible — wrong column picked, wrong filter applied — so you'd see it before acting on it.

Can it answer questions across multiple sheets or files?

Within a single workbook (multiple sheets), yes — pick a primary sheet on upload and the AI can join across sheets when asked. Across multiple files, the workflow is to combine them first using Pipelines (the platform's join/transform builder) and then chat over the combined output. We're working on multi-file native support.

Does it remember previous sessions?

Within a single conversation, yes — context, prior answers, and tool outputs are reused for follow-ups. Across sessions, no — each new conversation starts clean for data-isolation reasons. Saved insights and pinned charts persist across sessions in the dashboard.

How long do answers take?

Most answers return in 5–8 seconds on files under 1M rows. Multi-step questions (e.g. "top 5 customers by revenue minus refunds, plotted monthly") can take 15–25 seconds because each tool call is a round-trip. Latency on very large files (10M+) isn't supported on the spreadsheet pathway — those workflows belong on a warehouse.

Can I see what data the AI looked at?

Yes. Each tool call shows the rows it touched and the columns it used. For aggregations, you can expand the trace to see the underlying detail. For filters, the output row count is visible — so "the AI summed 50 rows" is never confused with "the AI summed 5,000 rows".

Is there a free tier?

Yes. The free tier includes Ask Your Data with a daily question limit. Pro removes the limit and adds export, dashboard pinning, scheduled questions and audit logs. See pricing for details.

Can I use Ask Your Data on Shopify, Stripe, or HubSpot exports?

Yes — any of those tools' CSV exports work directly. We also have native Shopify and Google Sheets connectors that pull data on a schedule, so the file in Ask Your Data is always fresh.

Does the chat support charts?

Yes. Ask "plot revenue by month" and the AI picks an appropriate chart type, renders it inline, and offers options to pin it to a dashboard or include it in Auto Report. Chart type can be overridden if you want a specific format.

How is data handled in regulated industries?

UK/EU data residency by default. SOC 2 Type II in progress. Signed DPA available on request. Audit logs of every AI conversation, tool call and export. We're a fit for finance, professional services, healthcare back-office and regulated SMEs; we're not yet certified for HIPAA-PHI workloads.

Have a real conversation with your spreadsheet.

Drop in a CSV. Ask the questions you'd usually ask a junior analyst. Get answers with audit trails attached, in seconds.

Start free → Watch 2-min demo

✓ No credit card ✓ 14-day full-access ✓ Cancel any time

Compare with the rest of the market — we’ve published an independent 2026 review of all 12 AI tools for Excel data analysis (Ajelix, Formula Bot, GPT Excel, Microsoft Copilot, Numerous AI, ChatGPT, Polymer, Tableau, Power BI, Domo, KNIME).

Frequently asked questions

How does Ask Your Data avoid AI hallucinations?

By using LLM tool-use rather than free-form generation. The AI writes pandas operations, executes them on the actual dataset, and returns both the answer and the operations it ran. The maths is therefore deterministic and auditable, even though the question-understanding is AI-driven.

What kinds of questions can I ask my data?

Aggregations ('total revenue last quarter'), comparisons ('top 10 customers by lifetime value'), trends ('how has churn changed month-over-month'), filters ('orders over £500 from London customers'), and combinations of these. Multi-step questions work too: 'what's the LTV of customers who churned in 2025?'

Can it produce charts in addition to numbers?

Yes — the AI generates appropriate chart types alongside numerical answers. Bar, line, scatter, heatmap, and a few specialised analytics charts (cohort, RFM scatter).

Does it work on uploaded Excel/CSV or only on connected data sources?

Both. Drop in Excel/CSV files, or connect Shopify, and ask questions across them. Direct database connectors are on the roadmap.

How is this different from Excel Copilot?

Microsoft Copilot is integrated into the Excel grid itself and limited to operations Excel natively supports. Ask Your Data runs over the data more deeply (cohort, RFM, time-series) and ships exportable reports rather than living inside one Excel file.