AI theatre or real progress: how a founder tells the difference

TL;DR

AI theatre is what happens when an AI initiative is judged on how it presents rather than what it produces. Tool counts, polished demos, and usage reports substitute for named business outcomes, and a confident update creates the impression of progress where none exists. For a founder who has delegated an AI mandate, three questions, applied consistently, separate genuine progress from performance without requiring any technical knowledge.

Key takeaways

- AI theatre is what happens when activity metrics, tool counts, licences deployed, and demo polish substitute for named business outcomes.
- The Addepar test asks whether an initiative would still matter if it did not use AI; if the answer is no, the AI has become the point rather than a means to an end.
- High AI usage alongside flat business results is the classic theatre signature; genuine progress shows leading and lagging indicators moving together over time.
- Three questions puncture a theatrical update without confrontation, asking what success looks like in six months, what would be lost if the initiative paused, and what single metric is being used to evaluate it.
- A founder does not need technical knowledge to apply this test; the skill is asking outcome-focused questions and noticing when the answers go vague.

You’ve just sat through a demo. The slides were clean, the examples crisp, and the person presenting clearly knew their material. Fifteen minutes later, back at your desk, something felt off. There was no mention of a specific problem it solved, no before-and-after, no number. Just capability, neatly presented.

That feeling is worth attending to. What you watched might have been genuine progress, or it might have been theatre. For a founder who has delegated an AI mandate, the difference is not always obvious, and the stakes for getting the read wrong are higher than they appear.

What is AI theatre?

AI theatre is what you get when an initiative is judged on how it presents rather than what it produces. Tool counts, licence deployments, polished demos, and usage reports are all held up as evidence that AI is working in the business, with no named business outcome attached. The term has been in active use in implementation planning circles since around 2023, and the failure mode it describes is well documented.

The signs are consistent across business types. The update sounds confident. The tools are real. The team has been busy. But when you ask what problem the work has solved, the answer gets vague quickly. That vagueness is the tell.

Part of what keeps the pattern alive is that AI vendors and internal teams both have reasons to show activity. Licence purchases are easy to track. Demos can be scheduled in an afternoon. Measurable business outcomes take time to emerge, sometimes 12 to 24 months by the estimates of implementation practitioners, and in the interim something has to go into the report. Activity fills the gap that outcomes haven’t closed yet.

Why does this matter for a founder?

A founder who has delegated an AI mandate does not have the time or the technical depth to audit every initiative. That is exactly why a confident update with nothing underneath it is the most dangerous kind. You accept it, the team relaxes, and the moment passes. The gap between what AI is supposedly doing and what your results show keeps growing.

BCG’s 2025 research found that AI usage across organisations was rising while measurable business impact remained flat. One analysis of AI implementations estimates that around 95% of pilots never reach P&L impact. Adoption increases; outcomes do not follow at the same pace.

For a founder, the exposure goes beyond wasted budget. Research into AI risk disclosures found that reputational risk was the top AI concern for 38% of S&P 500 companies surveyed. Boards expect AI to produce something tangible. A founder who cannot distinguish a genuine AI update from a performative one will struggle to manage that expectation when the questions arrive.

Where will you actually meet it?

AI theatre shows up in three predictable places in a founder-led business. The internal demo presents a tool working well in a controlled setting, often on a task that was already handled adequately. The progress update arrives heavy on adoption figures and usage statistics, with no business outcome attached. The licence announcement presents a purchasing decision as evidence of AI commitment rather than as a means to a specific end.

The tell in each case is the same. Activity language has replaced outcome language. “We have onboarded thirty users” is an activity. “We have reduced invoice processing time by 40%, which has freed the finance team for two additional days per month” is an outcome. Both can be true at the same time. But the first cannot be offered as a substitute for the second.

The drift happens gradually, particularly in teams under board pressure to show something. A demo can be arranged in a week. A licence can be procured in a day. Confirming that an AI initiative has produced real business value typically takes months, and teams learn, sometimes without noticing, to report what is available rather than what is meaningful.

When to push back and when to let it pass?

A founder who challenges every AI update will exhaust the team and slow the work. The test is whether an update can name the business problem it addresses and explain how you would recognise when the work has solved it. If the answer is clear and specific, let it pass. If the update sounds confident but the answer goes vague, that is the moment to push.

A useful test comes from Addepar, an investment management platform whose executive team has written about how they evaluate AI initiatives before committing to them. The test asks whether an initiative would still matter if it did not use AI at all. If the answer is no, AI has become the point of the initiative rather than a means to a business end. If yes, the AI is a component, and someone on the team should be able to name the outcome it is working toward.

Three questions carry this into a real conversation without making it confrontational. The first is to ask what success would look like in concrete terms six months from now. Putting an outcome on the table does not imply the current work is wrong. The second is to ask what would actually be lost if the initiative paused for a month. The answer reveals whether the work addresses a live problem or fills a gap in the reporting. The third is to ask for the single metric being used to evaluate progress. One number, not a dashboard. The question forces specificity, and if there is no clean answer, the work has not been designed around an outcome.

What does real progress actually look like?

Genuine AI progress has two recognisable signatures. The first is that the team is tracking leading indicators (adoption rates, prompts run, time saved per task) alongside lagging indicators (the revenue, margin, or capacity figures that matter to the business). High usage paired with flat business results is the classic AI theatre signature. Where progress is real, the two move together over time, and someone in the team can show you the specific work that moved the numbers.
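The comparison between leading and lagging indicators can be made concrete with a short Python sketch. It is an illustration under stated assumptions, not a method taken from any of the sources: the function names, the 2% "flat" band, and the sample figures are all hypothetical, and both indicators are assumed to be expressed so that a higher number is better.

```python
def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, as a percentage."""
    if before == 0:
        raise ValueError("before must be non-zero to compute a percentage change")
    return (after - before) / abs(before) * 100.0


def read_signature(leading_change_pct: float,
                   lagging_change_pct: float,
                   flat_band_pct: float = 2.0) -> str:
    """Compare movement in a leading and a lagging indicator.

    flat_band_pct is a hypothetical tolerance: any change at or below it
    is treated as flat. Pick a band that suits the metric in question.
    """
    leading_up = leading_change_pct > flat_band_pct
    lagging_up = lagging_change_pct > flat_band_pct
    if leading_up and lagging_up:
        return "moving together"
    if leading_up:
        # Usage is rising while the business figure is flat.
        return "theatre signature"
    return "no clear signal"


# Hypothetical figures: prompts run per week, and gross margin in percent.
usage_change = percent_change(200, 900)
margin_change = percent_change(31.0, 31.2)
print(read_signature(usage_change, margin_change))
```

With these made-up figures, usage has more than quadrupled while margin has barely moved, so the sketch reports the theatre signature. A single reading proves little; the useful question is whether the two lines start to move together over successive updates.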

A genuine update also names a specific outcome without being prompted. Consider a business development team that has cut proposal turnaround from five days to 48 hours because an AI drafting tool reduced first-draft time by 65%. The AI did the work. The business change is what the update makes visible. A founder receiving that report knows something real happened.

Tracking where a programme actually stands requires watching two different measures. Implementation guides distinguish between trending ROI, the early signals that an initiative is working, and realised ROI, the financial outcomes that confirm it. A healthy AI programme tracks both. A theatre programme tracks only the trending side and presents it as evidence of progress. Knowing the distinction gives a founder a practical filter for every update they receive.
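The trending-versus-realised distinction can be applied as a simple filter, sketched here in Python. This is a minimal illustration, assuming an update can be summarised as two sets of named figures; `ProgrammeReport`, `roi_filter`, and the metric names are hypothetical and do not come from the implementation guides mentioned above.

```python
from dataclasses import dataclass, field


@dataclass
class ProgrammeReport:
    """A hypothetical summary of one AI update."""
    name: str
    # Early signals, for example adoption or time saved per task.
    trending: dict[str, float] = field(default_factory=dict)
    # Confirmed financial or operational outcomes.
    realised: dict[str, float] = field(default_factory=dict)


def roi_filter(report: ProgrammeReport) -> str:
    """Classify a report by which side of the ROI picture it tracks."""
    if report.trending and report.realised:
        return "tracks both"
    if report.trending:
        return "trending only"
    if report.realised:
        return "realised only"
    return "nothing tracked"


update = ProgrammeReport(
    name="drafting assistant",
    trending={"weekly_active_users": 30},
)
print(roi_filter(update))
```

A "trending only" result is not proof of theatre, since realised outcomes take time to appear. It is a prompt to ask when a realised figure is expected and which one it will be.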

The founder’s advantage here comes from asking outcome-focused questions and noticing when the answers go vague. AI theatre persists in owner-managed businesses because no one in the chain between the board and the team insists on connecting activity to outcomes. That insistence is a founder’s job. It requires no understanding of the underlying technology to apply.

Sources

- BCG (2025). The AI Adoption Puzzle: Why Usage Is Up but Impact Is Not. Finds that AI usage across organisations is rising while measurable business impact has remained flat, the core pattern underlying AI theatre. https://www.bcg.com/publications/2025/ai-adoption-puzzle-why-usage-up-impact-not
- Spencer Stuart (2025). Don't Delegate AI: A Power User Playbook for CEOs. Covers the delegate-assignment dynamic and the conditions under which confident AI updates can mask real implementation gaps. https://www.spencerstuart.com/research-and-insight/dont-delegate-ai-a-power-user-playbook-for-ceos
- Korn Ferry (2025). 6 Signs Leaders Lack AI Readiness and How to Fix It. Identifies the paradox of assigning AI leadership to strong operators who lack AI-specific competencies, creating a deep mismatch between expectation and delivery. https://www.kornferry.com/insights/featured-topics/gen-ai-in-the-workplace-articles/6-signs-leaders-lack-ai-readiness-and-how-to-fix-it
- Harvard Law School Corporate Governance Blog (2025). AI Risk Disclosures in the S&P 500: Reputation, Cybersecurity and Regulation. Reports that reputational risk is the top AI concern for 38% of S&P 500 companies, providing context for why boards scrutinise AI performance claims closely. https://corpgov.law.harvard.edu/2025/10/15/ai-risk-disclosures-in-the-sp-500-reputation-cybersecurity-and-regulation/
- EY (2025). AI Governance: Board Response to Investor Expectations. Covers board-level pressure on AI accountability and the growing expectation that AI produces measurable outcomes rather than activity. https://www.ey.com/en_us/board-matters/ai-governance-board-response-to-investor-expectations
- PwC (2025). AI Predictions. Covers the gap between board AI expectations and on-the-ground implementation reality in mid-market and owner-managed businesses. https://www.pwc.com/us/en/tech-effect/ai-analytics/ai-predictions.html
- Addepar (2025). Questions Executives Should Ask Before Adopting AI. The source of the "would this still matter without AI?" test and the formal framing of AI theatre as a named implementation failure mode. https://addepar.com/blog/questions-executives-should-ask-before-adopting-ai
- LogixGuru (2025). The Board Wants an AI Strategy by Tuesday. Defines AI theatre explicitly as flashy demos with no measurable outcome and sets out a 90-day planning structure designed to avoid it. https://www.logixguru.com/post/the-board-wants-an-ai-strategy-by-tuesday-a-cios-survival-guide
- SR Analytics (2025). Why 95% of AI Projects Fail. Analyses the pilot failure rate and the conditions that separate the small share of AI initiatives that reach P&L impact from those that stall. https://sranalytics.io/blog/why-95-of-ai-projects-fail/
- Propeller (2025). Measuring AI ROI: How to Build an AI Strategy That Captures Business Value. Introduces the dual-ROI frame of trending ROI (early signals) and realised ROI (financial outcomes), with realistic timelines of 12 to 24 months for meaningful impact. https://propeller.com/blog/measuring-ai-roi-how-to-build-an-ai-strategy-that-captures-business-value

Frequently asked questions

How do I tell whether an AI update represents real progress without technical knowledge?

Look for outcome language rather than activity language. A genuine update names the business problem that was solved and quantifies the improvement. If the update counts tools, licences, or users without attaching those numbers to a business result, the work may be real but the reporting is not. Ask for one metric and a before-and-after figure.

What is the Addepar test for AI initiatives?

The Addepar test asks whether an initiative would still matter if it did not use AI. If the answer is no, AI has become the point of the initiative rather than a means to a business end, which is a warning sign. Genuine AI initiatives address a named business problem, and the technology is one component of the solution. The test takes about thirty seconds and has a high signal-to-noise ratio.

What should I say when my team presents an AI demo and I cannot tell if it represents real progress?

Ask three things. First, what would success look like in concrete terms six months from now. Second, what would be lost if this initiative paused for a month. Third, what single metric is being used to evaluate it. These questions do not require technical knowledge. They signal that your standard for progress is outcomes, not activity, which sets the right expectation for every update that follows.

Written by Dr Dave Heath, AI consultant and business strategist.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.
