A managing director I work with sat down with a list of seven tasks his team wanted to “put AI on”. Within twenty minutes, three had come off the list. Not because the AI could not have done them, but because the underlying tasks were not a fit. He asked the question worth asking: “How do I tell which ones are real and which ones are not?”
This is the qualification problem. By 2026, the question for a UK SME is rarely whether AI exists for a task; it almost always does. The question is whether that specific task is one of the use cases that pays back, or one of the 80% that does not.
The decision in front of you
Hold the failure rate in mind. RAND research puts roughly 80% of UK business AI projects in the failure column, double the rate for traditional IT. MIT’s 2025 NANDA analysis of generative AI found 95% of pilots never move past proof-of-concept. S&P Global reported 42% of companies abandoned most AI initiatives in 2025, up from 17% the year before. The pattern is consistent across independent surveys.
Qualifying a task is therefore a defensive move as much as an offensive one. You are not deciding whether AI can technically do the work. You are deciding whether your firm can absorb the cost, the governance load and the reversal risk, and whether the task sits in the narrow band where the technology actually delivers.
Six gates frame the call: data readiness, process maturity, financial threshold, governance capacity, regulatory exposure and reversibility. A use case that passes all six is a real candidate. One that fails any gate should stay manual until it passes.
When AI qualifies
A task qualifies when it sits inside structure the AI can use and inside risk you can absorb.
Data readiness comes first. The task must rest on data that is consistent, traceable and representative. Bain’s 2025 research found a third of enterprises deploying AI have not yet invested in foundational data strategy, and the projects without it stall before scale. The practical test is whether you can name a single owner of the dataset and produce a complete, consistent extract within two to four weeks.
Process maturity comes second. AI automates and amplifies an existing process; it does not invent one. McKinsey’s 2025 state-of-AI survey found high-performing companies redesign workflows around AI rather than bolt it on top of legacy ones. If you cannot draw the current process on a single page a new joiner could follow, the task is not yet a candidate.
Financial threshold comes third. A realistic AI deployment carries setup cost (often £2,000-£5,000), monthly cost (often £200-£2,000) and hidden cost (governance, rework, debugging). The conservative test is whether the system reclaims at least 30% of the annual spend on the process within twelve months.
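To see how the numbers interact, here is a minimal worked sketch in Python. Every figure is an assumption for illustration, not a benchmark; substitute your own, and notice how the hidden costs sink a marginal case.

```python
# A minimal payback check with assumed figures; substitute your own numbers.
setup_cost = 3_500             # one-off, within the £2,000-£5,000 band above
monthly_cost = 800             # within the £200-£2,000 band above
hidden_cost = 4_000            # assumed: governance, rework and debugging in year one
annual_process_spend = 40_000  # assumed: what the manual process costs today

year_one_cost = setup_cost + 12 * monthly_cost + hidden_cost  # £17,100
gross_saving = 18_000          # assumed: the process owner's estimate

net_saving = gross_saving - year_one_cost                     # £900
threshold = 0.30 * annual_process_spend                       # £12,000

# Reading the gate as a net test: the deployment must reclaim 30% of
# annual spend after its own costs, or the task stays manual.
print("qualifies" if net_saving >= threshold else "stays manual")  # stays manual
```

An £18,000 gross saving looks healthy until the year-one costs come off; the net £900 is nowhere near the £12,000 threshold, so this task stays manual.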
Governance capacity is fourth. AI requires continuous oversight, not a one-time review. The task qualifies only when you can name a person with authority to pause or kill the system, time to monitor it, and budget to fix it. The ICO and the FCA are explicit that this responsibility cannot be contracted away to the vendor.
Regulatory exposure is fifth. The lighter the regulatory weight on the decision the AI is making, the easier the task qualifies. UK GDPR and ICO rules apply to anything processing personal data. EU AI Act provisions apply from August 2026 if you serve EU customers. Sectoral rules add weight in financial services, legal practice and healthcare.
Reversibility is sixth. The task qualifies when you can switch the AI off and revert to manual within hours, with no break in customer service or compliance. Autonomous systems that make hard-to-undo changes raise the bar on every other gate.
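Taken together, the six gates are a conjunctive test: one failure keeps the task manual. The sketch below makes that explicit. The gate names follow this section; the answers for the example task are illustrative.

```python
from dataclasses import dataclass

@dataclass
class GateCheck:
    """One honest yes/no per gate, in the order set out above."""
    data_readiness: bool        # named owner, clean extract within 2-4 weeks
    process_maturity: bool      # current process fits on one page
    financial_threshold: bool   # reclaims 30% of annual spend in 12 months
    governance_capacity: bool   # named person with authority, time and budget
    regulatory_exposure: bool   # regulatory weight understood and acceptable
    reversibility: bool         # manual fallback within hours

    def qualifies(self) -> bool:
        # Conjunctive by design: a single failed gate keeps the task manual.
        return all(vars(self).values())

# Illustrative: a task that fails only on governance still does not qualify.
task = GateCheck(True, True, True, False, True, True)
print(task.qualifies())  # False
```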
When AI does not qualify
Some tasks look like AI candidates but consistently fail one or more gates.
Strategic decision support fails on process maturity and reversibility. AI language models are pattern-matchers trained on what is common, not what is right for your firm. They produce recommendations that read like a McKinsey deck and bear little relationship to your actual constraints, and the recommendations are hard to undo once acted on. Use AI to expand strategic options, not to make the choice.
Hiring decisions fail on regulatory exposure, governance and reversibility. Amazon abandoned an AI hiring system in 2018 after it learned to penalise CVs containing the word “women’s”, a pattern baked in by ten years of disproportionately male training data. UK employment law and EU AI Act provisions both require fair hiring practice and explicit bias testing. Few SMEs have the governance capacity to defend every rejection.
Regulated financial risk scoring (credit, insurance pricing, fraud detection) fails on regulatory exposure and governance. The FCA expects model explainability, fairness testing and continuous drift monitoring. Vendors cannot contractually transfer that obligation to you. The right move here is partnering with a specialist vendor who owns the validation work and contracts to fairness SLAs, not building it yourself.
Autonomous customer interaction (fully unsupervised chatbots and agents) fails on reversibility and governance. Microsoft’s Tay chatbot famously degraded within hours of its 2016 launch. A decade on, autonomous customer-facing systems are still where the brand-damage incidents land. Customer-facing AI works when a human reviews and approves; it misfires when nobody is watching.
The pattern is consistent: the AI can technically do the work, the task fails the gates anyway, and the right call is to leave it alone until something on the gate side changes.
What it costs to get wrong
The visible cost of a failed deployment is the sunk subscription, the consultant invoice and the staff hours. The invisible costs are bigger.
Rework is the largest line. Misclassified data, hallucinated outputs and wrongly routed tickets land back on the team the AI was meant to free up. A team running 80% manual review on a tool that was meant to remove manual work is worse off than before.
Regulatory exposure compounds when the AI touches personal data without qualification work behind it. The ICO is clear that data controllers retain obligations regardless of vendor promises. Costs from a serious incident (fines, legal fees, rework) run into millions. For an SME, that is the difference between a recoverable mistake and a survival event.
Governance debt accumulates quietly. An unmanaged custom GPT or chatbot deployed by an enthusiastic team member, never reviewed, never refreshed, becomes a liability long after the original use case has moved on.
Customer trust damage is hardest to recover. An AI that misprices, misroutes or misadvises in a high-stakes moment hurts the relationship long after the technical fix. The reversibility gate exists because this cost is asymmetric: the upside is incremental, the downside is structural.
What to ask before you commit
Five questions, in order, before any AI procurement decision.
One: which gate does this task fail today, and what would have to be true for it to pass? An honest answer is more useful than a long ROI spreadsheet. If the task fails on data readiness, the right next step is data work, not vendor selection.
Two: who owns this if it goes wrong? Name the person, authority, budget and time. If the answer is “we will work that out later”, the task does not qualify yet.
Three: what does the rollback look like? Reversibility is not a feature, it is a design choice. If the team cannot describe the manual fallback in two sentences, you are buying autonomous when you needed assistive.
Four: where is the regulatory weight? UK GDPR, sectoral rules, EU AI Act provisions, professional indemnity insurance. The vendor will not raise these. The auditor will.
Five: what is the success metric, and what is the failure metric? Both. Time saved is half the picture. The other half is the floor under accuracy, customer satisfaction and compliance below which you stop the system. Vendors will help you set the first. You set the second.
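One way to make the failure metric real is to write the floors down as a hard stop rule before launch. The sketch below assumes three metrics and three floor values; yours will differ, but the shape holds: breach a floor, pause the system.

```python
# A minimal stop-rule sketch; metric names and floor values are assumed,
# not prescribed. The point is that the floors exist before go-live.
FLOORS = {
    "accuracy": 0.95,              # share of outputs passing human spot-check
    "customer_satisfaction": 4.0,  # rolling CSAT out of 5
    "compliance_pass_rate": 1.0,   # sampled outputs meeting policy, no exceptions
}

def should_stop(weekly_metrics: dict[str, float]) -> list[str]:
    """Return the metrics that breached their floor; a missing metric counts as a breach."""
    return [name for name, floor in FLOORS.items()
            if weekly_metrics.get(name, 0.0) < floor]

breaches = should_stop({"accuracy": 0.97, "customer_satisfaction": 3.6,
                        "compliance_pass_rate": 1.0})
if breaches:
    print(f"Pause the system: floor breached on {', '.join(breaches)}")
```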
The point of the framework is not to talk you out of AI. It is to direct investment at the tasks where the gate analysis says it will pay back, and leave the rest manual until something changes. A small number of qualified deployments beats a long list of half-running pilots, and the firms that get this right in 2026 are the ones still using AI productively in 2027.