An owner-manager running a service business switches on an AI agent to handle customer queries and refund requests. The team saves four hours a week. Three months in, a complaint surfaces: the agent had approved a £300 refund on a dispute that should have been escalated to a manager. No threshold was set, nobody had reviewed a sample of outputs, and the business had no record of how the decision had been reached.
The agent worked exactly as designed. The problem was the context it had been placed in.
What choice are you actually making?
When you deploy an AI agent, you are deciding which decisions your business will hand over without a person in the loop. That is a manageable call on some workflows and a consequential one on others. The way to sort the two is straightforward: ask what happens when the agent gets it wrong, and whether you can reverse it quickly and cheaply.
The practical choice sits between deploying an agent with clear thresholds and a named human owner, and deploying one without those things. The first is a bounded deployment with accountability built in. The second is a liability with a friendly interface.
Owner-managed businesses often end up in the second category not because they are careless but because the vendor promise outran the governance conversation. A 2024 survey of UK businesses found that 54% of firms using AI cited time savings as the primary benefit, with 42% citing productivity gains and 42% citing cost savings. Those figures are real. They come from deployments where the agent’s scope was clear and someone was checking its work. Getting that clarity before you set the agent loose is the governance conversation many deployments skip.
When does an AI agent add risk rather than value?
Agents increase your exposure when the downside of an error is high, the data they touch is personal or regulated, or the process has no named owner. Five conditions reliably tip the balance towards risk: decisions with significant effects on customers, sensitive or poorly governed data, regulated activities, consumer-grade vendor terms with no data-location controls, and workflows where nobody has documented who is accountable.
The ICO’s guidance on AI and data protection expects organisations to have a lawful basis for any AI processing of personal data, and to conduct a Data Protection Impact Assessment for high-risk uses. The FCA’s Consumer Duty requires firms to avoid foreseeable harm, including harm arising from technology. An agent handling complaints, suitability decisions, or service terminations without validation and defined thresholds sits squarely in the territory both regulators are watching.
The UK government’s AI Playbook is direct on this: humans should validate any high-risk decisions influenced by AI and retain a meaningful ability to intervene. Letting an agent auto-adjust pricing, reject applicants, or resolve disputes above a defined value without a review step is a governance failure, not just a technology choice. Under FCA and ICO frameworks, accountability stays with the named person behind the decision. The Competition and Markets Authority has also indicated that AI-driven personalisation and pricing practices will face scrutiny where they create unfair consumer outcomes.
When does an agent genuinely earn its place?
Agents produce genuine value when the task is low-stakes, the data is internal and well-governed, and a named owner reviews outputs. These conditions bound the autonomy: errors are cheap to fix, impacts on customers or regulators are minimal, and the time savings arrive as intended. The clearest indicator is reversibility: if the agent gets something wrong, can your team catch and correct it before it causes harm?
Good candidates include appointment reminders, FAQ responses, internal status updates, and non-binding product queries. The NCSC’s guidance on AI security recommends defining clear performance metrics for deployed AI systems, including what normal looks like and how anomalies surface. Tracking first-response time, resolution rate, and hand-off rate to humans gives you early warning when an agent starts operating outside its intended range.
The threshold model makes this practical. UK SME guidance from 3L3C recommends monetary tiers: autonomous action below a defined amount, human review above it, manager sign-off above that again. The same logic applies to data categories, which customer records the agent may access, and which channels it can communicate through. Setting tighter thresholds gives the agent a genuine mandate rather than an open-ended remit, and that is what produces reliable value rather than unpredictable exposure.
What does it cost to get this wrong?
The costs run further than the immediate error. UK GDPR allows fines of up to £17.5 million or 4% of annual worldwide turnover for serious misuse of personal data. The ICO fined Clearview AI £7.5 million in 2022 for unlawful processing of biometric images. For owner-managed firms, the absolute exposure is lower, but a six-figure remediation bill alongside reputational damage can still end a business.
Beyond regulatory exposure, the operational costs accumulate. The UK government’s AI Playbook warns that generative models can produce plausible but incorrect information, and the NCSC identifies hallucinations as a documented, predictable behaviour of large language models. In 2023, a New York lawyer was sanctioned for submitting fabricated case law generated by ChatGPT. An unsupervised agent sending external communications is one confident but wrong output away from the same category of harm.
Professional services firms face additional exposure. The Solicitors Regulation Authority has reminded law firms that AI use must still respect confidentiality duties and client supervision obligations. Professional indemnity insurers are beginning to ask how firms govern AI tools. An agent that passes client documents to a consumer AI service without a data processing agreement may invalidate cover before anyone realises there is a problem.
What do you need to ask before you deploy?
A pre-deployment checklist does not need to be long to be useful. Six questions cover most of the ground: what is the worst realistic outcome if the agent gets it wrong, does this touch personal data or a regulated activity, what monetary and risk thresholds will cap autonomous action, who owns this agent’s behaviour day-to-day, can you evidence key decisions if challenged, and what is your rollback plan?
The threshold question is the one most often skipped. Explicit monetary limits are what separate bounded deployments from open-ended ones. FTI Consulting’s governance framework for AI agents identifies three roles every deployment needs: an owner accountable for the agent’s mandate, a reviewer who samples outputs regularly, and an approver who sets and signs off on thresholds and scope. If you cannot name a person for each role before you deploy, the agent is not ready to go live.
If the six questions produce answers you would struggle to defend to your insurer, your regulator, or a dissatisfied customer, the signal is to restrict scope before you commit to the upside. Book a conversation if you want to work through the pre-deployment checklist against your specific workflows.



