What to do when an AI system goes wrong in production

A professional reviewing documents at a desk with a laptop open in front of them
TL;DR

When an AI system goes wrong in production, the right response follows a clear sequence: contain first, classify the incident type, preserve evidence, then fix the root cause. For UK services firms, a data breach involving personal information triggers ICO reporting requirements within 72 hours, while failures touching regulated activities may involve FCA considerations. A short incident runbook and a named person who can switch the system off are the minimum viable preparation.

Key takeaways

- When an AI system starts producing wrong outputs, stop or throttle the workflow before you investigate: the NCSC incident framework puts containment before diagnosis. - If personal data is involved, the ICO's 72-hour notification clock starts when you become aware of a breach that is likely to risk individuals' rights and freedoms. - Production AI failures are often workflow failures, not just model failures: fixing the prompt alone rarely addresses the root cause. - A short runbook covering who can shut the system down, what counts as a reportable incident, and where the logs live is sufficient preparation for a 5 to 50 person firm. - Turn every incident into a regression test: that converts a failure into a safeguard against the same problem recurring unnoticed.

A senior account manager at a professional services firm noticed the problem on a Thursday afternoon. A client had replied to say their summary report contained figures from a different engagement. The AI drafting tool had been pulling context from a previous session for three weeks, and nobody in the team had caught it. The immediate priority was clear: what do they do right now, and in what order?

What does it mean for an AI system to go wrong in production?

An AI system goes wrong in production when it is live, connected to real workflows, and producing incorrect, harmful, or non-compliant outputs. That covers a wide range: a language model that hallucinates client advice, an automated tool that acts on the wrong data, or a workflow system that routes information to the wrong recipient. The type of failure shapes your response, so naming it clearly is the first practical move.

Production failures are not always dramatic. Many begin as silent quality degradation: outputs that drift from accurate to plausible-but-wrong, or automations that carry out the right action on the wrong record. The Latitude production failure guide identifies goal drift, context loss, and cascading errors in multi-agent systems among the common patterns. These are workflow failures as much as model failures, which means treating the failure as a prompt-tuning problem rarely addresses the root cause.

The NCSC’s incident management framework puts containment before diagnosis. When you spot a problem, stop the system or throttle it first. Disable the risky workflow, revoke API keys if the system is making external calls, or switch to human-only processing for that task. Investigating while the system is still running risks extending the damage before you have any picture of what went wrong.

Why does your response sequence matter more than the fix?

The order in which you respond to an AI incident affects how bad the consequences turn out to be. Containment first, then classification, then evidence preservation, then remediation. Reversing that sequence, by trying to fix the root cause while the system is still running, is how small incidents become large ones.

Once the system is stopped, classify what happened. Was this a data breach? Did personal data go to the wrong person? Did the AI produce advice that a client acted on? Did it trigger actions in downstream systems? Classification determines who else needs to be involved. A data breach involving personal information triggers the ICO’s 72-hour notification window if the breach is likely to risk individuals’ rights and freedoms. A failure touching regulated financial services activity brings FCA governance expectations into play. A cyber incident activates your insurer and, in some cases, your solicitor.

Preserve evidence before you begin any repair work. Save prompts, tool call logs, outputs, timestamps, and user actions so you can reconstruct what happened. For a small firm, this usually means screenshots, export files, and a short written account of when the problem was first noticed and what the symptoms were. That record matters if a client asks questions, a regulator makes enquiries, or your insurer needs to assess the claim.

Where do production AI failures show up in a services firm?

Production AI failures in a services firm tend to cluster around four points: outbound communications, client-facing documents, integrations with external systems, and automated decision steps. Each exposes a different type of risk. Outbound failures tend to be visible quickly. Integration failures can go unnoticed for days or weeks. The type of failure determines both the urgency and the remediation path.

Outbound communications are the most visible failure point. An AI that drafts client emails, writes summaries, or generates reports is only one session-context error away from including the wrong client’s information. The failure is immediate and obvious, but by the time it is noticed, the harm has already reached the client.

Integrations with external tools carry a subtler risk. When an AI agent is connected to CRM, accounting, or ticketing systems and authorised to take actions, a loop error or a context misread can create bad records, trigger wrong transactions, or send data outside the intended boundary. The NimbleBrain production AI guide notes that systems built without recovery paths, retry logic, and human escalation points for unknown situations are particularly exposed. For a small firm, the integration was often set up quickly and the failure modes were never mapped.

Document generation is the third common failure point. AI tools that produce contracts, reports, or proposals from template logic can propagate the same error across every document in a batch before anyone notices. One bad assumption in the context window is all it takes.

When does an AI incident become a regulatory matter?

An AI incident in a UK services firm becomes a regulatory matter when personal data is involved, when the system touches regulated activities, or when a client has suffered a concrete harm. Three bodies are most relevant in the UK: the ICO for data protection, the FCA for regulated financial services, and your insurer or solicitor for contractual or legal exposure. Classification happens before remediation.

The ICO is the first checkpoint. If your AI system accessed, disclosed, or processed personal data incorrectly, you need to assess whether the incident is likely to result in a risk to individuals’ rights and freedoms. If it is, the ICO expects notification within 72 hours. The ICO’s guidance on AI and data protection is clear that AI is not exempt from UK GDPR principles: fairness, transparency, data minimisation, and accuracy all apply, even when the processing is automated.

The FCA is relevant if your firm works in or for regulated financial services, or if the AI touched client onboarding, advice workflows, or operational processes that fall under FCA oversight. The FCA’s 2024 AI update made clear that firms remain responsible for outcomes and are expected to maintain governance and control even when AI is doing the work. A failure in that context is a conduct, record-keeping, and controls matter, not only an IT incident.

If neither applies, which is true for a firm using AI only for internal drafting with no client data and no external integrations, the response burden is substantially lower. Fix the problem, document what happened, and update your runbook.

What should you put in place before the next incident?

The practical lesson from every AI production failure is that the response playbook should exist before the incident, not be written during it. A short runbook covering four things is sufficient for a 5 to 50 person firm: who can switch the system off, what counts as a reportable incident, where the logs live, and what safe enough to re-enable means.

The UK government’s AI Playbook frames this as planning for errors, failure, and human override. In practice, for a 5 to 50 person firm, the whole thing fits on one page with a named person assigned to each decision point.

The second piece is turning each incident into a regression test. Latitude’s four-step framework for AI production failures works well at small scale: collect the trace, cluster the failure type, identify the root cause, and create an evaluation case that catches the same failure in future. A firm that treats incidents as a source of evaluation data builds more reliable AI over time.

The third point is contractual. Do not assume your vendor will handle liability or notification for you. The ICO and FCA frameworks place responsibility on the deploying organisation, regardless of where the model lives. Read the contract before an incident happens, understand what your vendor does and does not cover in a failure scenario, and know which notifications you are responsible for making yourself.

If you want to think through your firm’s exposure before something goes wrong, Book a conversation and we can work through it together.

Sources

- NCSC (2024). Incident management. Framework for containment, evidence preservation, and recovery applied to security and AI incidents. https://www.ncsc.gov.uk/collection/incident-management - ICO (2024). Personal data breaches. UK GDPR guidance covering the 72-hour notification obligation where a breach is likely to risk individuals' rights and freedoms. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/security/a-guide-to-data-security/personal-data-breaches/ - FCA (2024). Artificial intelligence in UK financial services. Update covering governance, oversight, and firm responsibility for AI-driven outcomes in regulated contexts. https://www.fca.org.uk/news/news-stories/artificial-intelligence-uk-financial-services - European Parliament and Council (2024). Regulation (EU) 2024/1689 (EU AI Act). Establishes post-market monitoring and serious-incident reporting obligations for high-risk AI systems. https://eur-lex.europa.eu/eli/reg/2024/1689/oj - ICO (2024). AI and data protection. Guidance confirming AI is subject to UK GDPR principles including fairness, transparency, accuracy, and accountability. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/artificial-intelligence/ - NCSC (2024). Artificial intelligence and cyber security. Guidance on AI-related risks including automation errors, dependency on external services, and log preservation requirements. https://www.ncsc.gov.uk/collection/artificial-intelligence-and-cyber-security - UK Government (2024). Artificial Intelligence Playbook for the UK Government. Guidance on planning for AI errors, failure, and human override, with procurement and governance expectations. https://www.gov.uk/government/publications/ai-playbook-for-the-uk-government/artificial-intelligence-playbook-for-the-uk-government-html - Latitude (2025). Detecting AI Agent Failure Modes in Production. Operational framework covering trace collection, failure clustering, root cause analysis, and regression test generation. https://latitude.so/blog/ai-agent-failure-detection-guide - NimbleBrain (2025). The Production AI Playbook: From Pilot to Production in 4 Weeks. Covers recovery paths, retry logic, audit trails, and human escalation design for production AI systems. https://nimblebrain.ai/guides/production-ai-playbook/

Frequently asked questions

Does an AI incident mean I have to tell the ICO?

Only if personal data is involved and the breach is likely to risk individuals' rights and freedoms. If your AI accessed, disclosed, or processed personal data incorrectly, assess the risk and, if reportable, notify the ICO within 72 hours of becoming aware. An AI tool that generated wrong advice with no personal data involved does not automatically trigger the reporting obligation. When in doubt, document what happened and seek legal advice.

What should I do first when I notice my AI system is producing wrong outputs?

Stop the system or throttle it before anything else. Disable the workflow, revoke API keys if it is making external calls, or switch to manual processing for that task. The NCSC's incident management framework prioritises containment before diagnosis, and that applies directly to AI failures. Investigating the root cause while the system is still running risks extending the blast radius. Once it is stopped, preserve logs, prompts, and outputs before you change anything.

If my vendor's AI tool makes the mistake, is it still my responsibility?

Yes, in the relevant UK regulatory contexts. The ICO's guidance on AI and data protection makes clear that the deploying organisation remains responsible for UK GDPR compliance, regardless of where the model or infrastructure lives. The FCA's position on AI governance is the same: firms are responsible for outcomes, not just inputs. Read your contract to understand what the vendor covers in a failure scenario, but do not assume they handle regulatory notification or client liability on your behalf.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation