Picture the owner of a small accountancy practice on a Wednesday afternoon. Two of her senior staff have started using ChatGPT for first drafts of client letters and tax-position memos. She has read about the New York lawyer who got sanctioned for citing fake cases, and she has read about the UK barristers in the press more recently. She assumes the problem is specific to legal work. Her team is not writing court filings. They are writing accountancy memos. She closes the article and gets back to the VAT return on her desk.
That assumption is the failure.
What actually happened in the three cases?
In each case, a qualified professional treated chatbot output as research output and filed it without checking the citations. Mata v Avianca produced a $5,000 sanction in 2023. R (Ayinde) v Haringey and Al-Haroun v Qatar National Bank brought the same pattern to the UK High Court in June 2025, with judicial findings that the threshold for contempt had been met and wasted costs orders against both sides.
Mata was the first public sanction. In June 2023, attorney Steven Schwartz used ChatGPT to research a personal-injury claim against Avianca airlines in the US Southern District of New York. The model produced six case citations that supported his position. Schwartz attached them to an affidavit. None of the cases existed. The airline’s counsel could not find them. The court could not find them. Judge Castel imposed the sanction on Schwartz and his firm, and the ruling has been cited in every professional guidance note on AI ever since.
Two years later, the UK High Court ruled on Ayinde and Al-Haroun within days of each other. Ayinde involved five fabricated authorities cited in grounds for judicial review, with the Divisional Court finding that the barrister had either knowingly relied on fake cases or used AI and then given an untruthful account of doing so. Al-Haroun involved witness statements with citations that were either invented or misquoted. The claimant admitted using public AI tools. The solicitor conceded he had relied on the client’s research without independently checking it. The court called that a “lamentable failure” of the basic professional check.
Why does this matter to firms that are not law firms?
Because the failure mechanism is not legal, it is linguistic. A large language model is a probability engine over text. When you ask it for a citation it produces text that looks like a citation, scored on plausibility, not on whether the cited thing exists. That same mechanism produces plausible-looking HMRC manual references, FCA handbook clause numbers, audit standards and academic citations with identical confidence to the real ones.
The model has no concept of “real” versus “invented.” It has a concept of “what would a citation in this context plausibly look like.” Deloitte Australia learned this in 2025 when its AUD 440,000 report for the Department of Employment contained fabricated academic citations and a fake judicial quote, drafted with GPT-4o. The firm refunded part of the fee and republished the report. The pattern is identical to Mata. A professional treated AI output as research output, did not verify the citations, and the document went out with fakes inside it.
Where will you actually meet this risk in your firm?
In any client-facing document that contains a reference to an external authority. That is a wider net than it first sounds. A compliance memo citing FCA handbook paragraphs. A tax-position letter citing HMRC’s internal manual. An audit working paper citing an International Standard on Auditing. A market-analysis report citing percentages from a named industry study. A grant application citing a peer-reviewed paper.
Each one of those is a citation, and each one is fakeable by a confident-sounding model. The Doughty Street Chambers catalogue, updated in early 2026, lists at least thirty-eight UK decisions in which AI-generated content or unreliable digital evidence has surfaced. The cases include Chandra v Royal Mail Group in the Employment Tribunal and M v F in the Family Court, well outside the headline judicial-review and commercial-litigation contexts of Ayinde and Al-Haroun. The October 2025 Judicial Office guidance applies the same caution to judges: confidential information must not be entered into public AI tools, and AI output must be checked regardless of how confident it sounds.
When should you ask hard verification questions and when can you let it ride?
The cleanest rule is the one the SRA, the Bar Council and the Judicial Office all converge on. AI for low-risk, internal, non-cited work is fine. The bar rises sharply the moment the output contains a reference to an external authority, leaves the firm in any form, carries the firm’s name as the source, or will be relied on by a client, regulator or counterparty.
At that point every cited URL gets opened. Every statute reference gets cross-checked. Every HMRC manual paragraph gets opened in the manual. Every named report gets the figure verified against the report itself, not against a chatbot’s summary of the report. The person doing the check is named on the file. The check is logged.
The point of the log is not bureaucracy. The point is that when something does land wrong, the firm can show what was checked and by whom, and the partner who signed the document has not inverted the responsibility relationship the way Al-Haroun’s solicitor did when he leant on the client’s own AI-assisted research. That inversion was what the court reacted to as much as the fake cases themselves.
What this is not, and what to do this week
This is not a recommendation to ban AI from professional work. It is a recommendation to put the verification discipline in place before the first hallucination lands in a client document, rather than after. The lawyers who were sanctioned did not lack training, they had decades of it. What they lacked was a process step that made verification a checklist item with a name against it.
For a small firm starting from nothing, three moves cover the bulk of the risk. Write down which document types are allowed to contain AI-drafted content and which are not. Define the verification step for every cited reference, who does it, where they record it, and what the sign-off looks like. Train every staff member, including the one who is fast with AI and impatient with checks, that the AI sounding confident is the failure mode they are checking against, not the signal that the check is unnecessary.
The firms that get this right will use AI heavily without writing a Mata. The firms that get it wrong will end up cited in someone else’s blog post about how the pattern played out in their sector.



