What AI summaries are and where they go wrong

A person at a desk reading a printed document next to a laptop screen showing a meeting summary
TL;DR

AI summaries condense emails, meetings and documents into shorter text, but confident errors are common because the model predicts rather than understands. For a UK service firm the risk is both operational (wrong details in the summary) and regulatory, since ICO, FCA and EU AI Act obligations can apply. Treat every summary as a draft, keep the original as the record, and restrict use to low-risk workflows.

Key takeaways

- AI summaries predict statistically likely text from the original source. They do not understand the content, so confident errors, including invented numbers and fabricated decisions, are always possible. - The UK's National Cyber Security Centre recommends treating large language model outputs as untrusted by default. For any decision that matters, the original document or recording is the record. - AI summarisation is built into Microsoft 365, Google Workspace, Zoom and many CRM platforms, often switched on by default without staff realising it. - Summarising staff or client communications with AI counts as data processing under UK GDPR. Information Commissioner's Office compliance obligations apply regardless of which tool did the condensing. - Classify your workflows before applying AI summaries: low-stakes triage is fine with human review; contracts, HR notes and regulated financial advice are high-risk and should not rely on summaries.

A member of your ops team sends a project proposal to a client. The fee quoted is wrong, roughly £400 lower than agreed. When you trace back what happened, the answer is simple: she worked from the Teams meeting summary rather than the recording. The AI had condensed the call well enough to sound convincing. The one number that mattered hadn’t survived the summarisation.

That kind of error is easy to miss, precisely because the summary looked fine. AI summaries fail silently. They produce confident text that reads correctly and contains a wrong number, a fabricated decision, or a misread tone. By the time someone checks the source, the error has already done its work.

What is an AI summary?

An AI summary is a shorter version of a longer document, produced automatically by a machine-learning model. The model doesn’t understand the content the way a person would. It predicts which parts are worth keeping, based on patterns from its training data. The result can be extraction, lifting key sentences directly, or abstraction, re-wording ideas into new language that never appeared in the original.

Tools like Microsoft Copilot, Google Workspace’s AI features and Evernote’s AI Note Cleanup all use variants of this approach. They are trained on large datasets of documents paired with human-written summaries, so the model learns patterns of what human readers tend to include or drop. Evernote markets this as helping “busy professionals” get quick overviews of industry reports without reading every word. That’s accurate as far as it goes. What the marketing doesn’t mention is that the model is making educated guesses, not faithful reductions.

The UK’s National Cyber Security Centre puts it directly: large language models “do not understand the text they generate” and can produce confident-sounding content even when the original was clear. That constraint is shared by every AI summarisation tool on the market, regardless of who built it.

Why does this matter for your business?

When staff act on AI summaries without checking the source, errors in those summaries become errors in your work. For a service firm that means mis-stated fees, decisions that were never made, and occasionally a data protection problem. The Information Commissioner’s Office treats AI-assisted processing of personal data as processing under UK GDPR. Your usual compliance obligations apply whether a machine or a person wrote the note.

For firms regulated by the Financial Conduct Authority, the exposure is sharper. The FCA’s Consumer Duty makes clear that regulated firms remain responsible for client outcomes regardless of how those outcomes were reached. If an AI-generated summary was used in a client communication that misrepresented the advice given, the liability sits with the firm, not the software vendor. The FCA’s 2023 discussion paper on AI and machine learning reinforces that point: AI tools do not outsource your responsibility for operational risk.

Data protection is the other layer. Under UK GDPR, summarising client calls or staff communications with an AI tool counts as processing personal data. That means you need a lawful basis, a clear purpose limitation and, if the use is high-risk, a Data Protection Impact Assessment. The ICO’s employment practices guidance adds that staff should be told when AI tools are being used to process communications involving them, and that intrusiveness should be kept to a minimum.

Where will you actually meet AI summaries?

AI summarisation is already built into many tools a service firm uses by default. Microsoft Copilot generates meeting recaps from Teams calls. Google Workspace summarises long email threads. Zoom produces automatic Smart Recording highlights. Customer service platforms including HubSpot and Zendesk offer AI ticket summaries. And Google’s AI Overviews now pull information from multiple websites into a single answer at the top of search results.

The practical concern is that several of these features are switched on as part of a platform upgrade, without anyone making a deliberate decision. Many teams start relying on Teams meeting summaries or inbox digests without ever choosing to adopt AI. That means the risk is already present in many service firms, regardless of whether AI has appeared on the leadership agenda.

Google AI Overviews are worth calling out specifically because they affect research habits. When a staff member searches for guidance on a regulatory requirement, a pricing benchmark or a technical process, they may now see an AI-generated summary of multiple web pages before they see any of those pages. Mozilla Foundation’s 2024 review found multiple cases of Google’s AI Overviews giving incorrect medical, geographical and practical advice. Google’s own post-launch analysis acknowledged examples of “less-than-helpful” outputs produced when the model treated sarcastic forum content as authoritative guidance. For decisions that matter, an AI Overview is not a safe stopping point.

When should you trust an AI summary, and when should you verify?

AI summaries earn their keep on low-stakes tasks where the goal is a quick first pass, not a formal record. Skimming an industry report for relevance, triaging a support ticket, drafting an action list that a human then checks: all reasonable uses. The problems start when a summary replaces the source rather than pointing back to it.

Three failure modes are well documented. The first is hallucination, where the model generates content that sounds correct but wasn’t in the original. This produces invented dates, prices or decisions that were never made. The second is nuance loss: tone, context and complex argumentation often don’t survive condensation. Writer and technologist Doug Smith has argued that AI summaries can effectively erase the human voice by interposing machine-chosen highlights between speaker and listener, meaning sarcasm, frustration or subtle client signals simply disappear from the record. The third is source confusion, where the model struggles to distinguish authoritative guidance from speculation. Google’s own account of its AI Overviews problem pointed to forum content being treated as reliable advice when the model couldn’t recognise the irony in it.

For a service firm, the workflows to restrict from unsupervised AI summarisation are: contracts and legal correspondence, HR notes and disciplinary records, financial advice and suitability documentation, and anything involving personal data in a sensitive category including health records, beliefs or trade union membership. The ICO’s guidance on employment monitoring and the FCA’s Consumer Duty both point in the same direction. Human review is not optional for decisions that affect people.

What else should you understand before deploying AI summaries?

Hallucination is the technical term for when a model generates plausible-sounding text that wasn’t in the original source. It explains a significant share of AI summary errors and is why the NCSC categorises all LLM outputs as untrusted until verified. A Data Protection Impact Assessment, or DPIA, is the ICO’s process for evaluating high-risk AI uses before deploying them at scale.

The EU AI Act, formally adopted in 2024, is also worth understanding, particularly for firms with clients in EU member states or using EU-hosted AI services. The Act introduces risk categories. Systems used in employment decisions, creditworthiness assessments and access to services are classified as high-risk and require specific human oversight controls. Feeding AI summaries into promotion or performance decisions, even where the summary tool itself is not classified as high-risk, can pull those downstream decisions into the Act’s scope. UK firms without EU operations are not directly subject to the Act, but the requirements are shaping how AI vendors build their products and contracts, so the obligations will arrive via supplier terms regardless.

One other concept worth knowing is the vendor data-processing agreement, or DPA. Under ICO and FCA expectations, if a third-party AI tool is processing personal data on your behalf, you need a written contract covering security, sub-processing, breach notification and international transfers.


The value of AI summaries is real. They can save a 20-person firm hours a week on meeting notes and ticket triage. The risk is real too. It shows up as a wrong fee in a proposal, a complaint that was never properly understood, a decision the AI attributed to someone who never made it. Treat every AI summary as a draft, keep the source document or recording as the actual record, and decide clearly which workflows can use AI-assisted summaries and which cannot. That classification conversation is often the right starting point.

If you want help mapping where your current AI workflows sit on the risk spectrum, Book a conversation.

Sources

- UK National Cyber Security Centre (2023). Guidelines for secure use of generative AI. Advises treating LLM outputs as untrusted and states that models do not understand the text they generate. https://www.ncsc.gov.uk/blog-post/guidelines-secure-use-of-generative-ai - ICO (2023-24). Guidance on AI and data protection, including generative AI. Recommends DPIAs where AI processing poses high risk to individuals and sets out lawful basis requirements for processing personal data with AI tools. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/artificial-intelligence/ - ICO (2023). Employment practices: monitoring at work. Stresses transparency with staff when using AI monitoring or analytics tools and the need to limit intrusiveness. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/employment/employment-practices-monitoring-at-work/ - ICO. Guide to the UK General Data Protection Regulation (UK GDPR). Sets out lawful processing requirements that apply to AI-assisted summarisation of personal data. https://ico.org.uk/for-organisations/guide-to-data-protection/guide-to-the-general-data-protection-regulation-uk-gdpr/ - Financial Conduct Authority (2023). Discussion Paper DP5/22: Artificial Intelligence and Machine Learning. Stresses that regulated firms remain responsible for outsourcing and operational risk when using third-party AI services. https://www.fca.org.uk/publication/discussion/dp5-22.pdf - Financial Conduct Authority. Consumer Duty PS22/9. Sets out that FCA-regulated firms remain liable for client outcomes, including where AI-generated summaries are used in client communications. https://www.fca.org.uk/publications/policy-statements/ps22-9-new-consumer-duty - EUR-Lex (2024). Regulation on artificial intelligence (EU AI Act). Introduces high-risk categories including employment, creditworthiness and access to services, with human oversight requirements. https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=COM:2021:206:FIN - Mozilla Foundation (2024). When Search Engines' AI Overviews Are Bad, They're Really Bad. Independent analysis of hallucination and source-quality failures in Google AI Overviews, with documented examples of incorrect advice. https://www.mozillafoundation.org/en/blog/ai-overview-google-search/ - Google (2024). AI Overviews: what we've learned so far. Google's own post-launch review acknowledging less-than-helpful outputs, including the cheese-pizza glue example from forum content. https://blog.google/products/search/google-search-ai-overview-update/ - Doug Smith (2025). What Could Possibly Be Wrong With AI Summaries? Practitioner analysis of how AI summaries erase human voice and cause staff to trust the machine's version over what a client actually said. https://thatdougsmith.com/2025/09/06/what-could-possibly-be-wrong-with-ai-summaries/

Frequently asked questions

Are AI summaries accurate enough to use as a formal record?

Not reliably. AI summaries predict plausible text rather than transcribing accurately, and can produce invented dates, prices or decisions that never appeared in the original. The UK's National Cyber Security Centre advises treating large language model outputs as untrusted by default. For anything that forms part of a client agreement, compliance record or HR file, the original document or recording should remain the authoritative version.

Does using AI to summarise client meetings create a data protection problem?

It can. The Information Commissioner's Office treats AI-assisted processing of personal data as processing under UK GDPR. That means you need a lawful basis, a clear purpose and, for large-scale or high-risk uses, a Data Protection Impact Assessment. Summarising client calls or staff communications with a third-party AI tool also raises questions about international data transfers and vendor data-processing agreements.

Which uses of AI summaries are too high-risk for a small service firm?

The high-risk end includes contracts, legal advice, HR notes and disciplinary records, regulated financial advice, and anything involving special-category personal data such as health information or trade union membership. Using a summary rather than the original source to make decisions in these areas can create compliance risk under UK GDPR, the FCA's Consumer Duty, or, for firms with EU clients, the EU AI Act.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation