Preventing client data from reaching public AI tools

Someone on your team is already using AI. They started with low-stakes tasks, found it genuinely useful, and gradually extended it to the work where the time saving is greatest, turning call notes into a summary email, drafting a proposal from raw client input, pulling key points from a contract before a meeting. There was no rule against it. Nobody thought to ask.

The issue is that client names, project details, pricing, and commercially sensitive context have almost certainly entered a public AI tool at some point in that process, and your firm had no policy, no visibility, and no way to know.

That is the position a significant number of owner-managed services firms are sitting in right now.

What actually happens when staff paste client data into a public AI tool?

When staff paste client notes into a public AI tool, the prompt travels to the provider’s servers, where it may be retained for quality review, used to train the model, or logged for provider-side monitoring. Consumer-tier versions of ChatGPT, Google Gemini, and Copilot do not carry the same contractual protections as their enterprise counterparts. The NCSC frames this directly as an information security issue, not just a productivity concern.

The distinction between consumer and enterprise tiers is important because the terms of service for public tools have historically permitted providers to use inputs to improve their systems, unless users opt out of mechanisms they may never have seen. OpenAI, Microsoft, and Google each publish enterprise privacy pages with specific commitments around training exclusions, data retention, and admin controls. The consumer tiers of the same products do not carry those commitments.

There is also a contractual dimension. A 2023 incident at Samsung, widely reported at the time, saw the company restrict generative AI use after staff entered sensitive internal material into a public tool. The lesson for a ten-person consultancy is the same. A prompt is a disclosure to a third party, and the fact that the disclosure happened inside a chat interface does not change that.

Why does this matter more for a services firm?

Services firms hold two distinct categories of risk, and many owners focus on only one. The first is personal data, names, contact details, anything covered by UK GDPR. The second, often more commercially significant, is client confidentiality, covering pricing, strategy, legal content, credentials, and tender documents. The ICO’s guidance covers both, and your professional duty of care to clients extends beyond the regulatory floor.

The ICO has been consistent in its message to UK organisations. Understand what data goes into AI tools and understand what happens to it afterwards. Where AI use is likely to create high risk, a data protection impact assessment is required. The FCA has gone further for regulated firms, stating that using AI does not remove your governance, record-keeping, or client data obligations. For a firm that supplies regulated clients, the FCA’s standards can reach you even if you are not regulated yourself.

UK GDPR fines can reach up to 4% of annual worldwide turnover for the most serious breaches, and a reportable breach must be notified to the ICO within 72 hours of discovery. Neither scenario is likely from a single accidental disclosure, but together they illustrate why this is a governance matter, not just an IT one.

Where will you actually meet this risk in a services firm?

The highest-risk moments tend to be the everyday shortcuts. Turning call notes into a summary email, drafting a proposal from raw client input, cleaning up a long email thread containing commercial terms, or pulling key points from a contract before a meeting. Each of those tasks can carry client names, financial figures, or legally privileged content. The exposure concentrates in daily workflow, in the tasks where AI fits most easily and time pressure is highest.

Verizon’s 2025 Data Breach Investigations Report found that 95% of security incidents involve some form of human error or behaviour. That finding supports a consistent principle in security, namely that policy and technical controls need to sit alongside training, because good intentions alone are not a reliable layer of defence. Both the NCSC and the ICO say the same thing. Awareness is necessary but it is the starting point, not the whole answer.

The specific categories to watch for in a services firm are call recordings or transcripts, draft proposals containing client budgets, legal documents of any kind, HR or payroll data, account credentials, financial data belonging to clients, and screenshots or database extracts containing client-identifying information. A practical test. If you would not email this content to the tool provider’s support team, it should not go into the tool.

When do you need to act, and when is the risk genuinely lower?

Proportionality matters. A firm handling financial data, legal documents, HR advice, or commercially sensitive client work sits in a different position from one whose output is largely public-facing or generic. If your firm handles the first type and has no written AI policy, no approved tool list, and no technical controls in place, the gap between your current exposure and your safeguards is worth closing before a client or regulator raises it.

Firms in a lower-risk position are not exempt. Even where no single item of information your team handles would trigger a GDPR notification, client confidentiality still applies. The professional reputation consequences of a data-handling incident can outlast any regulatory response. A sharper question is whether your clients would be comfortable knowing exactly what goes into your AI tools today.

One genuine limit case is a firm that uses a fully managed enterprise AI environment with strict no-training, no-retention, SSO, and data loss prevention controls. That firm is in a materially different position. That configuration is achievable for small firms, but it requires active setup. The public-tier tools many teams start with do not carry those settings by default.

What controls actually work for a firm of your size?

Four controls deliver the greatest protection for a small services firm without requiring a dedicated IT team. Start with a simple data classification rule, making anything client-confidential off limits for public AI tools. Approve specific products rather than AI in general, selecting only those with contractual commitments on retention and training. Give staff a safe alternative. Then back all of it with a clear incident escalation path.

The classification rule itself can be simple, covering three levels: public, internal, and client-confidential. A one-page document stating clearly what falls into each category, written in language your team can actually read, covers the basics. The UK Government AI Playbook recommends classifying data before mapping use cases, and before configuring or approving tools. Getting the sequence right means each step narrows the exposure before the next one opens new ground.

On approving specific tools, note that consumer-tier ChatGPT, business-tier ChatGPT, and ChatGPT Enterprise are different products with materially different terms. The same split applies to Microsoft Copilot and Google’s tools. Approve the version that carries admin controls, defined retention settings, and a training exclusion commitment. OpenAI, Microsoft, Google, and Anthropic all publish enterprise privacy documentation, and reading the relevant pages takes less than an hour.

On giving staff a safe alternative, consider that a team member who needs to summarise a sensitive document will find a way to do it. If the only available route is a public tool, they will use it. An approved enterprise or internal option removes both the risk and the workaround behaviour. UK guidance from both the NCSC and the UK Government AI Playbook leans towards controlled use over blanket prohibition for exactly this reason.

On the escalation path, if someone realises they have pasted client data into an unapproved tool, your firm should already know the first steps to take. The 72-hour UK GDPR breach notification window runs from the point of discovery. Having a clear response prepared is considerably cheaper than working it out in the moment.

None of this requires a large investment. The guidance from the ICO, NCSC, and UK Government is free to read and directly applicable to a firm of any size. The harder part is usually the internal conversation that gets the policy written and the approved tools confirmed. That conversation is worth having before a client asks why their information ended up somewhere unexpected.

If you want to think through what this looks like for your firm specifically, Book a conversation.

Preventing client data from reaching public AI tools

Key takeaways

What actually happens when staff paste client data into a public AI tool?

Why does this matter more for a services firm?

Where will you actually meet this risk in a services firm?

When do you need to act, and when is the risk genuinely lower?

What controls actually work for a firm of your size?

Sources

Frequently asked questions

Can I just ban staff from using AI tools entirely?

What is the difference between a public AI tool and an enterprise one?

Does UK GDPR apply to AI tools even if no personal data is pasted in?

Ready to talk it through?

If any of this sounds familiar, let's talk.

Preventing client data from reaching public AI tools

Key takeaways

What actually happens when staff paste client data into a public AI tool?

Why does this matter more for a services firm?

Where will you actually meet this risk in a services firm?

When do you need to act, and when is the risk genuinely lower?

What controls actually work for a firm of your size?

Sources

Frequently asked questions

Can I just ban staff from using AI tools entirely?

What is the difference between a public AI tool and an enterprise one?

Does UK GDPR apply to AI tools even if no personal data is pasted in?

Ready to talk it through?

Related reading

Write an AI acceptable-use policy your team will actually follow

Who owns the AI in your agency, and what do you tell the client?

What your board actually wants when it asks about AI

If any of this sounds familiar, let's talk.