When confidential information should never go into AI

A business owner pausing at a laptop in a quiet office, hands resting on the keyboard
TL;DR

Public AI tools like ChatGPT offer no contractual protection for data you input. Under UK GDPR, organisations remain fully responsible for what enters any AI system, and the ICO explicitly warns against putting sensitive information into public tools. Whether controlled AI use is lawful depends on having a written data-processing agreement, confirmed data residency, access logging, and an internal policy before confidential data goes near any AI model.

Key takeaways

- Public AI tools may log your inputs and use them for model training; under UK GDPR you remain fully responsible for whatever personal data you submit. - The ICO and NCSC have both issued explicit guidance warning against putting sensitive information into public generative AI services. - Four categories should never enter a public AI tool: personal data identifying a living person, UK GDPR special category data, trade secrets, and anything your client contracts restrict from third-party processing. - Enterprise AI deployments with written data-processing agreements, UK or EU data residency, and access logging can make controlled use of certain confidential data lawful. - A written internal AI policy specifying approved tools and permitted data categories is the lowest-cost safeguard a small firm can put in place today.

An operations director at a 15-person consulting firm told me recently that several of her team were pasting client board papers into ChatGPT to generate meeting summaries. She wanted to know whether that was a problem. The honest answer: it depends on which tool, which data, and what contractual protections are in place, and the gap between “fine” and “a UK GDPR breach” is narrower than many founders expect.

What choice are you actually facing?

Many owner-managed firms right now face two distinct options: public AI tools available to anyone online, where inputs may be logged and used to improve the model, or enterprise-grade deployments backed by written contracts, confirmed data residency, and explicit commitments about what happens to content after it is submitted. Understanding where your data lands in each scenario is the foundation of a workable AI policy.

A survey by Technology Reseller UK found that 67% of UK organisations cannot tell whether staff are sharing information via secure AI platforms, and 35% openly admit employees are using external tools without proper controls or visibility. For a professional services firm, that blind spot matters. Pasting a client’s full case notes into a public chat interface and routing anonymised summaries through a contract-backed enterprise deployment are not the same act, legally or in practice.

The data categories in the frame span several types: anything that can identify a living person under UK GDPR, special category data such as health records, financial histories, or religious beliefs, trade secrets and unpublished commercial strategies, and anything your client contracts say must not be processed by third parties without prior written consent. The right governance step varies by category, which is why this question deserves a considered answer rather than a blanket rule.

When should confidential data stay out of AI tools entirely?

For public AI tools, the rule is clear: keep out any information that identifies a living person, falls under special category data under UK GDPR, constitutes a trade secret, or is covered by professional confidentiality obligations. The NCSC’s guidance states that organisations should avoid entering sensitive information into public generative AI services because prompts may be stored, used for training, or accessed by the provider or others.

The ICO confirms that organisations remain fully responsible for UK GDPR compliance when using generative AI, including the requirements for a lawful basis, data minimisation, and appropriate data-processing agreements with any AI provider. Where those agreements do not exist, personal data should not go in. The same applies to professionally regulated firms: the SRA Code of Conduct requires solicitors to keep client affairs confidential, and FCA rules treat client data protection as a systems-and-controls obligation.

The Samsung incident in 2023 made this concrete. Staff pasted proprietary source code and internal meeting notes into ChatGPT; that material may have been stored and used in training before the company restricted all use of external AI tools. Trade secrets law protects information that has commercial value because it is secret and is subject to reasonable steps to keep it that way. Submitting that material to a public AI tool can undermine those steps in ways that are difficult to reverse.

When can AI tools handle confidential data safely?

Enterprise AI deployments can handle certain categories of confidential data lawfully, provided the right contracts and controls are in place. Microsoft Azure OpenAI, Google Vertex AI, and Anthropic’s Claude via Amazon Web Services each offer written data-processing agreements, commitments against using customer data to train foundation models, and data residency options that can keep processing within the UK or EU.

The ICO’s guidance on generative AI allows AI use where organisations can demonstrate a lawful basis, necessity, and data minimisation. Where a firm has a signed data-processing agreement with the AI provider, access controls and usage logging, staff training on which categories may be processed, and a documented risk assessment, the position differs materially from using a public tool. Routine back-office work, such as processing anonymised invoice data or running internal document search over files that users can already access through approved systems, often meets this bar.

Two sectors carry additional obligations. NHS guidance states that identifiable patient data must not enter public AI tools, and any clinical decision support must satisfy medical device and data protection requirements. The FCA and Bank of England’s joint discussion paper on AI in financial services confirms that firms remain responsible for model risk management and fair customer treatment even when using third-party AI. If you serve EU clients, the EU AI Act adds transparency and risk-management duties for providers and deployers from 2026 onwards.

What does it actually cost to get this wrong?

Under UK GDPR, the ICO can fine organisations up to £17.5 million or 4% of worldwide annual turnover, whichever is higher. British Airways received a £20 million fine after security failures exposed the personal data of around 400,000 customers. IBM’s annual Cost of a Data Breach report put the average UK breach cost at approximately £3.4 million, covering detection, response, and lost business.

Beyond the headline figures, costs spread in several directions. Client contracts increasingly carry data-processing clauses with indemnity provisions and termination rights; a breach of those clauses can end a client relationship before any regulatory process has even started. Individuals can bring claims for distress under UK data protection law, and case law has recognised compensation for privacy violations even without direct financial harm. Cyber insurers are asking for evidence of AI-related controls, and firms without a documented policy may find cover restricted or premiums elevated.

The UK government’s open letter to business leaders noted that AI is reducing the cost and skill barriers for cyber attackers, making leaked data easier to exploit at scale. Operational disruption from a breach, covering incident response, forensic investigation, and possible system changes, diverts leadership attention at the worst possible time. For a small services firm, the combined exposure from regulatory, contractual, and reputational consequences can quickly exceed the value of the productivity gains AI was meant to deliver.

What should you ask before confidential data goes near any AI tool?

Before any confidential data enters an AI system, three questions cut through the noise: Is this a consumer tool or an enterprise deployment with a written data-processing agreement? Would exposure of this data require notification to the ICO within 72 hours? And does the relevant client contract restrict third-party processing without prior written consent?

If any answer is no, the default is to keep the data out until the governance is in place. A written internal AI policy that specifies which data categories and tools are approved is the lowest-cost control available to a small firm. It costs little to write, gives staff a clear line to work from, and provides the documented position you would need if the ICO or a client challenged your approach.

Two further questions are worth adding for firms planning to scale AI use. Has the ICO-recommended Data Protection Impact Assessment been completed for any high-risk processing? And will this use hold up in three years when EU AI Act obligations apply to firms serving EU clients? If you cannot answer these positively, restrict the use until you can. The practical move this week is to audit which AI tools your team is already using and confirm whether any of them have a signed data-processing agreement attached. If you’d like help thinking through what responsible AI use looks like for your firm, Book a conversation.

Sources

- ICO (2023). Guidance on generative AI. ICO's official guidance for organisations on UK GDPR compliance when using generative AI, covering lawful basis, data minimisation, and data-processing agreements. https://ico.org.uk/for-organisations/innovative-technologies/generative-ai/ - ICO (2023). ICO statement on generative AI. Statement confirming that organisations remain fully responsible for UK GDPR compliance when using generative AI tools, regardless of the provider used. https://ico.org.uk/about-the-ico/media-centre/news-and-blogs/2023/03/ico-statement-on-generative-ai/ - NCSC (2023). Secure use of generative AI. NCSC guidance advising organisations to avoid entering sensitive information into public generative AI services, citing risks of prompt storage and provider access. https://www.ncsc.gov.uk/guidance/secure-use-of-generative-ai - ICO (2023). Personal data definitions under UK GDPR. ICO definition of personal data, covering identifiable individuals and the additional requirements for special category data. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/guide-to-data-protection/key-definitions/personal-data/ - ICO (2020). ICO fines British Airways £20 million for data breach. Enforcement notice establishing the maximum fine precedent and the link between security failures and personal data exposure. https://ico.org.uk/about-the-ico/media-centre/news-and-blogs/2020/10/ico-fines-british-airways-20m-for-data-breach/ - FCA and Bank of England (2022). Discussion paper DP5/22 on AI and machine learning in financial services. Confirms firms remain responsible for model risk management, data quality, and fair customer treatment even when using third-party AI. https://www.fca.org.uk/publication/discussion/dp5-22.pdf - UK Government (2024). AI cyber threats: open letter to business leaders. Government warning that AI is reducing cost and skill barriers for cyber attackers, increasing the consequences of leaked confidential data. https://www.gov.uk/government/publications/ai-cyber-threats-open-letter-to-business-leaders/ai-cyber-threats-open-letter-to-business-leaders-html - SRA (2019). Code of Conduct for Solicitors. SRA confidentiality requirement applicable when solicitors consider using AI tools for client matter work. https://www.sra.org.uk/solicitors/standards-regulations/code-conduct-solicitors/ - IBM (2023). Cost of a Data Breach Report 2023. Annual breach cost research placing the average UK data breach cost at approximately £3.4 million, covering detection, response, and lost business. https://www.ibm.com/reports/data-breach - Technology Reseller UK (2024). Critical blind spots for UK organisations. Survey finding 67% of UK organisations cannot determine whether staff are sharing information via secure AI platforms, and 35% openly admit staff are using external tools without controls. https://technologyreseller.uk/critical-blind-spots-for-uk-organisations-as-two-thirds-dont-know-whats-being-shared-with-ai/

Frequently asked questions

Can I use ChatGPT to summarise client documents?

For public-tier ChatGPT, the answer is no where documents contain personal data, special category data, or trade secrets. Inputs may be stored and used for model training. If you need AI for client documents, you require an enterprise deployment with a written data-processing agreement, confirmed data residency within the UK or EU, and a documented lawful basis under UK GDPR.

What counts as confidential information for UK GDPR purposes?

UK GDPR covers any information relating to an identified or identifiable living person, including names, contact details, client IDs, IP addresses, and combinations of data that together identify someone. Special category data, such as health information, ethnicity, and biometrics, carries stricter rules. Beyond personal data, trade secrets, professionally confidential material, and information subject to contractual restrictions all warrant the same caution with AI tools.

What do I need in place before using AI with confidential client data?

At minimum: a written data-processing agreement with the AI provider covering UK GDPR roles, a confirmed lawful basis for the data category involved, residency within an approved region, access controls and usage logging, and an internal policy specifying which tools staff may use and for which data categories. If any of these are absent, keep the data out until they are in place.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation