How to scope an agentic AI proof of concept safely

Two colleagues reviewing a printed document together at a desk in a small office
TL;DR

An agentic AI proof of concept should cover one narrow workflow, use synthetic or minimised data, and keep humans reviewing every action before it affects a client. The ICO expects a Data Protection Impact Assessment before you process personal data through an AI agent. Set your success metrics, isolate the test environment, and build a kill switch before you start.

Key takeaways

- An agentic AI system takes actions autonomously across multiple steps, which makes scoping more consequential than a standard chatbot pilot - The ICO requires a Data Protection Impact Assessment before an AI agent processes personal data in ways likely to result in high risk to individuals, which means before the PoC starts, not after - Start with one narrow workflow on synthetic or minimised data, with a named person reviewing every output before it reaches a client or leaves the business - The FCA has confirmed that regulated firms cannot outsource compliance responsibility to AI vendors, so human oversight of any agent output is mandatory in financial services contexts - Before beginning, have a written scope, vendor data processing agreements, an isolated test environment, and a kill switch in place

One conversation that comes up often with owners of professional services businesses goes roughly like this. A vendor demonstrates an agentic AI tool that will handle email triage, draft proposals, and follow up with prospects without anyone needing to intervene at each stage. The demo is polished. The pricing is more accessible than expected. But as the meeting ends, the owner has not yet asked who checks the outputs before they reach a client, what happens to client data during the test, or what the plan is if the agent does something that cannot easily be undone.

That gap between the demo and the operational reality is what scoping a proof of concept is designed to close.

What is an agentic AI proof of concept?

An agentic AI system can read a message, decide what action to take next, call an external tool, and act on the result, all without a human stepping in at each stage. A proof of concept is a contained, time-limited test of that capability on one specific workflow, designed to find out whether the approach works reliably in your operation and what it actually takes to run it safely.

What sets an agentic system apart from a standard chatbot is its ability to chain multiple steps together and act on your behalf inside existing tools and platforms. Cisco describes these systems as ones that reason step by step, sequence tasks, access tools and data, and coordinate with other agents or humans to achieve a goal. For an owner-managed services business, that might mean an agent that classifies an incoming support ticket, drafts a suggested response, logs the interaction in the CRM, and escalates anything it cannot handle. A 14-day PoC, as vendors like DBB Software advertise, tests whether that chain holds under real conditions. The PoC is meant to answer three questions: does it work reliably, can we govern it safely, and what does running it actually cost in day-to-day oversight and attention.

Why does getting the scope right matter for your business?

A poorly scoped agentic PoC creates problems that outlast the test itself. If the agent processes real client data before you have completed a Data Protection Impact Assessment, you may be in breach of UK GDPR before you have confirmed whether the tool is even worth running. Errors from an agent acting without human sign-off compound before anyone catches them, and the cost is reputational and regulatory as well as operational.

The ICO requires a DPIA before you deploy AI systems in ways likely to result in high risk to individuals. Processing client emails, support tickets, or any identifiable records through an agentic agent almost always meets that threshold. The 2023 Samsung incident is instructive: engineers pasted confidential source code and internal meeting notes into ChatGPT, and the data entered the model’s training corpus before anyone realised what had happened. Samsung subsequently banned the tool internally. The ICO has shown it will act on data protection failures involving AI: it fined Clearview AI £7.5m in 2022 for unlawfully scraping and processing UK residents’ data without a lawful basis or adequate transparency. For owner-managed businesses, the regulatory exposure from a badly scoped PoC is concrete, not theoretical.

Where does agentic AI actually show up in a services business?

Agentic AI in owner-managed businesses tends to cluster around work that is high-volume, rules-based, and time-consuming enough that someone on the team is already wishing it could be automated. Common starting points include email triage, first-response drafting, support ticket classification, and initial proposal generation from a structured brief. Each of these has clear inputs, clear outputs, and a natural review point before anything reaches a client or leaves the business.

Cisco and BCG both frame early agentic deployments as copilot tools embedded in existing platforms, rather than autonomous systems running without supervision. BCG’s analysis of enterprise agentic AI deployments points to 20 to 30 per cent productivity gains in administrative workflows where the agent is well-scoped and the human review step is designed in from the start. McKinsey’s 2023 State of AI survey found that 55 per cent of organisations had adopted AI in at least one business function, with agentic approaches now representing the next layer of that adoption curve. For a professional services firm, a realistic first PoC covers one workflow, one platform, and a defined window of around two weeks, with clear success metrics set in advance and a named person responsible for reviewing every output. The agent prepares the work; a human approves and acts on it.

When is the right time to run a PoC, and when should you wait?

The right time is when you have one clearly bounded workflow, a named person reviewing every agent output before it affects anyone outside your team, and a written plan for what happens if the test goes wrong. Wait if the agent needs real client data before you have completed the compliance paperwork, or if your vendor cannot confirm where your data is stored and how long it is retained.

The FCA has been explicit that regulated firms cannot outsource their compliance responsibilities to AI vendors, and that human oversight of AI-driven decisions is expected, not a courtesy. If your business gives regulated advice, an agent that drafts or sends client-facing communications without review is unlikely to meet Consumer Duty expectations under current FCA guidance. For the NCSC, the baseline for a safe PoC includes using synthetic or anonymised test data rather than live client records wherever possible, restricting the agent’s access to only what it genuinely needs, and logging every action for later review. ProductCrafters’ guidance on agentic PoC design recommends defining one measurable success metric before you start, something like average handling time per enquiry or the percentage of agent outputs that need major correction. Tracking that through the PoC gives you a concrete basis for deciding whether to scale, adjust, or stop at the end of the window.

What to have in place before the first test begins

Before your first agentic PoC starts, five things should be in place: a written scope with one measurable success metric, a DPIA if the agent will process personal data, vendor contracts confirming how your data is stored and used, a test environment isolated from your live systems, and a kill switch that allows you to stop the agent immediately. If any of these is absent, moving the start date is the sensible call.

The ICO guidance on AI and data protection makes clear that data processing agreements with AI vendors must specify how your data is handled, who can access it, and what happens to it after the PoC ends. Many enterprise AI platforms, including OpenAI’s enterprise offering and Microsoft’s Azure OpenAI Service, specify that customer data is not used to train foundation models by default, but this should be verified in writing rather than assumed. The CMA’s 2023 review of AI foundation models also flagged the risk of vendor lock-in in AI infrastructure. Where possible, designing the PoC around a platform that could be replaced later is preferable to one that hard-bakes a proprietary format into your workflow from day one. UK cyber insurers are beginning to include AI governance questions in proposal forms, so documenting the PoC scope, the DPIA outcome, and the human review process now makes later conversations with your insurer considerably easier. A well-documented PoC is also the evidence base for deciding whether to scale, and what conditions that scale-up would require.

Agentic AI is worth testing. The efficiency gains from a well-run PoC can be real, and the workflow knowledge you build during a contained test is valuable regardless of the outcome. Scoping it safely protects that investment. When the two weeks are up, you want a clear decision on whether to proceed, not a compliance query from a client you forgot to inform about the test.

If you want a second pair of eyes on your PoC scope before you begin, Book a conversation.

Sources

- ICO (2023). Guidance on AI and Data Protection. Covers lawful basis, data minimisation, and transparency requirements for organisations using AI systems. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/artificial-intelligence/guidance-on-ai-and-data-protection/ - ICO (2023). Data Protection Impact Assessments. Explains when a DPIA is required and how to complete one, including for high-risk AI processing scenarios. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/accountability-and-governance/data-protection-impact-assessments/ - NCSC (2023). Guidelines for Secure AI System Development. Covers secure-by-design principles, least-privilege access, environment isolation, and monitoring practices for AI systems. https://www.ncsc.gov.uk/collection/guidelines-secure-ai-system-development - FCA (2024). AI Regulation in Financial Services. Sets out the FCA's expectation that firms remain responsible for compliance and cannot delegate accountability to AI vendors. https://www.fca.org.uk/news/speeches/ai-regulation-financial-services - CMA (2023). AI Foundation Models: Initial Report. Warns of vendor lock-in and market concentration risks from a small number of large AI infrastructure providers. https://www.gov.uk/government/publications/ai-foundation-models-initial-report - EUR-Lex (2024). Regulation (EU) 2024/1689: EU Artificial Intelligence Act. Sets transparency and high-risk use obligations relevant to UK firms serving EU clients. https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:32024R1689 - BCG (2025). How Agentic AI is Reshaping Enterprise Platforms. Reports 20 to 30 per cent productivity gains in administrative workflows from well-scoped agentic deployments. https://www.bcg.com/publications/2025/how-agentic-ai-is-transforming-enterprise-platforms - McKinsey (2023). The State of AI in 2023: Generative AI's Breakout Year. Finds 55 per cent of organisations adopted AI in at least one business function by 2023, with agentic approaches now representing the next layer. https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai-in-2023-generative-ais-breakout-year - ProductCrafters (2025). AI Proof of Concept: Build and Validate Your AI POC. Recommends one measurable outcome, a clear success metric, and a tight timeframe for agentic PoC design. https://productcrafters.io/blog/agentic-ai-poc/ - ICO (2022). ICO fines Clearview AI Inc £7.5m. Demonstrates the ICO's willingness to take enforcement action against AI-driven data processing that lacks lawful basis and transparency. https://ico.org.uk/about-the-ico/media-centre/news-and-blogs/2022/05/ico-fines-clearview-ai-inc-7-5m/

Frequently asked questions

Do I need a Data Protection Impact Assessment before running an agentic AI PoC?

If the agent will process personal data in a way likely to result in high risk to individuals, the ICO requires a DPIA to be completed before you begin, not after. Processing client emails, support tickets, or any identifiable records almost always meets that threshold. The DPIA does not have to be lengthy, but it must document the purpose, the data types involved, the risks identified, and the mitigations you have put in place.

What is the safest first workflow to test with an agentic AI agent?

Choose a workflow with clear inputs and outputs, a natural human review point before anything reaches anyone outside your team, and no payment or legal commitments involved. Email triage, first-response drafting, and support ticket classification are common starting points. Run the PoC on synthetic or anonymised data rather than live client records until the compliance groundwork is complete.

Can we use a consumer AI tool like ChatGPT to run an agentic PoC?

Consumer-tier AI tools are generally not appropriate for processing client data in a business context. The 2023 Samsung incident, where engineers' confidential data entered a public model's training corpus within minutes, illustrates the risk clearly. Enterprise offerings from providers such as OpenAI and Microsoft specify that customer data is not used for model training by default, but you still need to verify this in the contract and have a data processing agreement in place before any client data is involved.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation