Using AI without letting quality slip in client work

TL;DR

Using AI on client work without quality slipping comes down to three things: deciding which outputs require human review, having a short written policy, and training staff on what can go wrong. The BCG study found AI improved work quality by 40% when used correctly and degraded it when people skipped their checks. Policy and process make more difference than any tool choice.

Key takeaways

- AI improves quality when humans stay in the loop: the BCG study with 758 consultants found a 40% quality improvement when AI was used correctly, and performance fell when people skipped their own checks. - Quality risk clusters in four areas: legal and contractual positions, financial analysis, advice carrying professional liability, and client communications drafted without checking the client's actual context. - A one-page AI policy covering approved tools, banned uses, data rules, and review requirements is the fastest way to remove uncontrolled quality risk from your workflow. - Use tools with clear data residency policies for any work involving client information, since prompts in public generative AI tools may be retained for model training. - Training matters more than the tools: users who receive guidance on AI limitations report 11 percentage points higher productivity gains than those who receive none, and they make fewer errors.

A consultant at a 15-person professional services firm sent an AI-drafted proposal to a client last autumn. The draft was fast, well-structured, and almost entirely accurate. It also contained a reference to a competitor’s project, absorbed from an old document the tool had processed during setup. Nobody had read it before it went. The client called within the hour.

That outcome is predictable when AI productivity gains outpace the review habits built for slower processes. Getting the balance right is straightforward in principle, but it requires a few deliberate decisions that many owner-managed services firms have not yet made explicitly.

What does quality control mean when AI is drafting your client work?

Quality control in AI-assisted client work means deciding which outputs get reviewed before they reach a client, and who reviews them. The BCG study with 758 consultants found AI improved task quality by 40% on average, while performance dropped on tasks where consultants skipped their own checks. Speed creates output. Your review habits determine what that output costs you.

For any AI tool in your workflow, ask yourself: what would I need to check to be confident this output is right? That is a harder question than a skim read answers, and it is the one that keeps errors from reaching clients.

The UK Government AI Playbook makes the same point in a regulatory register: AI systems are probabilistic rather than deterministic. Treat them as assistants with a strong track record on certain tasks and a recognised blind spot on others, and build your review habits around the blind spots.

Why does this matter more for owner-managed businesses than large companies?

Larger firms have compliance teams and review layers that survive when staff adopt new tools. Owner-managed services firms typically concentrate review responsibility in a small number of senior people who are already stretched. When AI accelerates the volume of output, that person is handed more to check, faster. Without a deliberate decision about what stays reviewed and what does not, quality controls erode quietly.

Cisco’s 2023 Data Privacy Benchmark study found 61% of employees using generative AI at work, with only 27% reporting a clear employer policy. That pattern cuts across firm size. In an owner-managed services firm, a single client complaint from an AI error lands directly on the person who signed the contracts and guarantees the relationship.

Clients are beginning to ask directly, too. KPMG’s 2023 CEO Outlook found 81% of UK CEOs would only adopt generative AI if they could trust it. Procurement teams at larger organisations already include AI governance questions in supplier reviews. Being able to answer clearly about how you use AI on their work, and what checks you run, is becoming a commercial expectation rather than a differentiator.

Where in a services business does quality actually fail?

AI performs well on bounded tasks: meeting notes, internal summaries, first drafts of documents where format matters more than judgement. Quality failures cluster at the boundary where AI handles client-facing work that requires context or professional accountability it cannot carry. The four areas where errors reach clients are legal and contractual positions, financial analysis, advice carrying professional liability, and communications drafted without checking the client’s actual context.

The Air Canada ruling from early 2024 is the example risk professionals reach for most readily. The British Columbia Civil Resolution Tribunal found the company liable after its customer service chatbot gave misleading information about bereavement fares. The liability traced back to a process that allowed an AI system to communicate policy positions to customers without human oversight.

For a professional services firm, the parallel is direct. If an AI-drafted proposal carries a fee schedule, a regulatory reference, or a scope limitation that is wrong, the firm owns that. The NCSC guidance is explicit: staff should not input sensitive information into public generative AI tools, and outputs carrying legal, financial, or contractual weight require human review before reaching clients.

A practical starting point is a short list of which categories of client output require a named person’s approval before they leave the building. That list is your first quality control.

When does AI output need a human check, and when can it run without one?

The UK Government AI Playbook says AI systems are probabilistic rather than deterministic, and should be treated as assistants rather than oracles. For a services firm, the working rule is that human review is mandatory for anything carrying professional liability, anything referencing client-specific data, anything with legal or contractual effect, and anything going to a client under your name. Internal work where an error means a correction can run lighter.

The ICO’s guidance on AI and data protection is directly relevant here. Feeding personal data into external AI services counts as data processing under UK GDPR, which requires a lawful basis, purpose limitation, and security controls. For client work involving personal data, tools with clear data residency policies, such as Microsoft Copilot or Google Workspace AI, carry lower risk than public generative AI services because prompts are not typically retained for model training.

The EU AI Act, which applies to providers and users serving the EU market, requires effective human oversight for high-risk applications. Even if your firm is not directly subject to it today, the direction of regulatory travel is towards more accountability. Planning for it while your AI use is still small and auditable is considerably easier than retrofitting controls after the processes have scaled.

What makes more difference than the tool you choose?

Microsoft’s Copilot research found that users who received training on AI limitations reported 11 percentage points higher productivity gains than those who did not. The BCG consultant study found that participants guided on when not to use AI made fewer errors than those who went unguided. In both cases the gain came from training and shared norms, not from a different platform. A 90-minute internal session will do more for quality than switching tools.

Alongside training, a one-page AI policy is the second practical control. The Scottish AI Playbook and the UK Government’s Central Digital and Data Office guidance both recommend a simple policy covering approved tools, banned uses, data handling rules, and review requirements. Modelled on an existing data protection or social media policy, it takes an afternoon to write and makes review expectations explicit rather than leaving them to individual judgement each time.

The third lever is piloting before scaling. The UK Government AI Playbook recommends starting with low-risk use cases, setting quality metrics such as error rate, rework time, and client complaints, then expanding only what passes your own thresholds. For a 10-to-50-person services firm, that means starting with internal drafts and meeting notes, measuring outcomes for 60 days, and then moving to client-facing work once review habits are established.

None of this requires a technology programme. A decision about which outputs need review, written down and followed consistently, is where quality control in AI-assisted client work actually begins.

Sources

- Boston Consulting Group (2023). How generative AI improves (and complicates) the work of professionals. Study of 758 consultants showing a 40% quality improvement when AI was used correctly, and increased errors when consultants skipped their own checks. https://www.bcg.com/publications/2023/how-generative-ai-improves-and-complicates-consulting-work - McKinsey & Company (2023). The economic potential of generative AI: the next productivity frontier. Estimates 60-70% of employee work time is potentially automatable, with quality outcomes dependent on human-in-the-loop design. https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-economic-potential-of-generative-ai-the-next-productivity-frontier - UK Government (2024). Artificial Intelligence Playbook for the UK Government. Advises that AI systems are probabilistic rather than deterministic and require clear human oversight protocols before outputs reach clients or affect decisions. https://assets.publishing.service.gov.uk/media/67aca2f7e400ae62338324bd/AI_Playbook_for_the_UK_Government__12_02_.pdf - Information Commissioner's Office (2024). Guidance on AI and data protection. Covers obligations on organisations feeding personal data into AI systems, including lawful basis and data residency requirements under UK GDPR. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/artificial-intelligence/ - National Cyber Security Centre (2023). Guidelines for secure use of generative AI. Advises UK organisations not to input sensitive information into public generative AI tools, and sets expectations for evaluating AI system security. https://www.ncsc.gov.uk/guidance/guidelines-for-secure-ai-system-development - UK Central Digital and Data Office (2023). Guidance on the use of generative AI in government. Recommends setting organisational policy on acceptable use, risk thresholds, and human oversight requirements for generative AI tools. https://www.gov.uk/government/publications/generative-ai-in-government-guidance - European Parliament (2024). EU Artificial Intelligence Act. Requires effective human oversight for high-risk AI applications; applies to providers and users serving the EU market. https://www.europarl.europa.eu/legislative-train/theme-a-europe-fit-for-the-digital-age/file-artificial-intelligence-act - Cisco (2023). Data Privacy Benchmark Study. Found 61% of employees using generative AI at work, with only 27% reporting a clear employer policy, indicating widespread unmanaged quality and compliance risk. https://www.cisco.com/c/en/us/products/security/security-reports/data-privacy.html - KPMG (2023). 2023 CEO Outlook: UK findings. Found 81% of UK CEOs would only adopt generative AI if they could trust it, signalling growing supplier expectations around AI governance and quality controls. https://kpmg.com/uk/en/home/insights/2023/10/ceo-outlook-2023.html - Microsoft (2023). Work Trend Index: Copilot early use research. Found that users who received training on AI limitations reported 11 percentage points higher productivity benefits than those who received no training. https://www.microsoft.com/en-us/worklab/work-trend-index/copilot-early-use-research

Frequently asked questions

What types of AI output should always be reviewed by a human before going to a client?

Anything carrying professional liability, referencing client-specific data, with legal or contractual effect, or going to a client under your firm's name requires a human review step. The categories that produce the most errors are proposals with financial or scope detail, regulatory references, and advice in areas where your firm carries professional accountability.

Does using AI on client work breach UK GDPR?

It depends on the tool and what data you input. Feeding client personal data into a public AI service counts as data processing under UK GDPR and requires a lawful basis, purpose limitation, and security controls. Tools with clear data residency policies, such as Microsoft Copilot or Google Workspace AI, are lower risk because prompts are not typically retained for model training. Check your tools' data processing terms before using them on client work.

Do we need a formal AI policy in a firm of under 20 people?

Written expectations matter more than a formal document. A one-page policy covering approved tools, banned uses, what data cannot go into AI tools, and who reviews output before it reaches clients removes the ambiguity that produces quality errors. The Scottish AI Playbook and UK Government guidance both recommend this for organisations of any size. It takes an afternoon to write and prevents the "I assumed it was fine" problem.

Written by Dr Dave Heath, AI consultant and business strategist.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Using AI without letting quality slip in client work

Key takeaways

What does quality control mean when AI is drafting your client work?

Why does this matter more for owner-managed businesses than large companies?

Where in a services business does quality actually fail?

When does AI output need a human check, and when can it run without one?

What makes more difference than the tool you choose?

Sources

Frequently asked questions

What types of AI output should always be reviewed by a human before going to a client?

Does using AI on client work breach UK GDPR?

Do we need a formal AI policy in a firm of under 20 people?

Ready to talk it through?

If any of this sounds familiar, let's talk.

Using AI without letting quality slip in client work

Key takeaways

What does quality control mean when AI is drafting your client work?

Why does this matter more for owner-managed businesses than large companies?

Where in a services business does quality actually fail?

When does AI output need a human check, and when can it run without one?

What makes more difference than the tool you choose?

Sources

Frequently asked questions

What types of AI output should always be reviewed by a human before going to a client?

Does using AI on client work breach UK GDPR?

Do we need a formal AI policy in a firm of under 20 people?

Ready to talk it through?

Related reading

AI theatre or real progress: how a founder tells the difference

How safe is AI for business use, and where do the risks sit?

How accurate is AI translation for business documents?

If any of this sounds familiar, let's talk.