Big Four AI rollouts, what they tell SMEs

The owner of a thirty-person professional services firm is reading another LinkedIn post about a Big Four AI deployment over her second coffee of the morning. PwC and Anthropic, an expanded partnership, twenty to fifty per cent productivity gains in development work, agentic build across financial services and pharma and healthcare. EY rolling AI capabilities across one hundred and sixty thousand audit engagements. Deloitte’s GenAI audit platform. KPMG’s healthcare generative AI report. The numbers are vast, the language is glowing, and the gap between her resources and theirs is the size of the Atlantic.

She closes the tab. Her frustration is not with the work itself, plenty of it is genuinely impressive, but with the coverage, which seems written for nobody her size. The case studies read like vendor marketing because they partly are vendor marketing. Anthropic benefits from a glowing PwC write-up. PwC benefits from a glowing Anthropic write-up. Both parties have an interest in the figures being at the top of the distribution.

The useful question is not what the Big Four are doing in the abstract, but what they are doing that an owner-managed firm could borrow at a hundredth the scale. Three patterns repeat under all four firms’ public stories and each one is portable.

What are the Big Four actually doing with AI right now?

The four firms are pursuing similar strategies at different speeds. PwC has deepened its alliance with Anthropic into three areas, agentic build using Claude Code to ship production software in weeks, AI-native deal-making that compresses transactions end-to-end, and reinvention of enterprise functions through bespoke internal applications. The firm reports twenty to fifty per cent productivity gains in development work and plans a full AI-driven audit solution in 2026.

EY launched an integrated AI platform in 2023 that spans strategy, transactions, risk, insurance and tax, with subsequent rollouts of AI capabilities supporting one hundred and sixty thousand global audit engagements. Deloitte has built generative AI into its audit-documentation review and publishes its annual State of AI in the Enterprise research as part of its public positioning. KPMG has gone deep on sector-specific applications, including its generative AI in healthcare report and parallel work in financial services and tax.

The coverage of all four follows a recognisable shape. A vendor partnership announcement, a headline productivity figure, a list of functions touched, a forward statement about the next phase. The case studies are useful inputs and marketing assets at the same time. The first move when reading them is to separate the underlying implementation from the press release describing it.

Why does this matter for an owner-managed firm?

It matters because three patterns under the four firms’ stories are genuinely portable and the LinkedIn coverage usually skips them. The patterns sit underneath the budget numbers and the partnership announcements. They show up in every credible Big Four AI rollout, they show up in the MIT NANDA research on why ninety-five per cent of generative AI pilots fail to deliver measurable impact, and they are independent of firm scale.

The first pattern is internal productivity before client-facing AI. PwC’s engineering productivity work and EY’s audit-engagement integration are both internal-first moves, the AI is deployed on the firm’s own back office before it touches a client. The MIT research is clear that this is the higher-ROI pattern and that more than half of generative AI budgets are still pointed at sales and marketing rather than back-office automation. The Big Four are not making that mistake. Many smaller firms are.

The second pattern is bespoke build on top of vendor models. PwC’s in-house applications layered on top of Claude are the cleanest example, but each of the four firms is shaping vendor tools to its specific workflow rather than expecting an off-the-shelf model to learn the firm’s work unaided. The third pattern is long horizons. None of these rollouts are quarterly. PwC’s audit solution is a 2026 milestone. EY’s platform integration is a multi-year programme. The Big Four are operating on a horizon that gives the technology time to compound.

Where will you actually meet these patterns in your own firm?

You meet the internal-productivity-first pattern the moment you ask where AI should go first in your firm. The instinct for many owner-operators is to point AI at the client-facing surface, the website chatbot, the proposal generator, the marketing copy. The MIT NANDA evidence and the Big Four behaviour both say the opposite. The largest measurable ROI sits in the back office, in the boring internal places, not the visible external ones.

You meet the bespoke-on-top-of-vendor pattern as soon as your first general-purpose AI tool stalls in real use. ChatGPT and Claude on their own are powerful for an individual contributor and weak as enterprise infrastructure, because they do not learn from or adapt to your specific workflow. The fix at PwC’s scale is an engineering team building applications on top of the vendor model. The fix at your scale is a thoughtfully written system prompt, a small library of templates codifying how your firm does the work, and a clear human-in-the-loop step. Same principle, different scale.

You meet the long-horizon pattern when you set the success criterion for your first AI rollout. If the criterion is a quarterly productivity win the firm will compress the timeline and the work will fail. If the criterion is a twelve to twenty-four-month compounding capability with checkpoint reviews, the firm gives the technology the time it actually needs. The Big Four are giving themselves years. A smaller firm cannot afford less patience, it can afford less budget.

When should you copy the Big Four and when should you ignore them?

Copy the principles, ignore the budget lines. The internal-productivity-first move transfers cleanly and is the right place to start. Pick the highest-friction back-office task in your firm, the one a senior person grumbles about every week, deploy AI there, measure honestly against a real baseline, and build the next thing on top of what you learned. The Big Four are running this play with three more zeroes of budget. The play is the same.

Ignore the bespoke vendor partnerships. Anthropic does not sign strategic alliances with thirty-person firms and you do not need one. Ignore the dedicated AI engineering teams, the internal AI platforms, the dedicated change-management functions, the multi-year reinvention programmes. These are the artefacts of operating at Big Four scale and they are not the source of the value. The principles are the source of the value. The artefacts are downstream of the principles.

Ignore the published productivity figures as targets. Twenty to fifty per cent development productivity is the top end of a wide distribution, reported by parties with an interest in the figure being high. The British Chambers of Commerce evidence on UK SME AI adoption is more useful as a calibration, fifty-four per cent of UK firms now use AI, ninety-five per cent report no workforce reduction, the actual experience is incremental rather than headline-grabbing. Set your own baseline and measure your own delta against it.

What does this mean for how you read AI case studies generally?

The honest filter for any Big Four AI case study is to ask what the firm actually does differently as a result, rather than what the announcement said it would do. The productivity figures sit at the top of the distribution. The strategic narratives are co-written with the vendor. The internal implementation work is usually real. Reading for the patterns, not the figures, makes the coverage useful at your scale.

Three discipline moves help when reading AI coverage. Discount the headline numbers by at least half until you find a non-vendor source corroborating them. Look for the operational specifics buried near the bottom of the case study, the function rolled out, the team affected, the time horizon, those are the portable details. Cross-reference against the MIT NANDA evidence on where AI investment actually produces measurable ROI, because the vendor coverage and the ROI evidence often point in different directions.

The Big Four are useful precisely because their rollouts are at sufficient scale and duration to test the underlying patterns. The principles survive at your scale. The methods do not. If you want to think through which back-office task in your firm is the right first move and what the twelve-month horizon should look like, book a conversation.

PwC, EY, Deloitte, KPMG, what their AI rollouts tell SMEs

Key takeaways

What are the Big Four actually doing with AI right now?

Why does this matter for an owner-managed firm?

Where will you actually meet these patterns in your own firm?

When should you copy the Big Four and when should you ignore them?

What does this mean for how you read AI case studies generally?

Sources

Frequently asked questions

Are the Big Four AI rollouts genuinely useful examples for a thirty-person firm or are they mostly marketing?

Which Big Four AI move should an owner-managed firm copy first?

Should I be building bespoke AI on top of vendor models like PwC does with Anthropic?

Ready to talk it through?

If any of this sounds familiar, let's talk.

PwC, EY, Deloitte, KPMG, what their AI rollouts tell SMEs

Key takeaways

What are the Big Four actually doing with AI right now?

Why does this matter for an owner-managed firm?

Where will you actually meet these patterns in your own firm?

When should you copy the Big Four and when should you ignore them?

What does this mean for how you read AI case studies generally?

Sources

Frequently asked questions

Are the Big Four AI rollouts genuinely useful examples for a thirty-person firm or are they mostly marketing?

Which Big Four AI move should an owner-managed firm copy first?

Should I be building bespoke AI on top of vendor models like PwC does with Anthropic?

Ready to talk it through?

Related reading

Practical AI ideas for small business operations

Healthcare AI use cases that reduce admin and improve flow

What digital marketing teams are actually doing with AI

If any of this sounds familiar, let's talk.