The pilot-to-scale valley of death, and how to cross it

A professional reviewing documents at a conference table in an office
TL;DR

The pilot-to-scale valley of death is the gap between a proof of concept that works in a controlled setting and a capability that runs across the whole business. BCG finds roughly half of companies stuck at this stage. Crossing the valley is an integration, ownership, and change management problem, and the work it requires is more substantial than many rollout plans account for.

Key takeaways

- The pilot-to-scale valley of death is the gap between a proof of concept that works and a capability that is embedded across the business. - BCG finds roughly half of companies stuck in stagnating or emerging AI stages, unable to scale past proof of concept. - Scaling fails on integration, ownership, and change management, not on the technology itself. - A named owner for the post-pilot workflow, established before the handoff, is among the most preventable single points of failure in an AI programme. - Workflow readiness is as important as tool choice; AI placed inside an inconsistent or undocumented process produces inconsistent output regardless of the tool.

The pilot worked. Everyone who saw it said so. The output was cleaner, the turnaround faster, and the team running it was visibly proud of what they’d built. Then someone asked the obvious question: when does the rest of the business get this?

Six months later, you’re still answering it. The pilot sits exactly where it landed, a single pocket of capability that hasn’t moved, despite clear proof that the approach works. The founder is asking for an update. The board wants to see traction. You’re holding the space between “we proved it works” and “we actually did it.”

That space has a name.

What is the pilot-to-scale valley of death?

The pilot-to-scale valley of death is the gap between a proof of concept that works in a controlled setting and a capability that is embedded across the business. BCG’s research on AI adoption finds roughly half of companies are stuck in this position, able to demonstrate value in isolated tests but unable to move it into standard practice. The valley is a recognised stage, not an anomaly.

The term comes originally from product development, where it describes the gap between early-stage funding and commercial viability, where promising ideas exhaust their momentum before becoming products. Applied to AI rollouts, the equivalent is operational. The pilot team has built something that works for them, integrating it into their habits and refining the output over weeks of real use. Everyone else is still working the way they always have.

The integration work the pilot deliberately avoided is what scaling requires. That means a documented process, a named owner, and people who weren’t involved in the original test brought in and given a genuine stake in what happens next. In many pilot plans, none of that is budgeted for.

Why does this matter when you’re holding an AI mandate?

Your mandate was to get AI working across the business. A pilot that stays a pilot does not fulfil that mandate. MIT’s GenAI research finds roughly 95% of generative AI pilots show no measurable revenue impact; workflow integration, not model quality, is consistently identified as the primary bottleneck. The valley is where AI mandates go to stall, and where the delegate absorbs the blame.

The pressure builds quickly. Sponsors who backed the initiative see a demonstration, see it work, then watch months pass without visible change in how the business actually operates. The founder or board starts asking when the rest of the organisation gets the same capability. Your window to maintain momentum is narrower than it looks.

There is also a confidence effect inside the team. People who were not part of the pilot are watching to see whether this is real change or another initiative that will fade. A slow transition to scale confirms their scepticism before you’ve had a chance to address it, and that is harder to reverse than a technical problem.

Turning a proof of concept into a repeatable operation is what you were hired to do. The pilot proved it was possible. Scaling has to prove it was worth it.

Where does the wall actually appear?

The wall appears at the handoff. When the pilot team completes its work and responsibility moves to a broader group, three things typically break down. The incoming team was not part of building the workflow. No one owns the ongoing quality check. And the new users have no obvious reason to change habits that were already working for them.

Behind this is an alignment gap. Kyndryl’s 2024 research finds around 70% of leaders say their workforce is not ready for AI change. The gap is rarely about technical skill; the people doing the work have often not been helped to understand why the change is happening, or to feel any ownership over it.

A pilot sidesteps this by design. You recruit willing participants, work in a contained space, and get clean results precisely because the conditions are controlled. Moving that to a wider team with different pressures and no stake in what was built strips those conditions away.

Ownership is the crux. Someone in the broader rollout needs to care enough about the workflow to maintain it, adapt it when the output drifts, and advocate for it when something goes wrong. Pilot teams generate that ownership naturally because they built the thing. The rest of the business needs it designed in before the handoff, not rebuilt after the stall has already started.

When is a stall a signal to slow down, and when to push through?

A stall means one of two things. Either the rollout is missing the integration and ownership work that scaling requires, a solvable problem if you address it directly. Or the pilot solved a problem that matters to a small team but carries no weight for the broader business, in which case the right answer is to scope more tightly rather than scale more widely.

The diagnostic is whether the scaling work was done or skipped. Have the people who will use the new workflow been involved in adapting it to their reality? Is there a named owner who will keep it running after the transition? Has the process been documented well enough that someone outside the pilot team can follow it reliably?

Clear yes answers across those questions, combined with a rollout that is still stalling, suggest the problem itself was the issue. Not every successful pilot deserves to become standard practice. Some work beautifully in a small test precisely because the test was controlled and the participants were self-selected.

The honest answer usually sits in the middle. The pilot proved value in the right place, but scaling was treated as a delivery task rather than as a project in its own right, one that needs the same preparation the pilot had. Recognising that distinction early saves a significant amount of recovery work later.

What sits alongside the valley, and what to sort first?

The valley does not appear in isolation. Its depth, and whether you cross it, depends on three things that sit outside the technology itself. Active executive sponsorship is what makes hesitation cost something for the team. Clear decision rights are what stop the handoff from becoming a power vacuum. And workflow readiness is what determines whether the output holds once the AI is running outside the pilot team.

Visible, sustained C-suite sponsorship is among the strongest predictors of adoption across change management research. BrainStorm’s analysis of enterprise technology rollouts found organisations with active senior sponsorship consistently reaching higher activation rates within ninety days than those managing rollouts without it. When the most senior person has made it clear that this is how the business works now, hesitation carries a cost. Without that signal, waiting costs nothing.

Decision rights are closely related. The pilot-to-scale problem often surfaces as a power vacuum, where the pilot team has finished, the wider team is waiting for direction, and no one has been explicitly named as the person responsible for what comes next. Naming that owner at the start of the rollout, rather than after the stall begins, closes one of the most predictable failure points in an AI programme.

Workflow readiness is the third factor and the least discussed. AI does not straighten a crooked process. If the work the tool sits inside is informal, inconsistent, or undocumented, the output will be inconsistent too. Sorting the underlying process before layering AI on top of it is slower at the start and faster everywhere after.

Sources

- BCG (2025). The AI Adoption Puzzle: Why Usage Is Up but Impact Is Not. Finds roughly half of companies stuck in stagnating or emerging AI stages, unable to scale past proof-of-concept. https://www.bcg.com/publications/2025/ai-adoption-puzzle-why-usage-up-impact-not - Fortune / MIT NANDA (2025). MIT Report: 95% of Generative AI Pilots at Companies Are Failing. Reports MIT NANDA research finding only ~5% of generative AI pilots achieve measurable revenue acceleration; workflow integration failure is the primary cause. https://fortune.com/2025/08/18/mit-report-95-percent-generative-ai-pilots-at-companies-failing-cfo/ - Guo et al. (2021). Factors Affecting the Adoption of Artificial Intelligence. Peer-reviewed evidence that technology adoption fails when people and leadership work is underestimated, not the technology itself. https://pmc.ncbi.nlm.nih.gov/articles/PMC7784639/ - McKinsey (2025). Superagency in the Workplace. Documents the gap between widespread AI use and measurable business impact when workforce integration and ownership are absent. https://www.mckinsey.com/capabilities/tech-and-ai/our-insights/superagency-in-the-workplace-empowering-people-to-unlock-ais-full-potential-at-work - Spencer Stuart (2025). Don't Delegate AI: A Power-User Playbook for CEOs. Analysis of how active senior leadership involvement drives AI adoption outcomes, relevant to the executive sponsorship requirement at scale. https://www.spencerstuart.com/research-and-insight/dont-delegate-ai-a-power-user-playbook-for-ceos - HRDive / Kyndryl (2024). Employers Say Employees Are Resistant or Hostile to AI. Reports Kyndryl 2024 research finding ~70% of leaders say their workforce is not ready for AI change, pointing to alignment as the core gap. https://www.hrdive.com/news/employers-employees-resistant-hostile-to-AI/749730/ - TechClass (2025). From Pilot to Scale: How Mid-Sized Companies Can Successfully Expand AI Adoption. Practitioner analysis of the integration work required when moving AI from controlled test to wider rollout. https://www.techclass.com/resources/learning-and-development-articles/from-pilot-to-scale-how-mid-sized-companies-can-successfully-expand-ai-adoption - BrainStorm (2024). Executive Sponsorship in Technology Rollouts. Vendor research (indicative) finding active C-suite sponsorship correlates with significantly higher adoption rates within ninety days of enterprise technology deployment. https://www.brainstorminc.com/blog/executive-sponsorship-technology-rollouts

Frequently asked questions

Why do AI pilots succeed but still fail to scale across the business?

The pilot environment is deliberately controlled, with willing participants and managed conditions. Scaling means moving into teams with different habits, different pressures, and no ownership of what was built. Without a documented process, a named owner, and some involvement from the incoming team, the pilot remains an isolated capability.

How do I know whether to push a stalled rollout or step back and rescope?

Ask whether the scaling work was done or skipped. If there is no named owner, no documented process, and no involvement from the incoming team, the stall is fixable. If all three are in place and the rollout is still stalling, the problem may be that the original scope was too narrow to matter at scale.

What is the most important thing to have in place before a pilot tries to scale?

A named owner in the broader team, established before the handoff, is the single most preventable failure point. That person needs enough stake in the outcome to maintain the workflow, adapt it when the output drifts, and advocate for it when something goes wrong. Designing that ownership in early is faster than rebuilding it after a stall.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation