The hidden margin tax of AI, staff time on oversight, evaluation and recovery

An owner at a desk comparing an AI-generated draft on one screen with a client document on another, handwritten margin notes beside the keyboard in natural daylight
TL;DR

Every AI use case carries a labour cost that does not get logged, staff time spent supervising AI work, checking AI output, and fixing what AI got wrong. The cost is invisible because it is scattered across hours nobody codes as AI work and is often performed by the most expensive people in the firm. Sized honestly, this hidden labour tax commonly runs five to fifteen per cent of gross margin and never shows up on the books.

Key takeaways

- The labour tax of AI breaks into three categories, oversight while the tool runs, evaluation of what it produced, and recovery when something went wrong, none of which is usually coded as AI work - Workday's 2026 research finds about 37 per cent of AI time savings are lost to rework, Gartner-CPA.com data in accounting puts the figure as high as 69 per cent in regulated work - The labour tax is highest in regulated, fact-heavy, or decision-recommendation work, lowest in low-stakes drafting and internal first-passes, the mix decides whether AI is paying back or quietly leaking - You can size it in one week without timesheets, pick three to five workflows, ask the people running them to annotate their calendars, then sit down at the end of the week and do the arithmetic - The point of the measurement is to move the cost from anecdote to line item so you can decide where to redesign, where to reprice, and where to scale back

An owner I sat with last month had her AI subscription totals on a printed page in front of her. £800 a month across five tools, growing. She knew that number to the pound. When I asked her how many staff hours a week the firm now spent checking AI output, supervising AI runs, or rewriting something AI got wrong, she paused, looked at the ceiling, and said honestly that she had no idea. Probably a lot. She had not been counting. Nobody had.

That gap is the post. Every AI use case carries a labour cost that does not get logged. The visible cost is the subscription bill, sitting tidily in the management accounts. The invisible cost is the staff time spent overseeing the tool while it runs, evaluating what it produced, and recovering when something slipped through. Sized honestly, that hidden labour tax commonly runs between five and fifteen per cent of gross margin in firms running heavier AI use, and it never shows up in the AI column of the P&L because nobody is coding their hours that way.

The aim here is to put the second half of the cost equation back on the table so owners can make decisions with both numbers in view. Where the gain is real, you keep going. Where the oversight is eating the gain, you redesign or you stop. Neither call is possible if you only see the subscription side of the ledger.

What is the hidden labour tax of AI?

It is the staff time spent on three things that AI use generates and traditional accounting does not separate. Oversight is the time staff spend supervising AI tools as they operate, keeping a human in the loop. Evaluation is the deeper checking of AI output against quality, accuracy and risk standards. Recovery is the time spent fixing, rewriting, or remediating AI work that was unusable or wrong.

Workday’s 2026 research puts a number on it. Across a large employee survey, the report finds that nearly forty per cent of AI time savings are consumed by rework, summarised as “for every ten hours of efficiency gained through AI, nearly four hours are lost to fixing its output”. CPA.com’s 2025 accounting data, citing Gartner, runs higher in regulated work, around sixty-nine per cent of the time saved is reabsorbed.

Why does it matter for your business?

The maths only works if both sides of the cost equation are visible. The vendor brochure shows the time saved. The subscription bill shows the licence cost. Neither shows the hours your senior people now spend on oversight, evaluation and recovery, often the highest hourly cost in the firm. Hubstaff finds that while eighty-five per cent of professionals say they use AI, AI-labelled activity is about four per cent of tracked work time.

For a twenty-person firm at 50 pounds loaded per hour, two hours per person per week of AI oversight is roughly 100,000 pounds a year. Against an AI tool stack of 30,000 pounds a year, the labour tax is three times the visible cost and entirely absent from the AI line. In a firm doing 1.5 million pounds of revenue at 35 per cent gross margin, that is around 19 per cent of gross profit going to a cost you have never sized.

Where will you actually meet it?

You meet it in the gap between the productivity numbers people quote in conversation and the bottom-line numbers in the management pack. Staff feel busier, delivering more output, or output of similar quality faster. The vendor case study said forty per cent time saved. Profit per head is unchanged, or worse. That disconnect is the labour tax sitting underneath the surface, absorbing the gain into oversight and expanded workload nobody priced for.

You meet it most acutely in regulated and high-stakes work. A junior uses AI to structure a client memo, a senior spends longer than usual unpicking subtle inaccuracies, the extra time is booked as “partner review” with no flag that the inaccuracies came from AI. A tax manager runs an AI-assisted reconciliation, then an hour goes into investigating anomalies that turn out to be AI edge cases, the hour is booked as “reconciliation review” and the source is invisible.

When does it kill the ROI, and when does it not?

It kills the ROI in three categories of work. High-stakes regulated work where oversight is structurally required, audit, compliance, regulated financial advice. Fact-heavy client-facing work where every claim needs verification before it leaves the firm and the verification time often exceeds the drafting time AI saved. Decision-recommendation work where the cost of acting on a wrong AI output is high and so the evaluation has to be careful and slow.

It does not kill the ROI in three other categories. Low-stakes drafting where minor errors carry little cost. Throughput tasks where AI provides a first pass that humans accept or reject quickly. Internal first-passes that are never seen externally and only need to be roughly right to be useful. The strategic move is to be honest about which category each AI use case sits in before you commit to a year of licences.

How do you size it in a week, without timesheets?

You size it by sampling, not by tracking everything. Pick three to five workflows where AI is in regular use and the work affects revenue or risk. Client report drafting. Management accounts. Marketing campaign production. Audit workpapers. Whichever fits. For each, pick one or two staff members already using AI in that workflow, tell them what you are doing, ask for their help for a week. The goal is honest ratios.

During the week, ask them to annotate their calendars and the artefacts they produce with two simple markers. Where AI was used, and roughly how much time was spent supervising or correcting its output. A one-hour block labelled “Client X report” might be annotated as thirty minutes of AI drafting, twenty-five minutes of intensive review, and five minutes of cleanup. You are after orders of magnitude, not stopwatch readings.

At the end of the week, sit down with each person for thirty minutes and walk the entries. Classify the AI-related time into the three buckets, oversight, evaluation, recovery. Note any incidents where AI mistakes caused downstream problems, client emails, or rework, even if those were not recorded at the time. Then do the arithmetic. Total task time, AI-assisted share, oversight-and-recovery share within that, extrapolated to staff numbers and loaded hourly cost.

The figure will not be precise. It will be honest, which is what is missing. Once the line exists, you can decide where to redesign the workflow, where to invest in better prompts and templates to cut oversight, where to reprice the work because the firm is now delivering it faster, and where to scale AI back because the oversight burden is eating the gain. There is a sister post on the staff time tax of running AI consulting engagements, which sits one level up at the vendor and project layer, where this post stays inside the firm’s own tool use.

If you want to find out what your own AI labour tax actually is, Book a conversation.

Sources

- Workday (2026). Beyond Productivity, Measuring the Real Value of AI. Survey finding that nearly 40 per cent of AI time savings are lost to rework, the "AI tax on productivity". https://www.workday.com/en-us/resources/research/beyond-productivity-real-value-of-ai.html - CPA.com (2025). AI in Accounting Report. Cites Gartner's 2024 Productivity Impact of AI Survey showing 5.4 hours per week gross AI time savings in finance, of which around 69 per cent is lost to rework, training and non-value-added tasks. https://www.cpa.com/ai-in-accounting-report-2025 - Thomson Reuters (2026). Future of Professionals Report, AI in Professional Services. Finds organisation-wide AI use rose from 22 per cent in 2025 to 40 per cent in 2026, only 18 per cent of firms track AI ROI at all. https://www.thomsonreuters.com/en/reports/future-of-professionals.html - Anthropic (2024). Economic Index analysis of 100,000 Claude conversations, median task time reduction of 84 per cent, with the authors cautioning that estimates likely overstate gains because post-conversation verification is excluded. https://www.anthropic.com/economic-index - Hubstaff (2025). Hidden AI Usage in the Workplace, finding that while 85 per cent of professionals report using AI, AI-labelled activity accounts for only about 4 per cent of tracked work time. https://hubstaff.com/blog/hidden-ai-usage-workplace - McKinsey & Company (2025). The State of AI, Superagency in the Workplace, on the need to redesign work and not just tasks to convert AI productivity into sustainable performance. https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai - Bommarito, M. et al (2024). UC Berkeley Haas ethnographic study of generative AI adoption in a 200-person technology company over eight months, on work intensification and the erosion of natural pauses. https://haas.berkeley.edu/research-news/the-ai-productivity-paradox - Valere (2025). The Hidden AI Infrastructure Tax, survey data showing 84 per cent of enterprises see gross margin erosion of six points or more from AI infrastructure costs, rising to sixteen points in heavy workloads. https://www.valere.io/insights/ai-infrastructure-tax - Simon-Kucher (2025). The AI pricing shift in professional services, argues firms should move from a single time-based model to a toolkit of pricing metrics aligned with output and value as AI compresses delivery time. https://www.simon-kucher.com/en/insights/ai-pricing-professional-services - ICAEW (2024). Pricing Professional Services, guidance on value-based pricing for owner-managed accounting and advisory firms in an AI-enabled delivery context. https://www.icaew.com/insights/practice-resources/pricing

Frequently asked questions

How big is the hidden labour tax in a typical owner-managed firm?

Five to fifteen per cent of gross margin in firms running heavier AI use. Workday's 2026 data puts rework at about 37 per cent of AI time savings, Gartner-CPA.com puts it as high as 69 per cent in regulated accounting work. For a twenty-person consultancy at 50 pounds loaded per hour, two hours per person per week of AI oversight is roughly 100,000 pounds a year. None of that lands on the AI line in the P&L.

How do I size it without putting timesheets in?

Pick three to five workflows where AI is already running, ask one or two people in each to annotate their calendars for a week with where AI was used and how long they spent supervising or correcting it. At the end of the week, sit down and walk the entries. You are not after minute-by-minute precision, you are after the ratio of oversight time to total task time. Multiply by staff numbers and loaded hourly cost.

Does this mean AI is not worth running?

AI is worth running when the gain is real and you have measured both sides of the ledger. Some oversight is structurally required, especially in regulated work. The point of sizing the labour tax is so you can decide where to redesign workflow, where to reprice work, and where to scale AI back because the oversight burden is eating the gain. Without that figure, every decision is vendor brochure arithmetic.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation