Version control for SME knowledge, the proportionate approach

A small business owner and her operations lead at a desk, looking at a laptop screen showing a folder of documents, the owner pointing at one file
TL;DR

Version control grew up in software, where multiple uncoordinated versions of the same artefact are catastrophic. Many SMEs accept the cost for non-code work as the price of doing business. The proportionate discipline for a five-to-fifty-person firm is three rules: one canonical version per artefact, consistent simple naming, and superseded versions moved to a clearly marked archive. Built-in version history in Google Docs, Microsoft 365 and Notion does the mechanical work. Explicit discipline carries the rest.

Key takeaways

- Version chaos costs more once AI is reading your documents, because the model retrieves what looks plausible without asking which version is current. Air Canada paid a customer compensation because its chatbot quoted superseded policy. - The three rules that do most of the work: one canonical version per artefact, consistent simple naming with incrementing numbers and no FINAL labels, and superseded versions moved to an Archive folder. - Built-in version history in Google Docs, Microsoft 365 and Notion handles change tracking inside a single collaborative tool. The discipline still has to cover the boundaries where documents escape into email, PDF and shared drives. - The naming convention that works outside collaborative tools is a descriptive name, a hyphen and an incrementing number. ISO date in front if chronological order matters. No FINAL, no DRAFT, no parenthetical v2 (1). - The proportionate approach runs out when document volume explodes, approval workflow needs sign-off, regulation demands tamper-proof records, or AI moves into core workflow. The step up is lightweight DMS at around £100 to £300 a month, not enterprise content management.

The owner of a twelve-person consultancy noticed it on a Friday afternoon. Her account director had emailed a client the v3 of a proposal that morning. When the client came back with a follow-up question, the AI summary tool the firm uses pulled its answer from the v4 sitting in a different folder, written two days later and never sent. The two replies contradicted each other on price. Nobody on the team had done anything wrong. The firm had four versions of the same proposal in three different places, and the AI tool had taken the most recent one it could find.

She is not unusual. For many owner-managed firms, version chaos has been the price of doing business since the day someone first typed Proposal_v4_dh_FINAL_final2.docx and emailed it round. The cost was tolerable when only humans were reading the documents, because humans ask clarifying questions and notice when things contradict. AI tools do not. The discipline that solves it is three rules, applied with consistency, and almost none of the work needs new software.

Why does version control matter more once AI is reading your documents?

Humans reading documents ask clarifying questions, notice contradictions and verify intent. Generative AI does none of those things. When two versions of the same document sit in the retrieval pipeline, the model conflates them and returns something plausible without asking which one is current. The version with the older price wins as often as the version with the new one, and nobody on the team is in the loop.

Research by Dr Philippa Hardman found that around 80% of leading models misread documents regularly, because the model compresses everything into one numerical representation and updates it as it reads more. The Air Canada case put a number on the cost. The airline’s chatbot quoted a bereavement discount policy that had been superseded, and a tribunal ordered Air Canada to compensate the customer. The error sat in the document pipeline that fed the model an outdated policy alongside the current one. For a small firm, the equivalent is the AI tool reading the wrong proposal, the wrong terms, the wrong procedure. The cost is paid in client confidence.

What are the three rules that do most of the work?

Three rules carry the load. One canonical version per artefact, kept in one place everyone knows about. Consistent simple naming, incrementing numbers, no status labels in the filename. Superseded versions moved into an Archive subfolder once a new version is approved. The discipline is small enough to teach a new hire in an hour and large enough to prevent the version conflict that has AI tools quoting the wrong price.

Rule one, one canonical version per artefact

Every document the business cares about should have one version the team considers current. That version lives in one folder, on one system, with a name everyone can recognise. Drafts are either deleted once the current version is approved or moved to archive. For documents created collaboratively, the canonical home is the shared folder in Google Workspace, Microsoft 365 or Notion. For documents created by one person and approved by others, it is a designated Approved or Current folder. Everything else moves out.

Rule two, consistent and simple naming

Descriptive name, hyphen, incrementing version number. Client-onboarding-01.docx, then 02, then 03. Minor correction inside a version, add a decimal: 03_01. If chronological order matters, put the ISO date in front: 2026-05-11-client-onboarding-03. No FINAL. No DRAFT. No parenthetical v2 (1). The reason is mechanical: filenames travel. A file called Proposal_FINAL.docx tells the recipient it is final even if it has been wrong for six weeks. A file called Proposal_04.docx tells them nothing except its position in the sequence, which is the honest signal. Status belongs in a version table at the front of the document or in the system metadata, never in the filename. This matches the US National Archives guidance and Princeton’s records-management standard.

Rule three, superseded versions move to archive

Once a new version is approved, the previous version moves to a subfolder explicitly named Archive or Superseded. This does three things. It removes the older version from everyday search results so the team is less likely to retrieve the wrong one. It creates an audit trail if the document ever needs to be reconstructed for a dispute. And it clarifies for everyone, including the AI tool, that the version in the active folder is the current one. For critical documents, contracts, signed deliverables, approved procedures, rename the archived copy with the date it was superseded: client-onboarding-02-archived-2026-05-11.pdf. The Archive folder holds the history. The active folder holds the truth.

Where do built-in tools do the heavy lifting?

Modern collaborative platforms handle a useful slice of the discipline automatically. Google Docs keeps a full edit history with named contributors and the ability to restore any earlier state. Microsoft 365 SharePoint lets administrators set retention limits, automated archival and audit logs. Notion records change history and supports version templates. For a team inside a single collaborative platform, the mechanical version-tracking is handled. The three rules cover the boundaries the platforms cannot reach.

Documents leave collaborative tools the moment they are exported to PDF, sent by email, downloaded for offline editing or printed for a meeting. At every boundary the version metadata vanishes. The proposal exported as a PDF for a client carries no record of which Google Docs version it came from. The contract amended in Word from a PDF the client returned with tracked changes has lost its connection to the cloud original. For SMEs running a hybrid stack, accounting in one system, CRM in another, knowledge in shared drives, contracts in email, no single tool covers everything. The explicit naming and archiving rules earn their keep where the platforms cannot reach.

When do the proportionate rules run out?

The three rules work well for a stable firm with modest document volume, a few thousand active documents and a team of fifty or under. They start to creak when one of four things happens: document volume explodes, formal approval workflow arrives, regulation demands tamper-proof records, or AI moves into the core workflow and the cost of a version error rises sharply. When one of those signals lands, manual discipline is past its useful life.

Each signal has a recognisable shape. Volume becomes the problem when documents pile up faster than anyone can archive them. Approval workflow becomes the problem when drafts need team review, legal check and sign-off, with audit evidence at each step. Regulation becomes the problem when FCA, ICO or ISO rules require role-based access and retention enforced automatically. AI becomes the problem when the team is no longer occasionally summarising a document but routinely asking an assistant to read contracts, policies or procedures.

The step up is lightweight document management software, not enterprise content management. Tools like DocuWare, the SME-scale Hyland products and cloud-native DMS layers that sit on Microsoft 365 or Google Workspace cost roughly £100 to £300 a month for a firm of twenty to fifty people. They bake the three rules into the system: the canonical version is the only one users see by default, version numbering is enforced, and superseded versions archive automatically. Enterprise content management platforms cost ten times that and are built for organisations of several hundred people with multi-site governance. They are almost never the right next step.

What does the starter version look like for an owner reading this on a Friday afternoon?

Half a day of work, no software purchase. Pick the five or six document types that matter most to your business, proposals, contracts, standard operating procedures, client deliverables, internal policies. For each, name the canonical folder, the naming convention you want the team to follow, and where superseded versions go. Write the decisions down on one page, send it round, and make someone accountable for each document type.

Run the discipline through a fifteen-minute monthly tidy. Walk the active folders, move anything superseded into archive, confirm the canonical version is the one the AI tool would find when it goes looking. That is the proportionate version. It works, it costs nothing, and it holds until the business has genuinely outgrown it. If you want help working out which document types matter most and what the next step looks like once the rules run out, book a conversation.

Sources

- National Archives and Records Administration (2017). Best Practices for File Naming. The US federal records-management standard for filename uniqueness, length, ISO 8601 dates and the case against status labels inside filenames. https://records-express.blogs.archives.gov/2017/08/22/best-practices-for-file-naming/ - Princeton University Records Management (2024). File Naming Conventions and Version Control. The university records-management reference for incremental version numbering, version control tables and decimal sub-versions. https://records.princeton.edu/records-management-manual/file-naming-conventions-version-control - Information Commissioner's Office. Information Governance for Your Small Business. The UK regulator guidance that documents must be findable, properly named and saved where the business expects them, with information governance treated as part of GDPR compliance. https://ico.org.uk/media2/migrated/4020350/information-governance-for-your-small-business-v-1-0.docx - Trustible (2024). Poor Document Retrieval Accuracy in AI Systems. The case file behind the Air Canada chatbot compensation order and other AI errors traced to stale policy documents in the retrieval pipeline. https://trustible.ai/ai-risks/poor-document-retrieval-accuracy/ - Hardman, P. (2024). 80% of AI Models Misread the Docs. Research on how language models compress documents into a single numerical representation, which degrades comparison across versions and conflates semantically similar but temporally different content. https://drphilippahardman.substack.com/p/80-of-ai-models-misread-the-docs - Microsoft Learn (2024). Document Library Version History Limits in SharePoint. The Microsoft 365 reference for configuring automated version retention, audit logging and version limits inside the collaborative tool. https://learn.microsoft.com/en-us/sharepoint/document-library-version-history-limits - Crown Records Management (2024). Signs Your SME Has Outgrown Manual Document Handling. Industry benchmark for the productivity cost of manual document handling, with an average of around 2.5 hours per person per day spent on document search at firms past the proportionate stage. https://www.crownrms.com/dms/signs-your-sme-has-outgrown-manual-document-handling/ - LogicalDOC (2024). How Much Time Is Wasted Searching for Documents in a Company. The headline figure for fifteen to thirty minutes per person per day lost to document search in firms without version discipline, used here to ground the SME cost band. https://www.logicaldoc.com/blog/650-how-much-time-is-wasted-searching-for-documents-in-a-company - Hyland (2024). ECM vs DMS, the Difference and When Each Fits. The reference for why a fifty-person firm should reach for lightweight document management rather than enterprise content management, with the cost band that applies. https://www.hyland.com/en/resources/articles/ecm-vs-dms

Frequently asked questions

Do we really need version control if everyone uses Google Docs or SharePoint?

The collaborative tools handle change tracking inside themselves well. Where the discipline still matters is at the boundaries. A document gets exported as a PDF and emailed to a client. A procedure is downloaded and saved locally before a flight. A contract is signed in Word and the signed copy lives in a different folder. At every boundary the version history is lost, and that is where the three rules earn their keep. Built-in tools cover the middle. You cover the edges.

What naming convention do you actually use?

Descriptive name, hyphen, incrementing version number. Client-onboarding-01, then 02, then 03. For a small correction inside a version, add a decimal: 03_01. If chronological sort matters, put the ISO date in front: 2026-05-11-client-onboarding-03. No FINAL. No DRAFT. No parenthetical duplicates. The status of a document belongs in a version table at the front of the document or in the system, not in the filename, because filenames get separated from their context the moment a file is copied or sent.

When have the three rules run out and what is the next step?

Four signals. Document volume passes a few thousand and manual archiving becomes a chore. Approval workflows need formal sign-off and an audit trail. Regulation, FCA, ICO, ISO or sector-specific, demands tamper-proof records. Or AI moves from occasional tool to core infrastructure and version errors start surfacing in chatbot outputs. When one of these lands, the step up is lightweight document management software at roughly £100 to £300 a month, not enterprise content management. The rules stay the same; the software enforces them.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation