A practical method for checking ChatGPT answers before you act

A person at a desk reviewing a document with a pen in hand, natural window light, home office
TL;DR

ChatGPT produces confident, well-formatted text whether its claims are accurate or invented. Owner-operated businesses remain fully accountable for the content they send to clients, regardless of how it was generated. A practical three-check method covering names and numbers, currentness, and source authority takes under ten minutes per output and satisfies both good business practice and UK regulatory expectations around accuracy and oversight.

Key takeaways

- ChatGPT does not signal when it is wrong, so verifying outputs against primary sources is a deliberate step that must happen before acting on them. - OpenAI itself recommends against treating model outputs as a sole source of truth, and UK regulators including the ICO and FCA hold businesses accountable for accuracy regardless of AI involvement. - The 2024 Air Canada chatbot tribunal ruling confirmed that businesses own their AI-generated outputs and cannot transfer liability to the model developer. - A three-check method covering names and numbers, currentness, and source authority can be applied to any business document in under ten minutes. - Customer-facing, regulated, and citation-bearing outputs always warrant the full checking sequence; internal brainstorming carries lower risk but is not automatically exempt.

A services firm puts together a client briefing. One section summarises recent regulatory guidance, and three of the cited references are fabricated. ChatGPT generated the paragraph with complete confidence, no hedges, no uncertainty. The person who approved the document never searched for the sources, because the text looked right.

This is not a fringe scenario. OpenAI’s own guidance says users should not rely on model outputs as a sole source of truth, and recommends checking critical facts, including legal, financial and medical information, against authoritative sources before acting. For an owner-operated business using ChatGPT in client work, that means having a checking method: one that takes minutes rather than hours and that anyone in your team can apply consistently.

What does it mean to check a ChatGPT answer?

Checking a ChatGPT answer means separating what the model asserts from what you can actually verify. ChatGPT produces confident, well-formatted text whether its claims are correct or invented, and it does not signal the difference. OpenAI’s own guidance says users should not rely on model outputs as a sole source of truth and recommends checking critical facts against authoritative sources before acting.

The practical failure modes are three. Fabricated citations: the model produces references that do not exist, formatted correctly, with plausible-sounding titles. Outdated information: training data has a cutoff, and the model cannot know what changed after it. Wrong attribution: the model associates a real source with a claim that source does not actually make.

None of these failures announces itself. The text reads normally. The numbers are formatted. The source looks real. For a small services firm using ChatGPT to draft proposals, briefings, or client communications, this is the core risk: the output looks authoritative because the language is polished, not because the facts have been verified.

Why do wrong answers cause real damage?

For an owner-operated business, accountability for what goes to clients stays with you regardless of how the content was generated. The ICO is clear that organisations using generative AI remain responsible for accuracy obligations under UK data protection law. The FCA has made the same point for regulated firms: AI-generated outputs do not transfer a firm’s accountability for misleading or incorrect information to the model’s developer.

The Air Canada case from 2024 is the most cited illustration of what happens when this is ignored. A BC tribunal held Air Canada responsible after its chatbot gave a passenger a misleading bereavement fare explanation. The airline’s defence, that the chatbot was a separate entity with its own responsibility, was rejected. The business owned the output.

For small firms, the damage often lands before any regulatory action. A fabricated citation in a client-facing document damages professional credibility. A wrong number in a pricing summary creates a false expectation. A misquoted regulation in a compliance briefing creates liability exposure well before anyone files a complaint.

The ICO’s guidance adds a UK-specific dimension: if ChatGPT output contains or concerns personal data, the firm needs a lawful basis, an accuracy check, and a clear processing purpose. AI use does not remove UK GDPR obligations, and for any firm preparing materials that reference individuals, clients, or employees, this applies directly.

Where are you most likely to meet this problem?

The highest-risk outputs are the ones that look the most credible. ChatGPT is fluent and well-structured even when it is wrong, which means the errors most likely to cause harm appear in high-stakes documents that nobody double-checks precisely because they look authoritative. Client briefings, proposals with financial figures, regulatory summaries, and complaint responses are all high-risk categories.

For services firms specifically, four output types carry the most risk. Financial summaries where the model produces plausible-looking numbers without basis in the source data. Regulatory or legal references where the model invents citation-formatted claims about rules that do not exist. Research blending real and invented sources, presenting fabricated papers alongside genuine ones. Customer-facing explanations of products, policies, or processes where imprecision creates misleading impressions.

The NCSC identifies a parallel risk: if staff are pasting sensitive client or business information into public AI tools to produce these outputs, data exposure runs alongside accuracy risk. Protecting what goes into the prompt matters as much as verifying what comes out.

When should you run the full check, and when can you be lighter?

The checking method should be proportionate to the stakes of the output. A ChatGPT response used for internal brainstorming or rough-drafting before a subject-matter expert reviews it carries lower risk than text going directly to clients or regulators. The question to ask before checking is: who acts on this output, and what do they lose if it turns out to be wrong?

Four categories warrant the full sequence every time: customer-facing communications, materials with legal or financial implications, anything that cites external sources, and any output that will be treated as a recommendation rather than a starting point.

Two categories where a lighter approach is defensible: internal documents where the primary reader has the expertise to spot errors, and brainstorming outputs where factual precision is not the goal.

One counterintuitive point worth holding: if your firm already has strong editorial or quality controls, a formal checking sequence is additive rather than a replacement. The value of a named method is consistency. It means the review happens the same way every time, by everyone in the team, not only by the people who happen to be careful.

What does a practical checking method look like?

The three-check rule is a method any non-technical team can apply in under ten minutes per output. Check names and numbers against the best available primary source. Check whether the dates and facts are current. Assess whether the sources cited actually exist and whether they are authoritative on the specific point the model has used them to support.

The sequence runs in this order. First, underline every specific claim in the output: numbers, dates, citations, named organisations, regulatory references. Separate facts from opinions and recommendations. Second, check each factual claim against a primary source, your own documents, Companies House, a regulator’s website, contract terms, or the cited source itself if it exists. Third, cross-check important claims with one independent secondary source, a reputable trade publication, a professional body, or an industry database.

Reject or rewrite any claim you cannot source. The temptation is to leave uncertain content in and add a hedge. That approach is usually worse than removing the claim, because a hedged wrong answer still propagates the error.

For customer-facing materials, add one more step: sign-off from someone who can justify the claim from source documents, not from the AI output. The reviewer’s job is to confirm accuracy, not to judge whether the text reads well.

Keep a short audit trail. Store the prompt, the output, the sources checked, and the approved wording. This takes two minutes and means you can reconstruct the checking process if a client queries a claim later. The ICO’s guidance on accuracy and proportionate human oversight supports exactly this kind of documented approach.

If you want to apply this method across your business rather than just for individual documents, Book a conversation and we can work through where the verification gaps actually sit.

Sources

- OpenAI (2024). Usage policies and safety guidance for ChatGPT. States that model outputs can include false or misleading information and should not be used as a sole source of truth. https://openai.com/policies/usage-policies/ - OpenAI (2024). ChatGPT disclaimers and error acknowledgement. Direct acknowledgement that ChatGPT can make mistakes and that important information should be verified against authoritative sources. https://help.openai.com/en/articles/6825453-chatgpt-release-notes - ICO (2024). Generative AI guidance for organisations. Sets out accuracy, fairness and transparency obligations under UK GDPR when using generative AI in contexts involving personal data. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/artificial-intelligence/generative-ai/ - ICO (2024). AI and data protection guidance. Requires human oversight proportionate to risk and evaluation of AI output for accuracy before use. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/artificial-intelligence/ - FCA (2024). AI and machine learning in financial services. States that firms remain accountable for decisions and customer treatment even when AI generates the output. https://www.fca.org.uk/firms/artificial-intelligence-machine-learning-financial-services - NCSC (2024). Large language models and generative AI: guidance for organisations. Advises limiting sensitive data in prompts and reviewing outputs for security concerns before use. https://www.ncsc.gov.uk/collection/large-language-models-and-generative-ai - UK Government / Cabinet Office (2023). Generative AI framework for civil servants. Recommends reviewing AI outputs for accuracy, bias and inappropriate content before use, a practical baseline for SMEs. https://www.gov.uk/government/publications/generative-ai-framework-for-civil-servants - BC Civil Resolution Tribunal (2024). Moffatt v. Air Canada (2024 BCCRT 149). Held Air Canada responsible for misleading information given by its chatbot, confirming that businesses own their AI-generated customer communications. https://www.canlii.org/en/bc/bccrt/doc/2024/2024bccrt149/2024bccrt149.html - EUR-Lex (2024). Artificial Intelligence Act (Regulation (EU) 2024/1689). Sets out transparency, governance and risk-management obligations for certain AI applications, with obligations phasing in from 2025. https://eur-lex.europa.eu/eli/reg/2024/1689/oj - CMA (2024). Consumer protection law guidance for traders. Confirms that misleading statements in AI-generated content remain subject to consumer protection rules on misleading actions and omissions. https://www.gov.uk/government/collections/consumer-protection-law-for-business

Frequently asked questions

How do I know if a ChatGPT citation is real?

Search for the source directly. If the citation has an author, title and publication, verify each component separately. Many fabricated citations look plausible because the model constructs realistic-sounding references. Searching for the exact title is the fastest test; if no credible result appears, the citation is almost certainly invented. Never include a source in client materials without confirming it exists.

Does fact-checking ChatGPT output take a lot of time?

For a standard business document, a structured check runs in under ten minutes. The sequence is to check names and numbers against a primary source, confirm the information is current, and verify that any cited sources actually exist. The time-consuming version is checking everything after a client has already raised a concern.

Am I legally responsible if ChatGPT gives my client wrong information?

Yes. The ICO and FCA are clear that businesses remain accountable for the accuracy and fairness of content sent to clients, regardless of whether AI generated it. The 2024 Air Canada tribunal ruling confirmed the same principle: the business owns the content, not the tool. Professional indemnity exposure applies the moment you pass on an unverified AI claim.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation