Compact AI models for finance teams: what works in regulated workflows

A professional reviewing printed documents at a desk with a laptop open beside them
TL;DR

Compact AI models can reduce document-heavy admin in finance teams and regulated workflows, but model size does not determine governance risk. The FCA, ICO, and NCSC all hold firms accountable for AI outputs regardless of how the model is deployed. Assistive use cases such as invoice classification, policy lookup, and first-draft compliance summaries are the safest starting point, provided outputs are validated and a human signs off before they affect clients or regulatory processes.

Key takeaways

- A compact AI model carries the same governance obligations as any other AI tool in a regulated workflow. Model size affects cost and latency, not regulatory responsibility. - The FCA, ICO, and NCSC all hold firms accountable for AI outputs in regulated processes, even where the model runs on the firm's own infrastructure. - The safest use cases in finance are assistive: invoice classification, policy lookup, first-draft compliance summaries, and meeting notes with human review. Decisive use cases such as credit decisions require a named human sign-off process before deployment. - The "constrained retrieval plus human review" architecture is the most defensible pattern for regulated finance work, producing traceable, citable outputs aligned with FCA and ICO expectations. - UK firms with EU clients or group exposure should assess whether the EU AI Act applies to their compact model deployments, particularly where AI assists in credit, insurance, or employment-related decisions.

The vendor described it as safe. A compact AI model running on the firm’s own servers, processing invoices and drafting compliance summaries with no data leaving the building. The finance director liked the logic: private infrastructure, controlled environment, no cloud exposure. The compliance officer then asked the question nobody had answered. If this model produces output that touches a client account or a regulated process, who is accountable for that output?

That question is the one the FCA, ICO, and NCSC are all shaping their guidance around right now.

What is a compact model in a finance context?

A compact model (sometimes called a small language model) is an AI system with far fewer parameters than a frontier model like GPT-4. In practice, it can run on a firm’s own servers rather than sending data to an external provider. For finance teams, the appeal is keeping client data inside the firm’s own infrastructure while still getting AI help with document-heavy tasks.

Size is a cost and latency variable, not a governance one. Compact models built for domain-specific tasks can be highly capable at classification, extraction, and text generation within a narrow scope. Some are designed specifically for finance applications: expense categorisation, ledger reconciliation notes, policy Q&A over internal documents. What actually matters from a governance standpoint is what the model does with the data it sees, and who takes responsibility for the output.

That distinction matters because the UK’s regulatory framework for AI in financial services contains no size exemption.

Why does this matter for regulated firms?

The FCA, ICO, and NCSC have all issued AI guidance since 2023, and none of it scales down its expectations based on model size. The FCA’s discussion paper on AI in financial services focuses on governance, accountability, and consumer harm. The ICO’s 2024 generative AI guidance focuses on lawful basis, human oversight, and data protection impact assessments. Neither cares how small the model is.

The FCA has been direct on accountability: using a third-party model or outsourcing to a vendor does not transfer responsibility away from the authorised firm. The FCA’s Principles for Businesses apply regardless. The ICO adds that where AI processes personal data in high-risk ways, firms must complete a Data Protection Impact Assessment before deployment. In finance, that obligation covers AI used in credit processing, complaints handling, customer profiling, and fraud detection.

The NCSC’s guidance adds a further dimension. A model running on your own servers is not automatically safer than one accessed via an API. Prompt injection, weak access controls, and unlogged outputs are risks that live at the infrastructure level. Running a model internally is an operational choice; it is not a substitute for the security controls the NCSC expects. Treating private deployment as sufficient control is one of the clearest mistakes firms make when rolling out AI in regulated workflows.

Where will you actually meet compact models in finance work?

The use cases with the best track record are assistive, not decisive. Invoice and expense classification, policy and procedure lookup, first-draft compliance summaries, meeting notes with human review, internal Q&A over a restricted knowledge base. These work because the model helps without making the final call. That fits what the FCA, ICO, and NCSC all emphasise: accountability, transparency, and human oversight of outputs.

The Bank of England has long recognised that financial reporting is burdensome for smaller firms. A compact model that extracts, classifies, and routes routine information from documents can reduce that load, provided the firm validates outputs and maintains an audit trail. UK Finance and the government’s 2025 Financial Services Growth and Competitiveness Strategy both point towards AI-enabled productivity as a priority for the sector. The direction is towards safe experimentation, and safe experimentation requires governance infrastructure to match.

The practical architecture that works well here is retrieval-augmented generation (RAG). The model does not answer from its general training data; it retrieves from a set of firm-approved documents, then generates a response grounded in those sources. For a compliance team, that means traceable, reviewable outputs rather than plausible-sounding answers with no basis in the firm’s actual policies. This pattern, constrained retrieval plus human review, is more defensible under UK governance expectations than open-ended chat over live client records.

When does a compact model make sense, and when should you step back?

The practical divide is between assistive AI and decisive AI. Assistive use cases include drafting, summarising, classifying, and searching. Decisive use cases include creditworthiness assessments, suitability determinations, claims denials, and suspicious activity report triage. The FCA and ICO apply much stronger governance expectations to the second group, and a compact model cannot substitute for the human sign-off those decisions require.

The commercial case is strongest where AI removes repetitive manual steps, not where it replaces professional judgement. For many SME services firms, the likely gains are minutes saved per document, faster turnaround on routine queries, and better consistency on first drafts. The risks arrive when change control is weak, when staff use unsanctioned tools on regulated work, or when nobody can identify who reviews and approves outputs before they affect a client.

The practical move before deploying any compact model in a regulated workflow is to map each use case onto one of two columns: assistive (model helps, human decides) or decisive (model output triggers or informs a consequential action). Any use case in the second column needs a documented review process, a named reviewer, and a clear policy on how model-generated output is recorded and challenged. That mapping exercise costs nothing and resolves many of the governance questions before deployment begins.

Three concepts come up regularly when finance teams and regulated firms start deploying compact models. Retrieval-augmented generation (RAG) is the architecture that makes constrained, citable AI practical in finance work. Data protection impact assessments (DPIAs) are what the ICO expects before AI deployment in high-risk contexts. The EU AI Act also creates binding obligations for UK firms with EU clients or group exposure.

RAG matters because it constrains where the model looks for answers. It retrieves from a curated document set you control, then generates a response grounded in those documents. For a compliance team, that means traceable outputs rather than hallucinated versions of internal policies.

DPIAs are increasingly relevant as finance AI use scales. The ICO’s guidance is clear: where AI is used in high-risk contexts including credit, insurance, or employment-related decisions, a DPIA is expected before deployment begins.

The EU AI Act is worth reviewing even for UK-only operations. If the firm serves EU-based clients, processes data on behalf of EU entities, or operates within a group with EU exposure, the Act’s high-risk AI rules may apply. Those rules include mandatory conformity assessments and registration requirements for certain finance-related applications, and they are stricter than the UK’s current framework. A UK firm that assumes its domestic compliance position covers EU exposure is taking on risk it may not have mapped.

If you want to talk through where your firm sits on the assistive-versus-decisive line, Book a conversation.

Sources

- FCA (2023). AI and Machine Learning in Financial Services (Discussion Paper DP5/23). Sets out the FCA's expectations on governance, accountability, and consumer harm for AI in financial services. https://www.fca.org.uk/publications/discussion-papers/dp5-23-ai-and-machine-learning-financial-services - ICO (2024). AI and data protection guidance. Sets out lawful basis, transparency, accuracy controls, and human oversight expectations for organisations using generative AI. https://ico.org.uk/for-organisations/uk-gdpr-guidance-and-resources/artificial-intelligence/ - FCA. Principles for Businesses (PRIN 2.1). Confirms that FCA Principles still apply to authorised firms even when using third-party AI or outsourced models. https://www.handbook.fca.org.uk/handbook/PRIN/2/1.html - NCSC (2024). AI and cyber security guidance. Covers secure deployment controls including access control, logging, prompt injection risks, and the limits of private hosting as a security control. https://www.ncsc.gov.uk/guidance/artificial-intelligence-and-cyber-security - EU (2024). Regulation (EU) 2024/1689 (EU AI Act). Establishes risk-based obligations for AI systems, including high-risk rules for credit, insurance, and employment AI relevant to UK firms with EU exposure. https://eur-lex.europa.eu/eli/reg/2024/1689/oj - Bank of England (2021). Data collection reform plan for the UK financial sector. Recognises the document burden on smaller firms and the potential for AI to assist with classification and routing of regulatory information. https://www.bankofengland.co.uk/-/media/boe/files/paper/2021/transforming-data-collection-from-the-uk-financial-sector-a-plan.pdf - UK Finance (2025). Plan for Growth report. Industry body perspective on AI-enabled productivity and regulatory simplification in UK financial services. https://www.ukfinance.org.uk/system/files/2025-03/Plan%20for%20Growth%20Report.pdf - HM Treasury (2025). Financial Services Growth and Competitiveness Strategy overview. Sets out the UK government's regulatory reform and innovation agenda for financial services. https://www.gov.uk/government/calls-for-evidence/financial-services-growth-and-competitiveness-strategy/outcome/financial-services-growth-and-competitiveness-strategy-overview - European Parliament (2023). Recent Trends in UK Financial Sector Regulation and Possible Divergence (study). Analyses UK/EU regulatory divergence post-Brexit, relevant for UK firms assessing EU AI Act exposure. https://www.europarl.europa.eu/RegData/etudes/STUD/2023/740067/IPOL_STU(2023)740067_EN.pdf

Frequently asked questions

Can I use a compact AI model for regulated financial workflows without informing the FCA?

The FCA's governance expectations apply to the workflow, not to whether the AI is large or small. If the model assists with a regulated activity or affects a client, the FCA's Principles for Businesses still apply to the authorised firm. The ICO also requires firms to have a lawful basis and transparency controls in place before using AI that processes personal data.

What is a data protection impact assessment, and do I need one for a compact model?

A DPIA is a structured assessment of the data protection risks in a processing activity. The ICO expects organisations to complete one before deploying AI in high-risk contexts, which in finance includes credit processing, complaints handling, and customer profiling. The requirement applies regardless of whether the AI is a small local model or a large cloud-based one.

What is retrieval-augmented generation and why does it matter for finance AI?

Retrieval-augmented generation (RAG) is an architecture where the model retrieves information from a curated set of documents before generating a response, rather than answering from its training data alone. For finance teams, this means the model cites the firm's actual policies and procedures rather than producing plausible-sounding answers from general training. That makes outputs traceable, reviewable, and more defensible under FCA and ICO governance expectations.

This post is general information and education only, not legal, regulatory, financial, or other professional advice. Regulations evolve, fee benchmarks shift, and every situation is different, so please take qualified professional advice before acting on anything you read here. See the Terms of Use for the full position.

Ready to talk it through?

Book a free 30 minute conversation. No pitch, no pressure, just a useful chat about where AI fits in your business.

Book a conversation

Related reading

If any of this sounds familiar, let's talk.

The next step is a conversation. No pitch, no pressure. Just an honest discussion about where you are and whether I can help.

Book a conversation