Government and public sector AI: where sovereignty isn't optional

2026-05-28 · Avery NXR

Government services have started deploying AI at scale. Citizen service chatbots that answer questions about benefits and processes. Document processing for benefits applications, permits, and licenses. Translation services for multilingual citizen interactions. Internal AI for analysts in defense, intelligence, and policy work. Court and judicial document workflows.

The deployments are real. The volume is enormous — government services touch most of the population, after all. And the constraints under which the work has to happen are the strictest of any operational AI category. Data sovereignty. FedRAMP and equivalent international frameworks. Citizen privacy frameworks. Procurement requirements that favor known vendors with security-cleared deployments.

For most government and public-sector AI deployments, the cloud LLM that the private sector uses is not just expensive — it is structurally inadmissible. The local-SLM case here is foundational, not marginal.

The shape of the work

Public sector AI workloads fall into several categories.

Citizen-facing service: the chatbot or virtual agent on a government website that answers questions about benefits, permits, voting, taxes, healthcare programs. Volume scales with population.

Document processing: applications for unemployment, social security, veterans' benefits, immigration, business licensing, permits. Each application is a document (or multiple), each goes through a workflow that increasingly includes AI augmentation.

Translation and accessibility: government communications translated into the languages of the population served, plus accessibility variants for vision, hearing, and cognitive needs.

Internal analyst work: in defense, intelligence, law enforcement, and policy research, AI is used to summarize documents, identify patterns, draft briefs, and analyze open-source intelligence.

Court and judicial: AI is being introduced into document review for litigation, legal research, and case management. The applications are early and the regulatory framework is still evolving.

Each category has its own volume profile, its own privacy constraints, and its own regulatory framework — but they all share the underlying property that the data being processed belongs to citizens, or to the state, and is subject to oversight regimes that the private sector doesn't have to navigate.

The constraints

The constraints make the cloud-LLM-default architecture structurally difficult.

Data sovereignty: many governments require that data about their citizens be processed in their jurisdiction, by entities subject to their law. Sending citizen data to a US-based cloud LLM is, for non-US governments, often legally prohibited. Sending non-US citizen data to a US cloud is sometimes prohibited by the citizen's home jurisdiction.

FedRAMP and equivalents: the US federal government's cloud authorization framework is detailed about which cloud services can be used for which kinds of data. Cloud LLMs operating under FedRAMP authorizations exist, but they cover specific model versions and specific use cases, and the authorization process is slow. Many state and local governments have similar frameworks.

Citizen privacy: government data about citizens is uniquely protected. PII, eligibility data, medical information, family information, financial information — all of it is constrained by frameworks that vary by jurisdiction but are generally more restrictive than private-sector frameworks.

Procurement: government procurement favors known vendors with security clearances, established contract vehicles, and demonstrable compliance histories. A startup with a clever cloud LLM integration faces significant procurement friction.

Audit and FOIA: government decisions are subject to public records requests, audits, and judicial review. A model's reasoning has to be inspectable and reproducible in ways that cloud LLM workflows often aren't.

For most workloads in government, "let's just use ChatGPT" is not a valid plan. The procurement office, the CISO, and the privacy officer will block it.

Why this is structurally a local-SLM case

Every standard property is present, plus several that are unusually strong.

The work is narrow. The model needs to know one agency's processes, one agency's terminology, one agency's eligibility rules. A model trained on the agency's own corpus outperforms a general model.

The work is repetitive. Government services involve enormous volumes of similar interactions — applications follow predictable patterns, eligibility checks follow predictable rules, citizen questions cluster into predictable categories.

The volume is enormous. A state department processing unemployment applications during an economic downturn handles tens of thousands of applications per day. Federal benefits programs handle millions per year.

The privacy constraints are structural, not optional. The local-inference architecture answers the regulatory questions in ways that cloud-LLM architectures cannot.

The audit trail matters. Government decisions are subject to public scrutiny in ways private decisions aren't. A local model that writes structured decisions alongside its work produces an audit trail that maps onto what oversight regimes expect.

What changes with local inference

A government AI workflow on a local SLM looks like this.

A model is fine-tuned on the agency's corpus — historical applications, eligibility rules, citizen interaction patterns, internal documentation. The fine-tuning is done in a compliance-controlled environment.

The model runs on infrastructure the agency controls — on-premises, in a FedRAMP-authorized private cloud, or in a sovereign cloud meeting the relevant jurisdictional requirements. The deployment is documented, audited, and approved.

Citizen interactions and document workflows flow through the inference pipeline. The model produces decisions, recommendations, and drafts. The audit trail is local and reviewable.

For multilingual deployments, the model may have multi-lingual fine-tuning to handle the languages of the population served — Spanish, Mandarin, Vietnamese, Tagalog, and so on for US deployments; similar lists for other countries.

The cost flips from per-interaction to fixed. Citizen interaction volume can scale during high-demand periods (open enrollment, tax season, disaster response) without the bill spiking.

When the cloud LLM is still possible

A narrow set of cases.

For workflows operating on fully public, non-PII data — open government data analysis, public-facing FAQ generation from already-public documents — the cloud LLM can be acceptable.

For specific cloud LLM services that have explicit FedRAMP authorization for the specific use case and data class, in the specific federal context, with all the procurement and security review complete.

For some research and analysis workflows in policy or program evaluation where the data is aggregate or fully de-identified.

For the bulk of public sector AI work — the citizen-facing, document-processing, eligibility-determining, benefits-administering work that constitutes most of what governments do at scale — the cloud LLM is structurally inadmissible, and the local-SLM architecture is the only one that works.

The pattern, in the public sector

Avery NXR is not a government tool. It scaffolds Next.js applications. The architectural pattern repeats.

Public sector AI is a narrow (within each agency's domain), repetitive, extreme-volume, sovereignty-constrained, audit-critical workload. The economics, the privacy story, the regulatory framework, the procurement requirements, the audit trail — every dimension favors local inference. The cloud LLM is not the right tool for this category; in many cases, it's not even an allowable tool.

The vendors that build excellent government AI on local infrastructure — with appropriate fine-tuning, sovereign cloud deployment models, FedRAMP and equivalent authorizations, and procurement-friendly business models — will own the public sector AI market. The cloud-LLM-default products will be relegated to a narrow set of use cases that fit within the existing authorization frameworks.

The pattern continues. Government is the workflow category where the architectural choice is made by procurement and compliance long before the cost case enters the conversation. The vendors that recognize this will build for sovereignty first, and find a large, durable, well-funded customer base waiting for them.