Government AI: The Illusion of Trust at Federal Scale

AI Infrastructure
Arjun Vedanta
May 21, 2026
0
6
6 minutes read

The Elusive Search for Trustworthy AI in Government

Washington’s earnest discussion of ‘trustworthy AI’ and ‘best practices for scaling’ in federal agencies often feels like a polite sidestep around the fundamental, intractable problems that plague large-scale AI deployment. While the Department of Energy (DOE) maps out risk mitigation and the General Services Administration (GSA) details vendor vetting, the unspoken truth is that true algorithmic trust remains an elusive ideal, particularly when grappling with the sheer complexity and opacity of systems designed to operate at petabyte and exabyte scales across myriad government functions. The current narrative, while well-intentioned, risks creating a false sense of security, delaying a more critical reckoning with what AI can and cannot realistically deliver in the public sector.

Pamela Isom, Director of the AI and Technology Office at the DOE, speaks passionately about driving a holistic view on AI and mitigating risk, emphasizing the need for data representativeness and trustworthy outcomes. Her team’s focus on policies and standards, guided by Executive Orders like 14028 and 13960, aims to transform the DOE into a ‘world-leading AI enterprise.’ This ambition, however, implicitly places the burden of ensuring trust onto internal processes and playbooks, like the forthcoming external version of her AI Risk Management Playbook. The playbook attempts to quantify ‘trust,’ for instance, by flagging when an 80% accuracy rate falls short of a 90% target, as if the remaining 10% is simply a matter of tuning rather than a potential indicator of systemic bias or fundamental model limitations.

This approach, while pragmatic for internal governance, reveals a critical blind spot: the presumption that trust is a definable, achievable metric rather than an ongoing, deeply complex human-machine interaction. The quest for ‘trustworthy AI’ within federal agencies frequently overlooks the inherent trade-offs between speed, scale, and true interpretability in black-box models that even their creators struggle to fully explain. The very assertion that AI is ‘beyond human capability’ and ‘can tell me what I’m going to do next before I contemplate it myself,’ as Isom noted, simultaneously underscores its power and the profound challenge of ever truly trusting a system whose mechanisms are opaque even to its operators.

Scaling AI: A Vendor-Driven Narrative?

Anil Chaudhry, Director of Federal AI Implementations for the GSA’s AI Center of Excellence (CoE), outlines a series of ‘best practices’ for implementing AI at scale, framing the government’s role as partnering with industry subject matter experts. This partnership model is pervasive in federal digital transformation efforts, driven by the CoE’s mission to accelerate modernization and improve efficiency. While logical on the surface, this structure also establishes a profound dependency on external vendors, whose incentive is to present AI solutions as ready for immediate, large-scale deployment.

Chaudhry’s five best practices — vetting commercial experience with vast datasets, assessing AI talent, ensuring access to financial capital, securing logistical capital in the form of authoritative data, and planning physical infrastructure — essentially become a procurement checklist. Each point, from navigating petabytes of data to predicting the volatile flow of capital for AI projects, highlights the sheer resource intensity required. Yet, by focusing on these tactical criteria, the conversation sidesteps the strategic implications of outsourcing core algorithmic intelligence. Agencies are advised to ask vendors about their strategies for trend analysis and sustainability against ‘drift of data’ for bots like Robotic Process Automation (RPA), but these inquiries often probe process rather than fundamental architectural choices that dictate true trustworthiness.

This framework tacitly shifts the responsibility for the deeper ethical and societal impacts of scaled AI from the federal purchaser to the private provider, whose commercial imperatives may not always align with public good. When Chaudhry asks, “If you buy something, how will you know you got what you wanted when you have no way of evaluating it?” he inadvertently articulates the central paradox of large-scale AI adoption. Without robust, independent evaluation capabilities within government, agencies are left to trust the assurances of vendors who profit directly from expanded deployment. This incentive structure ensures a continuous push towards scaling, regardless of whether the underlying AI truly meets the lofty ‘trustworthy’ standard or merely appears to do so on paper.

The Unacknowledged Costs of Rapid AI Integration

The federal government, with ‘every federal agency hav[ing] some sort of AI project going on right now,’ is aggressively pursuing AI adoption. This rapid integration, while framed as essential for national security and economic vitality, masks a significant risk: the systemic erosion of institutional knowledge and control over critical operational capabilities. By relying heavily on commercial off-the-shelf solutions and vendor expertise, government agencies risk becoming mere consumers of AI, rather than informed developers or even truly sophisticated overseers.

The emphasis on external talent and capital access, while practical, also underlines a deeper deficiency in organic federal AI capabilities. While Chaudhry rightly stresses the importance of training the workforce to ‘imagine AI tools,’ the deeper question of whether the government is developing its own algorithmic literacy – the capacity to critically assess, audit, and even develop novel AI solutions independently – remains largely unaddressed. Without this internal capacity, the ‘trust’ in AI becomes entirely dependent on the integrity and competence of external partners, creating vulnerabilities far beyond mere technical risk.

Ultimately, the conversations emerging from events like AI World Government, while providing useful operational guidance, are symptomatic of a broader institutional reflex: to manage the novel by imposing familiar frameworks. ‘Trustworthy AI’ becomes a set of compliance checkboxes, and ‘scaling best practices’ a procurement roadmap. What’s often missing is a frank acknowledgment that the current generation of AI, powerful as it is, carries inherent limitations, particularly regarding bias, interpretability, and true generalizability across diverse, high-stakes government applications. Until this deeper, more skeptical interrogation occurs, the federal push for AI risks building impressive, yet potentially fragile, digital edifices on foundations that remain largely unexamined.

The Elusive Search for Trustworthy AI in Government

Scaling AI: A Vendor-Driven Narrative?

The Unacknowledged Costs of Rapid AI Integration

Arjun Vedanta

Follow us: