Federal

AI Leadership in 2026: Governance Is Table Stakes, Delivery Is the Job

Tony Adams6 min readMay 2026

Here’s the uncomfortable thing about federal AI leadership in 2026: almost everyone now has the things that used to signal it. Every CFO Act agency has an AI strategy, a Chief AI Officer (CAIO), an AI governance board, and a public use-case inventory — because federal policy now requires all of them. Governance is the floor. So if your AI partner’s pitch is “we’ll help you comply with M-25-21,” they’re describing the price of admission, not a reason to pick them. The differentiator moved. In 2026, AI leadership is the discipline of shipping bounded, accountable, governed software that survives contact with a real workflow — and the maturity to know what not to build. This is the capstone of the VA AI modernization cluster: the point of view that ties the rest of it together.

What actually changed in federal AI policy — and what didn’t?

It’s worth being precise and even-handed here, because the policy shift is easy to over-read. In January 2025, Executive Order 14179 revoked the prior administration’s EO 14110; in April 2025, OMB issued M-25-21 (“Accelerating Federal Use of AI through Innovation, Governance, and Public Trust”) and M-25-22 (on AI acquisition), rescinding the earlier M-24-10 and M-24-18; and in July 2025 the White House released “Winning the Race: America’s AI Action Plan.” What changed was posture and mechanics: a more explicitly pro-innovation tone, a Buy-American preference in procurement, and a consolidation of the old “rights-impacting” and “safety-impacting” categories into a single “high-impact AI” tier — which still carries mandatory minimum practices like pre-deployment testing, impact assessments, ongoing monitoring, human oversight, and the ability to pause or discontinue a non-compliant system. What stayed is more striking than what changed: every covered agency still must have a Chief AI Officer, an AI governance board, a public AI strategy, and an annual use-case inventory. The honest read, across the EO 13960 → M-24-10 → M-25-21 arc and both administrations, is that the governance architecture is recognizably continuous. That’s the point. When the scaffolding is universal, having it is not leadership.

every agency

By policy in 2026 every CFO Act agency must have a CAIO, an AI governance board, a public strategy, and an annual use-case inventory — governance is the floor, not the differentiator— OMB M-25-21 / July 2025 AI Action Plan

If governance is universal, where’s the bottleneck?

In delivery — and the evidence is consistent. The GAO has documented federal AI use cases nearly doubling in a year (from 571 to 1,110 across selected agencies) while finding that agencies “had to figure out how to acquire AI on their own,” with none of the four it studied requiring systematic capture of acquisition lessons learned. Industry research tells the same story from the other side: ICF’s 2025 government survey found that while 41% of agencies are running AI pilots, only about 8% have reached scaled, measurable deployment, and MIT’s widely-cited “GenAI Divide” study reported that roughly 95% of enterprise AI pilots delivered no measurable bottom-line impact — failing not on model quality but on the “messy middle” of data, integration, and change management. The VA’s own inventory makes it concrete and a little bracing: of 227 use cases in 2024, 72 were marked retired in the 2025 inventory. That ruthless deprecation is a healthy sign — it’s what an organization that’s actually trying to ship looks like. The lesson for anyone claiming to lead: the model is rarely the constraint. Working software is.

What does responsible AI leadership actually look like in practice?

Six concrete things — none of which fit on a strategy slide.

Honest scoping. Refuse to call something “AI” when a deterministic rules-engine is the right tool, and refuse to ship anything without a defined kill-criterion.
Human-in-the-loop by default. The ambient scribe drafts the note; the clinician edits and signs it. Automated decision support assists the claims processor; it does not deny the claim. REACH VET flags risk; a human does the outreach. The principle is constant: the human stays in control, and the consequential decision stays inside the VA.
Governance documentation as a first-class artifact — impact assessments, monitoring plans, and discontinuation criteria are how trust is built and audited, covered in the M-25-21 spoke.
Bounded scope. A 90-day fixed-scope build with a real “scale, re-scope, or kill” decision at day 90 beats a 24-month strategy engagement on every dimension that matters to the mission.
Knowing what not to build. The clearest test of AI leadership in 2026 is the maturity to say: this is a sensitive domain — suicide prevention, crisis triage, benefits and clinical decisions — and the bar is augment a trained human, never replace one. The REACH VET evidence is the case study: associated with measurable process improvements and a modest reduction in documented suicide attempts in the 2021 evaluation, but without a demonstrated statistically significant reduction in suicide deaths, and with later analyses flagging low predictive precision. Honest leaders state that evidence plainly rather than overselling it.
Equity and bias auditing as leadership work. Reporting on REACH VET’s earlier model found it weighted some demographic combinations and omitted risk factors disproportionately affecting women veterans — the kind of finding that should inform every high-impact deployment, not surface in a post-mortem.

Why does this favor small, technical, veteran-owned firms?

Because delivery culture is the part of AI work that strategy decks can’t pre-cook — and because the structural tailwinds are real. The FY2024 NDAA raised the governmentwide SDVOSB goal from 3% to 5% and ended self-certification for goal credit; SBA cleared its 2,700-case VetCert backlog in November 2025 and cut average processing from 81 days to 12; and the VA’s Veterans First priority (38 U.S.C. 8127) puts SDVOSBs at the top of its set-aside order — the VA awarded about 23.6% of its prime dollars to SDVOSBs in FY2024, versus roughly 5% governmentwide. And the AI work is genuinely flowing to veteran-owned firms: Rise8 (an SDVOSB) and Thoughtworks Federal were awarded the contract in March 2026 to scale the VA’s ambient scribe from a 10-site pilot to more than 130 medical centers, and the Oddcore SDVOSB joint venture won a ~$15M VA.gov AI chatbot task on SPRUCE. The honest caveat — the one running through this whole cluster — is that small firms rarely prime the megaprograms; they win by priming bounded, accountable work and subbing on the capability-heavy task orders under T4NG2 and other primes. Leadership at that scale isn’t measured in ceiling value. It’s measured in what reaches production.

Myth versus reality: the 2026 scorecard

// The old AI-leadership story vs the 2026 reality — what's table stakes and what's actually hard
The old story (~2023)	Table stakes in 2026	The real bottleneck in 2026
“We have an AI strategy.”	Every CFO Act agency has one.	Bounded, accountable delivery against it.
“We have a Chief AI Officer.”	Every covered agency must.	A culture where the CAIO can actually pause or kill a non-compliant system.
“We have a use-case inventory.”	Mandatory and public.	Ruthless deprecation — like the VA retiring 72 use cases in a year.
“We do governance.”	M-25-21 minimums are required.	Operationalizing them in real software, in real workflows.
“We use generative AI.”	Most agencies have a GPT (VA GPT: 120,000+ users).	Grounding it in authoritative data without hallucination harm.
“We do human-in-the-loop.”	Stated everywhere.	Designing the UX and decision rights so the human actually is.
“We’re FedRAMP-compliant.”	Moderate is the floor.	Choosing the right boundary and the right edge for the workload.

Frequently asked

Isn't governance still a differentiator?

No — it's mandatory for every agency partner now. It's necessary, not sufficient. Ask a vendor what they've shipped to production, with named users and named failure modes.

What's the single best signal of AI leadership in a vendor?

Bounded scope with a kill-criterion. A partner who'll define, at day 90, whether the thing goes to production or gets killed is a partner who's actually trying to ship.

Where does the edge / Cloudflare bet fit?

Serverless inference and retrieval close to users, inside a recognized federal boundary, with policy enforced at the gateway. The realistic caveat: Cloudflare for Government is FedRAMP Moderate today, with FedRAMP High and its AI stack in process for 2026 — so the production posture now is Moderate plus careful design.

Do I need CMMC for VA AI work?

No — CMMC is DoD-only and doesn't apply to VA work.

Working with Truvisory

A word on who’s saying this. Truvisory is a brand-new SDVOSB — solo technical founder, combat veteran, Denver — SBA VetCert-certified.

A firm that new can’t credibly claim to lead the federal AI conversation on scale or decades of past performance, and we won’t pretend to. What a small technical firm can lead on is point of view, scoping discipline, and the practiced habit of shipping: working software, not strategy decks; fixed 90-day scopes; Cloudflare-native; FedRAMP-aware (not CMMC); bounded to use cases where the human stays in control. That ethos is itself the argument. The VA’s AI estate in 2026 doesn’t need more strategy — it has a strategy, a CAIO, an inventory, and 367 use cases. It needs disciplined delivery against the strategy that already exists. If that’s the kind of partner you’re looking for — to sub on a task order or ship one bounded capability — that’s the work we do. And if you want the rest of the map, start at the pillar: the compliance trilogy, the capability spokes, the vehicles and teaming, and the BD how-tos — the capability statement, forecasting, events, and the buyer map.

Tony Adams is the founder of Truvisory®. He builds Cloudflare-native AI systems for federal and commercial clients. SBA-verified SDVOSB and VOSB, SAM.gov-registered.

AI Leadership in 2026: Governance Is Table Stakes, Delivery Is the Job

What actually changed in federal AI policy — and what didn’t?

If governance is universal, where’s the bottleneck?

What does responsible AI leadership actually look like in practice?

Why does this favor small, technical, veteran-owned firms?

Myth versus reality: the 2026 scorecard

Frequently asked

Working with Truvisory

More in this series

VA AI Modernization for SDVOSBs: The $10.2B Procurement Ground Truth

How to Forecast VA AI Work Before the RFP Drops

Contact-Center AI at the VA: The Honest SDVOSB Role

VA EHRM (Oracle Health): The Honest SDVOSB Subcontracting Map

NVSBE Is Gone: How a New SDVOSB Works VA Events in 2026

Polaris for VA AI Work: The One SDVOSB GWAC With a Door Still Open

REACH VET: The Honest SDVOSB Role Near the VA's Suicide-Risk Model

Who Actually Buys VA AI: TAC vs. SAC for SDVOSBs

Teaming for VA AI: How an SDVOSB Gets Onto T4NG2 and SPRUCE Work

VA AI Tech Sprint Follow-Ons: How an SDVOSB Turns a Demo Into a VA Contract

VA ATO, VAEC, and FedRAMP: How AI Actually Ships on VA Infrastructure

The SDVOSB Capability Statement That Actually Wins VA AI Work

VA Claims Automation and ADS: What AI Can and Can't Do

VA Compliance: Why FedRAMP-Aware Is Right and CMMC Doesn't Apply

VA Document Automation with AI: The SDVOSB Capability Play

VA M-25-21 and High-Impact AI: The Governance Layer Vendors Keep Missing

VA RAG Policy Assistants: Grounded, Cited Answers — Not a Chatbot

VA SDVOSB Sole-Source for AI: The $5M Veterans First Fast Lane

VA SPRUCE for SDVOSBs: The $2.4B 100% Set-Aside Vehicle for VA Digital and AI Work

VA T4NG2 for SDVOSBs: How to Win AI and Automation Work on the $60.7B Vehicle

VA Veterans First and the Rule of Two: Why the VA Buys From SDVOSBs First

VETS 2 for VA AI Work: A Closed GWAC You Team Into, Not Onto

One email a month. Not a vendor blog.