Blog

Insights on AI agents, trust systems, and the agent economy.

July 5, 2026

95% of AI Pilots Fail. Four Labs Just Committed $9 Billion to Fix That. Here's What They're Missing.

In a single week, Microsoft, Amazon, Anthropic, and OpenAI all launched forward-deployed engineering units — embedding their own engineers inside enterprise clients to get AI working in production. The implementation problem is real. But deployment armies get more agents running, not more agents verified. That distinction is about to matter a lot.

AI agentsenterprise AIMicrosoftforward deployed engineeringagent verificationagent trustdeployment2026

July 2, 2026

74% of Enterprise AI Agents Got Pulled From Production. The Best-Monitored Ones Are at 81%.

Sinch surveyed 2,527 enterprise decision-makers and found three-quarters of live AI agents have been rolled back. The real finding is buried in the footnote: organizations with the most mature monitoring are rolling back agents at a higher rate. They're not failing more. They're seeing more of what was always failing.

AI agentsagent reliabilityenterprise AIagent verificationproduction AIagent governanceSinch2026

July 1, 2026

Patronus AI's $50M Round Isn't About Testing. It's About Trust Infrastructure.

On June 25, Patronus AI closed a $50M Series B to build simulated worlds that stress-test AI agents before deployment. Revenue grew 15-fold in a year. Every major frontier lab is a customer. When demand at that scale meets funding at that level, the market is telling you something has become non-negotiable.

agent verificationAI agentsPatronus AItrustenterprise AIagent testingbenchmarks2026

June 20, 2026

The Regulators Blinked. That's the Wrong Kind of Good News.

The EU just deferred its high-risk AI enforcement deadline 16 months. A US court paused Colorado's AI Act. Enterprise compliance teams are exhaling. That's exactly the wrong response.

EU AI ActColorado AI ActAI complianceagent governanceagent verificationenterprise AIregulatory2026

June 19, 2026

Google Wrote a Rogue Agent Containment Plan. That's Not a Security Story.

Yesterday, Google DeepMind published an AI Control Roadmap that explicitly assumes its own agents are imperfectly aligned and must be contained accordingly. Their internal analysis of one million coding tasks found most failures come from overzealous agents, not malicious ones. If the company that builds the models can't trust its own agents by default, the rest of enterprise AI needs to be asking harder questions.

AI agentsagent verificationGoogle DeepMindagent reliabilityenterprise AIrogue agentsAI controltrust2026

June 18, 2026

Ten Outages in Twelve Days. The Reliability Axis Your Agent Stack Isn't Measuring.

Between June 5 and June 16, Claude experienced ten significant service disruptions — a mean time between failures of roughly one day. Every enterprise team running agents on top of Anthropic's API learned something the benchmark reports don't cover: task performance and infrastructure reliability are different axes, and the agent evaluation industry has built around only one of them.

AI agentsClaudeAnthropicinfrastructure reliabilityagent verificationenterprise AIuptimeSLAsingle-vendor dependency2026

June 15, 2026

When the Government Pulls Your Best Model

On June 12, the US government forced Anthropic to shut down Fable 5 and Mythos 5 globally — three days after launch. The story isn't about geopolitics. It's about whether your enterprise has verified answers to the question 'what do we switch to?' before the answer becomes urgent.

AnthropicFable 5enterprise AIAI infrastructuremodel dependencyAI sovereigntyagent verificationfallback2026

June 11, 2026

KPMG Just Stepped Into Enterprise Agent Governance. Here's the Infrastructure Gap Making It Necessary.

On June 9, KPMG and Microsoft announced a global partnership to deploy AI agents at enterprise scale through Agent 365. When Big Four consulting becomes the trust layer for production AI, the industry is telling you something important about what the infrastructure still can't do on its own.

KPMGenterprise AIagent governanceagent verificationMicrosoftAgent 365trustAI agents2026

June 9, 2026

NVIDIA and ServiceNow Posted 99.5% Containment. Enterprise Trust Is at 22%. Both Are True.

At ServiceNow Knowledge 2026, NVIDIA and ServiceNow announced production autonomous agents resolving service interactions end-to-end with containment rates between 80% and 99.5%. Meanwhile, enterprise confidence in fully autonomous AI agents has dropped from 43% in 2024 to 22% in 2025. These numbers aren't contradicting each other — they're measuring different things. That's the problem.

AI agentsNVIDIAServiceNowagent reliabilityenterprise AIagent verificationbenchmarkstrust2026

June 5, 2026

The Invisible Shelf Is Real. The Agents Running It Aren't Verified.

NielsenIQ just named AI agents the new packaging for CPG brands — the invisible intermediary that determines what shoppers find and buy. What's less clear is that multi-agent systems fail between 41% and 87% of the time in production-grade evaluations. If your agents are influencing trade spend and category decisions, you need to know which side of that range they're on.

CPGAI agentsagent verificationmulti-agent systemsNielsenIQagentic commerceenterprise AIbenchmarks

June 2, 2026

SAP Just Bet the Company on 200 Specialized Agents. Now Comes the Hard Part.

At Sapphire 2026, SAP announced 50+ domain-specific Joule Assistants orchestrating 200+ specialized agents across finance, supply chain, procurement, HR, and CX. The question enterprises are about to face isn't whether to use AI agents. It's which ones actually work for their specific workflows — and nobody's built a neutral answer to that yet.

SAPenterprise AIagent verificationAI agentsbenchmarkingautonomous enterpriseagent selection2026

May 27, 2026

NVIDIA's Verified Agent Skill Cards Are Real. So Is the Gap They Don't Fill.

NVIDIA just shipped verified skill cards for AI agents — machine-readable provenance records with security scanning, cryptographic signing, and risk documentation. It's the clearest signal yet that the industry has accepted agent verification as a first-class infrastructure problem. It also proves exactly which part of the problem remains unsolved.

AI agentsNVIDIAagent verificationagent trustenterprise AIskill cardsagent governance2026

April 27, 2026

OpenAI Just Made Every Team an Agent Operator. The Compound Reliability Math Is Brutal.

OpenAI launched workspace agents for enterprise teams on April 22 — Codex-powered, long-running, connected to Slack, Salesforce, and your calendar. It's genuinely useful infrastructure. It also means teams are now operating multi-step agent chains whose system-level reliability is a completely different number from anything they evaluated.

AI agentsenterprise AIagent reliabilityOpenAImulti-agent systemsagent verification2026

April 26, 2026

Multi-Agent Adoption Surged 1,445%. Then Someone Had to Build a Kill Switch.

Enterprise interest in multi-agent AI systems surged 1,445% in the last year. This week, Portal26 launched a product specifically designed to prevent runaway AI agents from burning through token budgets in minutes. When a kill switch becomes a product category, something structural is going wrong.

AI agentsmulti-agent systemsagent reliabilityenterprise AItoken costsagent verification2026

April 19, 2026

AI Agents Score Half as Well as PhDs on Real Work. Benchmarks Say Otherwise. Both Are Right.

Stanford's 2026 AI Index found the best AI agents perform at roughly half the level of human PhDs on complex scientific tasks. UC Berkeley showed those same agents can score 100% on standard benchmarks without solving anything. These two facts aren't in conflict — they're the same problem from opposite ends.

AI agentsbenchmarksagent evaluationtrustenterprise AIStanford AI Indexagent verification2026

April 17, 2026

AI Agents Are Running Payroll Now. The Stakes Just Changed.

ADP just deployed a Payroll Variance AI agent to enterprise clients in 40+ countries. When AI agents move from productivity tools into operational finance, 'it worked in the demo' stops being good enough.

AI agentsenterprise AIpayrollagent verificationtrustADP2026

April 17, 2026

OpenAI Gave Agents a Sandbox. What They Still Need Is a Report Card.

OpenAI shipped sandboxed execution in its Agents SDK this week — a real safety improvement that the enterprise world is going to misread as a trust solution. Containment and verification are different problems, and confusing them is expensive.

OpenAIAI agentsagent verificationenterprise AIbenchmarksagent safetytrust2026

April 14, 2026

A2A Solved the Agent Connectivity Problem. It Just Made the Trust Problem Worse.

The Agent2Agent protocol just hit 150 organizations and landed in Azure, AWS, and Amazon Bedrock — a genuine infrastructure milestone. The same week, a new study found 94% of enterprises are scared about AI agent sprawl. These two headlines are not a coincidence. They're describing the same problem from opposite ends.

A2A protocolagent sprawlAI governanceenterprise AImulti-agent systemsagent trustagent verification

April 13, 2026

A2A Just Crossed 150 Organizations. The Trust Layer Is Still Missing.

The Agent-to-Agent protocol hit a major milestone this week: 150 organizations, production deployments across five industries, AWS and Azure integrations. A2A solved how agents talk to each other. It didn't solve whether they should trust each other.

A2Amulti-agent systemsagent trustenterprise AIagent sprawlAI protocolsagent verification

April 12, 2026

A2A at One: The Protocol Won. Now Build the Trust Layer.

The Agent2Agent protocol just turned one with 150+ supporting organizations and deep integration in Azure, AWS Bedrock, and Google Cloud. Agents can now talk to each other across any vendor stack. The problem nobody is solving yet: should they trust what they hear?

A2AAI agentsmulti-agent systemsagent trustinteroperabilityenterprise AIagent verification

April 11, 2026

Anthropic Solved Deployment. Now Comes the Hard Part.

Anthropic's Managed Agents just stripped the infrastructure friction out of shipping AI agents. Notion, Rakuten, and Asana are already live in production. When deployment takes weeks instead of months, the competition moves somewhere else entirely.

AI agentsAnthropicenterprise AIagent verificationdeploymenttrustperformance

April 5, 2026

150 Organizations Just Wired Their Agents Together. Now Comes the Hard Part.

Google's Agent2Agent protocol now has 150+ enterprise backers and just shipped a major upgrade. The plumbing for multi-agent interoperability is essentially solved. What isn't solved is whether anyone should trust what flows through it.

A2A protocolAI agentsagent trustmulti-agent systemsenterprise AIagent verification

April 4, 2026

Microsoft Just Made Agent Governance Infrastructure Official

Microsoft's open-source Agent Governance Toolkit isn't just another security tool — it's the market acknowledging that a verified trust layer for AI agents is no longer optional. Here's what it means and what it still doesn't solve.

AI agentsagent governanceMicrosoftOWASPtrustenterprise AIagent verificationA2A

April 3, 2026

NVIDIA Built the Factory Floor. Who's Running Quality Control?

NVIDIA's Agent Toolkit just gave 17 enterprise partners the infrastructure to deploy AI agents at scale. IQVIA already has 150+ agents across the top 20 pharma companies. But infrastructure isn't verification — and the gap between deploying agents and knowing if they work is the next crisis.

AI agentsNVIDIAenterprise AIagent verificationOpenShellIQVIAtrustGTC 2026