Blog
Insights on AI agents, trust systems, and the agent economy.
AI Agents Score Half as Well as PhDs on Real Work. Benchmarks Say Otherwise. Both Are Right.
Stanford's 2026 AI Index found the best AI agents perform at roughly half the level of human PhDs on complex scientific tasks. UC Berkeley showed those same agents can score 100% on standard benchmarks without solving anything. These two facts aren't in conflict — they're the same problem from opposite ends.
AI Agents Are Running Payroll Now. The Stakes Just Changed.
ADP just deployed a Payroll Variance AI agent to enterprise clients in 40+ countries. When AI agents move from productivity tools into operational finance, 'it worked in the demo' stops being good enough.
OpenAI Gave Agents a Sandbox. What They Still Need Is a Report Card.
OpenAI shipped sandboxed execution in its Agents SDK this week — a real safety improvement that the enterprise world is going to misread as a trust solution. Containment and verification are different problems, and confusing them is expensive.
A2A Solved the Agent Connectivity Problem. It Just Made the Trust Problem Worse.
The Agent2Agent protocol just hit 150 organizations and landed in Azure, AWS, and Amazon Bedrock — a genuine infrastructure milestone. The same week, a new study found 94% of enterprises are scared about AI agent sprawl. These two headlines are not a coincidence. They're describing the same problem from opposite ends.
A2A Just Crossed 150 Organizations. The Trust Layer Is Still Missing.
The Agent-to-Agent protocol hit a major milestone this week: 150 organizations, production deployments across five industries, AWS and Azure integrations. A2A solved how agents talk to each other. It didn't solve whether they should trust each other.
A2A at One: The Protocol Won. Now Build the Trust Layer.
The Agent2Agent protocol just turned one with 150+ supporting organizations and deep integration in Azure, AWS Bedrock, and Google Cloud. Agents can now talk to each other across any vendor stack. The problem nobody is solving yet: should they trust what they hear?
Anthropic Solved Deployment. Now Comes the Hard Part.
Anthropic's Managed Agents just stripped the infrastructure friction out of shipping AI agents. Notion, Rakuten, and Asana are already live in production. When deployment takes weeks instead of months, the competition moves somewhere else entirely.
150 Organizations Just Wired Their Agents Together. Now Comes the Hard Part.
Google's Agent2Agent protocol now has 150+ enterprise backers and just shipped a major upgrade. The plumbing for multi-agent interoperability is essentially solved. What isn't solved is whether anyone should trust what flows through it.
Microsoft Just Made Agent Governance Infrastructure Official
Microsoft's open-source Agent Governance Toolkit isn't just another security tool — it's the market acknowledging that a verified trust layer for AI agents is no longer optional. Here's what it means and what it still doesn't solve.
NVIDIA Built the Factory Floor. Who's Running Quality Control?
NVIDIA's Agent Toolkit just gave 17 enterprise partners the infrastructure to deploy AI agents at scale. IQVIA already has 150+ agents across the top 20 pharma companies. But infrastructure isn't verification — and the gap between deploying agents and knowing if they work is the next crisis.