The most striking number this week comes from a Kyndryl survey of 1,100 leaders across eight countries: more than 80% expect autonomous AI agents to make decisions with material business impact within a year, two-thirds have already given AI read and write access to core systems, and 57% say AI is broadly embedded in core processes. Only 25% say they fully trust those systems. That is the autonomy–assurance gap stated almost literally — agents have been handed the keys faster than anyone has built confidence they will use them well.
The security picture under that deployment curve is mixed in an instructive way. VentureBeat, citing CrowdStrike's 2026 Global Threat Report and OWASP's latest LLM rankings, documents an 89% year-over-year increase in AI-enabled attack volume, with prompt injection now the top-ranked LLM vulnerability for a second consecutive edition and zero-click exploits like EchoLeak landing CVSS 9.3. Customer chatbots, RAG pipelines, ticketing and HR automation are all named as live attack surfaces. Against that, Simon Willison's writeup of a public challenge — roughly 2,000 participants sending around 6,000 adversarial emails to a frontier assistant for about $500 in tokens — recorded zero successful prompt-injection breaches, suggesting model-level defenses are meaningfully harder to break than they were. Willison himself is careful: a clean result on a public challenge is not a security audit.
OpenAI's own report on how agents are transforming work, while useful as a directional briefing, is a vendor document and does not close the assurance side of the gap. Taken together, the evidence points the same way: agents are being wired into operations on the assumption that frontier-model defenses will hold and that governance can catch up later. For the executives accountable when an agent does something irreversible — a COO whose workflow it corrupts, a CTO whose credentials it leaks, a CEO who has to explain it — the practical question this window raises is whether tiered authority, human-approval gates on high-impact actions, and ongoing monitoring exist in the same systems where read/write access has already been granted.