AI & ML in Security · Issue June 7, 2026

AI & ML in Security

June 7, 2026 · LLM red-teaming, agent identity, MCP, and the platforms reshaping enterprise AI security

At a glance

The defining AI-security signal of the last two weeks is fragility under multi-turn attack. Cisco’s threat-intel team ran roughly 37K attacks against 15 frontier models and pushed multi-turn jailbreak success as high as 88% on Grok 4.1 Fast — widening the gap between vendor safety benchmarks and observed resilience. Microsoft open-sourced RAMPART and Clarity for agent red-teaming, and Hadrian’s OpenHack packaged multi-agent vuln-research harnesses for the defender side of the same coin.

In parallel, an agent-security framework is taking shape. Microsoft articulated platform-level capabilities (prompt-injection resistance, agent identity, runtime sandboxing, model supply-chain controls) and shipped Windows 365 for Agents, which gives AI agents enterprise-IAM-bound cloud PCs. NSA’s CSI on Model Context Protocol security gave the agent-protocol layer its first national-authority guidance, while Adversa’s May roundup catalogued the wider MCP research ecosystem. Only 11% of production agents clear a baseline security bar — the starting point for everything that follows.

Vendor platforms are converging around agent operations: Netskope AgentSkope, Google AI Threat Defense, Tamnoon TAMI AI Skills, and Okta’s vendor-neutral agent identity push. Microsoft is reportedly bundling Copilot into a super app. And on the offensive side, AI is now both target and weapon — reshaping vuln research, fraud, synthetic media, and the economics of agent runtimes that token-discipline and observability work now have to tame.

Topic map — vendors, frameworks, protocols, and how they cluster

Every major entity in this issue’s 24 articles plotted across the four themes we’re tracking this cycle — LLM red-teaming, agent security frameworks & identity, vendor agent platforms, and AI ops/evals/economics.

Topic map of AI & ML in Security issue June 7, 2026

Article index

LLM red-teaming & defensive testing

Cisco’s multi-turn jailbreak data, Microsoft’s RAMPART/Clarity release, Hadrian’s OpenHack, and the spam-flood problem now hitting maintainers as AI-generated vuln reports drown legitimate triage.

Article

Source

Published

Frontier AI models collapse under multi-turn attacks, Cisco finds

Help Net Security

May 28, 2026

OpenHack: Open-source AI-powered vulnerability research

Help Net Security

May 25, 2026

AI is drowning software maintainers in junk security reports

Help Net Security

May 18, 2026

Macro Evals for Agentic Systems

OpenAI Cookbook

May 2026

Agent security frameworks & identity

RAMPART/Clarity, Windows 365 for Agents, the NSA’s MCP CSI, the MCP ecosystem roundup, the systems-vs-models reframe, and identity plays from Okta and Microsoft’s broader agent-security platform pitch.

Article

Source

Published

Microsoft Open-Sources RAMPART and Clarity to Secure AI Agents

The Hacker News

May 21, 2026

Microsoft’s new cloud PCs place AI agents under enterprise controls

Help Net Security

May 28, 2026

Microsoft responds to security challenges facing code, AI agents, and models

Help Net Security

June 3, 2026

NSA CSI: Securing Model Context Protocol Implementations

NSA

May 2026

Top MCP security resources — May 2026

Adversa AI

May 2026

Okta pushes vendor-neutral identity governance for AI agents

BiometricUpdate

May 2026

AI security needs a shift from models to systems, researchers argue

CSO Online

May 2026

The AI agent bottleneck isn’t model performance — it’s permissions

VentureBeat

May 2026

Vendor agent platforms & integrations

Google AI Threat Defense, Netskope AgentSkope, Tamnoon TAMI AI Skills, Microsoft’s Copilot “super app,” Gemini-app integrations, and DNS-AID’s push to make agents discoverable.

Article

Source

Published

Google AI Threat Defense targets attackers using AI to find flaws faster

Help Net Security

May 27, 2026

Netskope Revolutionizes Security and Network Operations with AgentSkope

GlobeNewswire

May 5, 2026

Tamnoon introduces skill-based AI orchestration for autonomous cloud defense

Help Net Security

May 26, 2026

Coming Soon: Gemini to Add Adobe, Canva, and CapCut for AI Editing

eWeek

May 2026

Microsoft is building a super app combining coding, chat, and other Copilot AI tools

Fortune

May 29, 2026

DNS-AID will make AI agents easier to discover, says Linux Foundation

InfoWorld

May 2026

Reactor, real-time AI video startup founded by ex-Apple engineers, raises $59M

Variety

May 2026

AI ops, evals & economics

Production-agent security pass rates, agent-skill design pitfalls, domain-tuned LLMs, observability for probabilistic systems, and the token-discipline pressure that Opus 4.8’s capabilities have created.

Article

Source

Published

Only 11% of production agents pass the AI agent security bar

Help Net Security

June 3, 2026

Agent Skills Work, but Most Teams Are Building Them Wrong

O’Reilly Radar

May 2026

21 LLMs tuned for special domains

InfoWorld

May 2026

Opus 4.8 Made Claude Smarter. Token Discipline Got Urgent.

The New Stack

May 2026

Debugging the undebuggable: building observability into probabilistic AI systems

The New Stack

May 2026

Detailed write-ups

1. Frontier AI models collapse under multi-turn attacks, Cisco finds

Help Net Security · May 28, 2026

Cisco’s AI threat-intelligence team tested 15 frontier models across roughly 30,000 single-turn and 7,000 multi-turn attacks and saw single-turn safety hold up much better than multi-turn — with multi-turn success climbing to 88% against Grok 4.1 Fast. The work formalizes the divergence between vendor safety benchmarks and observed jailbreak resilience: single-turn evals consistently understate risk because real adversaries iterate. CISOs and AI red teams should treat single-turn safety scores as a floor rather than a ceiling and require multi-turn coverage in any model-acceptance pipeline.

AI & ML in Security

At a glance

Topic map — vendors, frameworks, protocols, and how they cluster

Article index

LLM red-teaming & defensive testing

Agent security frameworks & identity

Vendor agent platforms & integrations

AI ops, evals & economics

Detailed write-ups

1. Frontier AI models collapse under multi-turn attacks, Cisco finds

2. Microsoft Open-Sources RAMPART and Clarity to Secure AI Agents

3. Google AI Threat Defense targets attackers using AI to find flaws faster

4. OpenHack: Open-source AI-powered vulnerability research

5. Microsoft’s new cloud PCs place AI agents under enterprise controls

6. Only 11% of production agents pass the AI agent security bar

7. Microsoft responds to security challenges facing code, AI agents, and models

8. AI is drowning software maintainers in junk security reports

9. NSA CSI: Securing Model Context Protocol Implementations

10. Top MCP security resources — May 2026

11. Netskope Revolutionizes Security and Network Operations with AgentSkope

12. Agent Skills Work, but Most Teams Are Building Them Wrong

13. 21 LLMs tuned for special domains

14. Okta pushes vendor-neutral identity governance for AI agents

15. Tamnoon introduces skill-based AI orchestration for autonomous cloud defense

16. AI security needs a shift from models to systems, researchers argue

17. Coming Soon: Gemini to Add Adobe, Canva, and CapCut for AI Editing

18. The AI agent bottleneck isn’t model performance — it’s permissions

19. Opus 4.8 Made Claude Smarter. Token Discipline Got Urgent.

20. DNS-AID will make AI agents easier to discover, says Linux Foundation

21. Microsoft is building a super app combining coding, chat, and other Copilot AI tools

22. Debugging the undebuggable: building observability into probabilistic AI systems

23. Reactor, real-time AI video startup founded by ex-Apple engineers, raises $59M

24. Macro Evals for Agentic Systems

On our watch list