Skip to content

CyberSecurity Institute

Security News Curated from across the world

Menu
Menu

AI-ML Security Brief — May 24, 2026

Posted on May 24, 2026May 25, 2026 by admini
AI-ML Bulletin · Issue May 24, 2026
The AI-ML Brief

AI in security · AI for security · agentic AI in operations

This week at a glance

The week the agent-platform wars went enterprise. At Code with Claude London, Anthropic shipped self-hosted sandboxes and MCP tunnels for Claude Managed Agents — tool execution and MCP traffic now stay inside customer networks while orchestration stays at Anthropic. Anthropic also published the first quantified Project Glasswing results: ~50 partner organizations (AWS, Apple, CrowdStrike, Google, JPMorgan, NVIDIA, Palo Alto, Cloudflare, Mozilla and more) using Claude Mythos Preview have identified 10,000+ high/critical-severity vulnerabilities.

Microsoft AI Red Team open-sourced RAMPART and Clarity — a pytest-native adversarial test framework on top of PyRIT plus a structured design-review tool — pulling agent safety testing into CI. Google I/O 2026 launched Gemini 3.5 and 3.5 Flash with strengthened cyber/CBRN safeguards, expanded CodeMender into the Gemini Enterprise Agent Platform, and rolled out DLP-enforced Agent Gateway, identity and observability across the stack. And the NSA AI Security Center dropped a 17-page CSI on Model Context Protocol security design — filtering proxies, DLP, sandboxing, message integrity, output filtering, local MCP scans — the first primary-source MCP guidance from a U.S. signals-intelligence agency.

Underneath the platform launches, the agent-governance and runtime-control category exploded. Babel Street put tradecraft-trained agents into threat-intel investigations. ASAPP added Continuous Red Teaming on Promptfoo aligned to OWASP LLM Top 10 and NIST AI RMF. CTERA shipped InsightAI for agentic analysis of unstructured data. LangChain launched LangSmith Engine to close the agent debug loop. NanoCo AI raised $12M (turning down a $20M buyout) to scale NanoClaw — an open-source agent harness with a ~500-line auditable core in MicroVM Docker sandboxes. Help Net Security framed the underlying threat model: LLMs in operational roles as a confused-deputy problem, with prompt injection through tickets/wikis, retrieval poisoning and telemetry manipulation as the dominant agentic-AI attack vectors.

The bug-bounty economy buckled under AI-generated submissions: HackerOne slashed Internet Bug Bounty payouts 70–85% as valid-report rates fell below 5%, while Cisco Talos’s own test of LLMs for IR-report drafting cut drafting time 50% but produced cross-case data contamination, hallucinated IOCs and inconsistent style. And on the M&A side, Cyera acquired five-month-old Genie Security for ~$50M — Genie’s endpoint tech catches sensitive-data leaks via human action or via generative AI tools such as Claude, formalizing the GenAI-DLP category as its own vendor segment.

Strategic context for the 12-month roadmap: Dan Shipper’s 12 predictions for the AI work era (super-agent-in-Slack patterns, users-bring-own-tokens, forward-deployed engineers as the new essential role) and CIO’s 7 signs your data isn’t ready for AI both land squarely on the security architect’s desk — governance gaps become permissions gaps, shadow BI becomes shadow data exfil. With 25 articles across NSA primary source, vendor research, agent governance, Glasswing, Google I/O, Anthropic, bug-bounty stress, M&A, and strategic context, this is the issue to share with the AI security committee.

Entity graph — agents, harnesses, frontier labs, and how they cross-correlate

Every named entity extracted from this week’s 25 articles, with edges showing the agent-platform / governance / capital web.

Topic map for ai ml

Article index

NSA — MCP security design (primary source)

Article Source Published
NSA Releases Security Design Considerations for AI-Driven Automation Leveraging the Model Context Protocol NSA May 20, 2026

Microsoft AI red team — RAMPART and Clarity open-sourced

Article Source Published
Introducing RAMPART and Clarity: Open source tools to bring safety into Agent development workflow Microsoft Security Blog May 20, 2026
Microsoft open-sources tools for designing and testing AI agents (Clarity & RAMPART) Help Net Security May 21, 2026
AI red teaming agents change how LLMs get tested Help Net Security May 21, 2026

Anthropic — Code with Claude London, sandboxes & MCP tunnels

Article Source Published
New in Claude Managed Agents: self-hosted sandboxes and MCP tunnels Anthropic May 19, 2026
Anthropic Introduces MCP Tunnels for Private Agent Access to Internal Systems InfoQ May 19, 2026
Anthropic enhances Claude Managed Agents with two new privacy and security features 9to5Mac May 19, 2026

Anthropic Glasswing — first quantified results

Article Source Published
Project Glasswing: An initial update (10,000+ critical vulnerabilities) Anthropic May 22, 2026
Anthropic Moves Closer to Public Claude Mythos Release: 10,000 Critical Bugs Found First TechTimes May 24, 2026

Google I/O 2026 — Gemini 3.5 and the agentic Gemini era

Article Source Published
I/O 2026: Welcome to the agentic Gemini era Google (blog.google) May 19, 2026
Innovations from Google I/O 26 on Google Cloud Google Cloud Blog May 19, 2026

AI policy — postponed Trump AI cybersecurity EO

Article Source Published
Trump delays AI security executive order, saying language ‘could have been a blocker’ TechCrunch May 21, 2026

Agent governance, runtime control, and the confused-deputy problem

Article Source Published
When your AI assistant has the keys to production Help Net Security May 20, 2026
Babel Street targets AI-driven threats with new agentic investigation capabilities Help Net Security May 19, 2026
ASAPP expands adversarial testing for enterprise AI systems Help Net Security May 21, 2026
NanoClaw’s creators are turning the secure, open source AI agent harness into an enterprise ‘second brain’ VentureBeat May 20, 2026
CTERA brings AI insights and automation for unstructured data (InsightAI) Help Net Security May 21, 2026
LangSmith Engine closes the agent debugging loop automatically — but multi-model enterprises still need a neutral layer VentureBeat May 19, 2026
NanoClaw creator turns down $20M buyout offer, raises $12M seed instead TechCrunch May 20, 2026

Bug bounties buckle — the AI-slop crisis

Article Source Published
HackerOne takes an axe to its bug bounty rewards The Register May 21, 2026
Bug Bounty Schemes Buckle Under Flood of AI-Generated Junk Reports Technology.org May 18, 2026
Cisco used AI to write security incident reports, with mixed results The Register May 22, 2026

M&A — AI-era DLP becomes its own vendor segment

Article Source Published
Cyber unicorn Cyera acquires five-month-old startup Genie Security in $50 million deal CTech / Calcalistech May 20, 2026

Strategic context — AI work era, data readiness, monthly roundup

Article Source Published
The AI paradox: More automation, more humans, more work | Dan Shipper Lenny’s Newsletter May 24, 2026
7 signs your data isn’t ready for AI CIO May 19, 2026

Detailed write-ups

NSA publishes Model Context Protocol security design considerations (May 20)

The NSA AI Security Center released a 17-page Cybersecurity Information Sheet on security design considerations for AI-driven automation leveraging MCP. The CSI walks through filtering proxies, DLP, sandboxing, message integrity, output filtering, and local MCP scans — the first primary-source MCP guidance from a U.S. signals-intelligence agency. Read it next to Anthropic’s MCP tunnels announcement: the agencies and the protocol’s authors are landing on a compatible threat model where MCP traffic stays inside customer perimeter, with structured controls in the proxy and gateway layers.

Sources: NSA

Microsoft AI Red Team open-sources RAMPART and Clarity (May 20–21)

RAMPART is a pytest-native adversarial test framework built on top of PyRIT — teams write pytest cases that simulate prompt injection, data exfiltration, jailbreaks and tool-abuse against agents in CI. Clarity is a structured design-review tool that bakes threat-modeling into the agent SDLC before code is even written. Together they pull AI red-teaming out of one-off security engagements and into the developer inner loop. Help Net Security’s coverage adds the operator angle: agent-orchestrated red-teaming where autonomous agents pick attacks, run them, produce structured findings — compressing weeks of testing into hours.

Sources: Microsoft Security Blog · Help Net Security (RAMPART/Clarity) · Help Net Security (red-teaming agents)

Anthropic ships self-hosted sandboxes and MCP tunnels at Code with Claude London (May 19)

Announced at the London developer day: self-hosted sandboxes (public beta) and MCP tunnels (research preview) for Claude Managed Agents. The enterprise-perimeter angle is the key story — orchestration stays at Anthropic, but tool execution, credentials, and MCP traffic all stay inside the customer’s network. MCP tunnels in particular let agents reach private MCP servers without exposing them to the public internet. This is Anthropic answering the same concern NSA put in writing the next day: how do you connect an external frontier model to internal systems without giving up control of the data plane.

Sources: Anthropic · InfoQ · 9to5Mac

Project Glasswing — 10,000+ critical vulnerabilities found (May 22–24)

Anthropic’s first quantified update: ~50 partner organizations using Claude Mythos Preview have identified 10,000+ high/critical-severity vulnerabilities. Disclosed partner list includes AWS, Apple, CrowdStrike, Google, JPMorgan, NVIDIA, Palo Alto Networks, Cloudflare, and Mozilla. Anthropic’s stance: Mythos-class models get general release only with stronger safeguards in place — the partner program is now serving as both a defensive proof-of-value and a public test of whether frontier-tier vulnerability discovery can be operated responsibly at scale. Read alongside the OpenAI Daybreak launch from last week’s bulletin: every frontier lab is now shipping a defensive program with a high-profile partner network.

Sources: Anthropic · TechTimes

Google I/O 2026 — Gemini 3.5, CodeMender, and the agentic Gemini era (May 19)

Sundar Pichai’s keynote launched Gemini 3.5 and 3.5 Flash with strengthened cyber and CBRN safeguards and pulled CodeMender — Google’s autonomous vulnerability-remediation agent — into the Gemini Enterprise Agent Platform. The Cloud-side announcements added DLP-enforced Agent Gateway, identity and observability components, and managed sandboxing — the same enterprise-perimeter pattern Anthropic shipped at Code with Claude London. CodeMender external API access is new this week, which puts an autonomous OSS-patch-submitting agent inside reach of any enterprise running Gemini Enterprise.

Sources: Google · Google Cloud Blog

Trump AI cybersecurity executive order postponed (May 21)

The White House pulled a planned executive order on AI cybersecurity at the last minute, with the President telling reporters the language “could have been a blocker.” The withdrawn draft, per earlier Axios reporting, would have required AI labs to share covered frontier models with the federal government 90 days pre-release and given critical-infrastructure providers early access. The pause leaves the U.S. policy posture on frontier-model access for cyber defense undefined while NSA, Microsoft, Anthropic, and Google all ship their own controls-by-design. Track this one for re-introduction with softer language — the underlying signal is that federal early-access to frontier models is no longer hypothetical policy.

Sources: TechCrunch

When your AI assistant has the keys to production — the confused-deputy problem (May 19–21)

Help Net Security framed this week’s underlying threat model bluntly: LLMs in operational roles are a confused-deputy problem. The dominant agentic-AI attack vectors are prompt injection through tickets and wikis (the agent ingests adversary-controlled content as instructions), retrieval poisoning (the RAG index has been seeded with hostile content), and telemetry manipulation (the agent’s observation surface gets gamed). This week’s vendor announcements all map directly onto that threat model: Babel Street’s tradecraft-trained agents add transparent research plans for investigative workloads; ASAPP’s Continuous Red Teaming on Promptfoo runs OWASP LLM Top 10 / NIST AI RMF aligned probes against jailbreaks, many-shot attacks and system-override attempts; CTERA InsightAI brings agentic analysis to unstructured data with audit-log and file-activity hooks; LangSmith Engine closes the agent debug loop by detecting, diagnosing, drafting fixes for, and regression-testing agent failures.

Sources: Help Net Security (confused deputy) · Help Net Security (Babel Street) · Help Net Security (ASAPP) · Help Net Security (CTERA) · VentureBeat (LangSmith Engine)

NanoCo raises $12M for verifiable-sandbox agent harness (May 20)

NanoCo AI turned down a $20M buyout and instead raised $12M seed to scale NanoClaw — an open-source agent harness whose ~500-line auditable core runs in MicroVM Docker sandboxes. The thesis: verifiable sandboxing is the differentiator that wins the enterprise-agent procurement. The story bookends the agentic-infrastructure thread — on one end, Anthropic shipping a managed enterprise perimeter; on the other, an open-source project betting that an inspectable, minimal core is the only acceptable trust boundary for production agents. Worth watching alongside the broader agent-harness ecosystem (OpenClaw, EnterpriseClaw, etc.).

Sources: TechCrunch · VentureBeat

Bug-bounty economics buckle under AI slop (May 18–22)

HackerOne cut Internet Bug Bounty payouts 70–85% as AI-generated submissions overwhelm triage. Valid-report rates have fallen below 5% on some programs. The original economics — humans hunting bugs, programs paying for verified, novel finds — presumed adversary effort would be the rate-limiter; LLM-assisted automated discovery has broken that assumption. Cisco Talos’s own LLM-in-IR-reports test rhymes with the bug-bounty story: drafting time fell 50% but the output produced cross-case data contamination, hallucinated IOCs, and inconsistent style — useful for the first 80% and dangerous for the last 20%. The lesson for AI-ML security architects: AI-generated security work product needs structured validation, not just acceleration.

Sources: The Register (HackerOne) · Technology.org · The Register (Cisco Talos)

Cyera acquires Genie Security — AI-era DLP becomes its own segment (May 20)

Data-security unicorn Cyera ($9B valuation, ~1,500 employees, deployed in 20% of Fortune 500) acquired five-month-old, five-employee Genie Security for ~$50M. Genie’s endpoint tech detects, in real time, attempts to leak sensitive information whether through human action or via generative AI tools such as Claude. The Genie team joins Cyera’s enterprise DLP division. Founders Nadav Noy (ex-Unit 8200) and Noam Dotan (ex-Matzov, ex-Legit Security founding team) were backed by Mensch Capital, Dynamic Loop, and angels including Assaf Rappaport (Wiz CEO). This is Cyera’s fifth acquisition in a year (after Ryft $100M, Trail Security $162M, Otterize, Shape AI) and a marquee signal that endpoint controls catching GenAI prompt-paste exfil are now a discrete vendor segment — not a feature line in an existing DLP product.

Sources: CTech / Calcalistech

Strategic context for the 12-month AI security roadmap

Two non-vendor pieces to share with the AI security committee. Dan Shipper’s 12 predictions for the AI work era (super-agent-in-Slack patterns, users-bring-own-tokens, forward-deployed engineers as the new essential role, CLIs over, automation as a “lie”) set up several security-relevant design questions: how do you do agent IAM and identity for a Slack-resident super-agent; what’s the DLP and credential-surface story when users bring their own AI tokens into apps; what does “building software for humans and agents together” mean for your access-control model. CIO’s 7 signs your data isn’t ready for AI pulls the parallel security threads: governance gaps become permissions gaps (“AI may access content it was never intended to see”); shadow BI workarounds are shadow data exfil paths; data debt is the hidden driver of hallucination and bad-decision risk in production agents. If your agentic deployments are running ahead of your data foundation, this is your evidence file.

Sources: Lenny’s Newsletter · CIO

Calls to action for the next 7 days

  1. Adopt the NSA MCP CSI as your reference design. Map your current MCP deployments against filtering proxies, DLP, sandboxing, message integrity, output filtering, and local MCP scans — gap-list everything that doesn’t have a control owner today.
  2. Wire RAMPART into CI for at least one production agent. Start with pytest cases that simulate prompt injection through user-supplied tickets/wikis and a credential-exfil tool-abuse probe. Use Clarity for the next agent design review.
  3. Pilot self-hosted sandboxes and MCP tunnels if you run Claude Managed Agents in production — or the equivalent Gemini Enterprise Agent Gateway / DLP-enforced gateway pattern if you’re on Google. Move tool execution and MCP traffic inside the customer perimeter.
  4. Stand up an AI-era DLP review in light of the Cyera/Genie deal — specifically endpoint controls that catch GenAI prompt-paste exfil and outbound traffic to consumer AI tools. Treat it as a separate vendor category, not a feature line in your existing DLP.
  5. Update your AI-assisted security work-product policy. If your SOC or AppSec team is using LLMs for incident reports, triage notes, or bounty-report intake, build a validation step that catches cross-case data contamination and hallucinated IOCs (the Cisco Talos failure modes).
  6. Inventory agent-shaped attack surface against the confused-deputy model: enumerate every place an agent ingests adversary-influenceable content (tickets, emails, wiki pages, RAG indices, telemetry feeds), and apply the OWASP LLM Top 10 / NIST AI RMF checks (ASAPP-style Continuous Red Teaming is one off-the-shelf option).

The AI-ML Brief · a Newshunter publication

Weekly news items are from the previous seven days. Foundational reading is refreshed each week.

Unsubscribe · View in browser

Newsletter design, layout, and editorial curation © 2026 Security Radar LLC. All rights reserved.

Article titles and summaries are excerpted for review and commentary; all linked articles remain the copyright of their respective publishers and authors.

Recent Posts

  • The CISO Brief — June 7, 2026
  • DevSecOps Weekly — June 7, 2026
  • Agentic NetOps Weekly — June 7, 2026 (Cisco Live US 2026 Edition)
  • AI & ML in Security — June 7, 2026
  • Security Operations Weekly — June 7, 2026

Archives

  • June 2026
  • May 2026
  • April 2026
  • November 2025
  • April 2024
  • September 2023
  • August 2023
  • July 2023
  • June 2023
  • April 2023
  • March 2023
  • February 2022
  • January 2022
  • December 2021
  • September 2020
  • October 2019
  • August 2019
  • July 2019
  • December 2018
  • April 2018
  • December 2016
  • September 2016
  • August 2016
  • July 2016
  • April 2015
  • March 2015
  • August 2014
  • March 2014
  • August 2013
  • July 2013
  • June 2013
  • May 2013
  • April 2013
  • March 2013
  • February 2013
  • January 2013
  • October 2012
  • September 2012
  • August 2012
  • February 2012
  • October 2011
  • August 2011
  • June 2011
  • May 2011
  • April 2011
  • February 2011
  • January 2011
  • December 2010
  • November 2010
  • October 2010
  • August 2010
  • July 2010
  • June 2010
  • May 2010
  • April 2010
  • March 2010
  • February 2010
  • January 2010
  • December 2009
  • November 2009
  • October 2009
  • September 2009
  • June 2009
  • May 2009
  • March 2009
  • February 2009
  • January 2009
  • December 2008
  • November 2008
  • October 2008
  • September 2008
  • August 2008
  • July 2008
  • June 2008
  • May 2008
  • April 2008
  • March 2008
  • February 2008
  • January 2008
  • December 2007
  • November 2007
  • October 2007
  • September 2007
  • August 2007
  • July 2007
  • June 2007
  • May 2007
  • April 2007
  • March 2007
  • February 2007
  • January 2007
  • December 2006
  • November 2006
  • October 2006
  • September 2006
  • August 2006
  • July 2006
  • June 2006
  • May 2006
  • April 2006
  • March 2006
  • February 2006
  • January 2006
  • December 2005
  • November 2005
  • October 2005
  • September 2005
  • August 2005
  • July 2005
  • June 2005
  • May 2005
  • April 2005
  • March 2005
  • February 2005
  • January 2005
  • December 2004
  • November 2004
  • October 2004
  • September 2004
  • August 2004
  • July 2004
  • June 2004
  • May 2004
  • April 2004
  • March 2004
  • February 2004
  • January 2004
  • December 2003
  • November 2003
  • October 2003
  • September 2003

Categories

  • AI-ML
  • Augment / Virtual Reality
  • Blogging
  • Cloud
  • DR/Crisis Response/Crisis Management
  • Editorial
  • Financial
  • Make You Smile
  • Malware
  • Mobility
  • Motor Industry
  • News
  • OTT Video
  • Pending Review
  • Personal
  • Product
  • Regulations
  • Secure
  • Security Industry News
  • Security Operations
  • Statistics
  • Threat Intel
  • Trends
  • Uncategorized
  • Warnings
  • WebSite News
  • Zero Trust

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org
© 2026 CyberSecurity Institute | Powered by Superbs Personal Blog theme