AI Coding News

May 25, 2026

Key Signals

Multi-model AI code review is emerging as a best practice for quality-obsessed development. Developer Nolan Lawson describes a workflow combining Claude sub-agents, Codex, and Cursor Bugbot to review PRs in parallel, achieving near-zero false positive rates for bug detection. Rather than chasing velocity, this "slow coding" approach uses AI to find pre-existing bugs and improve codebase health—positioning multi-agent review as a counterweight to the "vibe coding" trend. [1]
AI-driven vulnerability discovery is fundamentally reshaping the security economics of software development. Security researchers report submitting 3x more bugs than last year, Google observed criminal actors using AI-discovered zero-days in the wild for the first time, and the traditional 90-day disclosure window is under pressure as "LLMs have compressed both timelines." Organizations like Google are restructuring their reward programs, while Curl and the Linux kernel security list struggle with submission volume. [2]
Microsoft's MDASH deploys 100+ specialized AI agents for automated large-scale code auditing. The multi-model agentic security platform scored 88.45% on the CyberGym benchmark (5 points above the next entry) and achieved 96–100% recall on historical Windows kernel vulnerabilities. The model-agnostic architecture—with separate agents for scanning, debate, validation, and exploitation—signals that future AI security tooling will depend more on orchestration frameworks than raw model capability. [3]
Node.js VFS proposal built with Claude Code ignites governance debate about AI-generated contributions to critical infrastructure. A 19,000-line PR by TSC member Matteo Collina triggered a petition from the io.js fork initiator to ban AI-assisted code in Node.js core, raising concerns about DCO compliance and reviewability. The underlying feature—an in-memory VFS for AI agent sandboxing—has already been adopted by LangChain and Vercel, highlighting the tension between AI-accelerated development velocity and open-source governance norms. [4]
Gemma 4's multi-token prediction drafters achieve up to ~3x faster inference through speculative decoding without quality loss. By sharing the target model's KV cache, the lightweight MTP drafters reduce the traditional memory overhead of running two models simultaneously. This optimization is particularly impactful for local AI coding tools running on consumer GPUs, where memory bandwidth is the primary bottleneck for responsive code completion and generation. [5]

AI Coding News

Agentic AI models are creating an arms race between ethical bug hunters and criminal attackers. Google Threat Intelligence confirmed the first observed case of criminal actors using AI tools to develop a zero-day exploit targeting two-factor authentication. Security researcher Joseph Thacker reports submitting 3x more bugs than last year using AI tools, while Curl's Daniel Stenberg notes submissions have shifted from "AI slop" to "really good security reports, almost all done with the help of AI" at "never-before seen frequency." The 90-day responsible disclosure window—built for an era when bug finders were rare—is increasingly seen as obsolete. [2]
A developer workflow combining Claude, Codex, and Cursor Bugbot for multi-model PR review demonstrates near-zero false positive rates. The technique runs three independent AI agents against the same PR, cross-references their findings to eliminate hallucinations, and ranks bugs by severity. The author reports this approach often uncovers pre-existing bugs rather than increasing velocity, making it more of a codebase health tool than a productivity accelerator. The workflow represents a maturing pattern of using competing models as mutual validators. [1]
Node.js faces a governance crisis over a major AI-assisted contribution that also delivers critical AI infrastructure. The proposed node:vfs module would provide in-memory file system sandboxing—a feature LangChain is already using for AI agent isolation—but the fact it was built with Claude Code has forced the TSC toward a policy vote on AI-generated contributions. Deno has opened a tracking issue for compatibility, while Bun has not announced plans. The outcome will set precedent for how major open-source projects handle AI-assisted development at scale. [4]

Feature Update

GitHub Copilot CLI v1.0.55-0 fixes extension launch in single-executable application mode. This patch release resolves a bug where extensions would not start correctly when the CLI runs as a single-executable application, ensuring third-party extensions work reliably in environments using the SEA distribution format. [6]
Gemma 4 multi-token prediction drafters are now available across Hugging Face, Kaggle, and Ollama. The MTP drafters pair with Gemma 4 31B dense and 26B MoE models to deliver up to ~3x faster token generation via speculative decoding. The key architectural innovation is sharing the target model's KV cache with the drafter, which significantly reduces the memory penalty traditionally associated with running auxiliary draft models. Consumer GPUs and mobile devices running E2B and E4B variants also benefit. [5]
Microsoft MDASH enters limited private preview for automated multi-agent code vulnerability research. The system orchestrates over 100 specialized AI agents through a multi-stage pipeline handling scanning, debate, validation, deduplication, and exploitation proof generation. It achieved 88.45% on CyberGym (1,507 real-world vulnerabilities), 96% recall on clfs.sys CVEs, and 100% recall on tcpip.sys CVEs. The model-agnostic design allows teams to swap underlying models while preserving the orchestration infrastructure. [3]