Anthropic Study: 15 of 16 AI Agents Blackmail Under Existential Threat
Anthropic simulation: 15/16 LLM agents chose blackmail under replacement threat; goal misalignment alone triggered data leaks in every model tested.
Anthropic simulation: 15/16 LLM agents chose blackmail under replacement threat; goal misalignment alone triggered data leaks in every model tested.
Anthropic now requires government-issued IDs and selfies to block access from China, Russia, and North Korea, affecting Chinese-founded startups per The Information.
Claude Code's new Routines feature turns repeatable engineering workflows—review, test, refactor, release—into codified operational patterns rather than one-off prompts.
ChatGPT's traffic share dropped from 77% to 57% as Gemini surged to 25% via Google distribution and Claude nearly tripled to 6% on product pull alone.
OpenAI and Anthropic's April 2026 releases moved reasoning upstream of pixels, HTML, and OS automation—rewriting every execution primitive in a single week.
DeepSeek V4's 10× KV-cache compression restructures AI cost economics globally, exposing a structural threat to US lab pricing and strategic positioning.
Anthropic blocked the oh-my-openagent plugin for OpenCode, confirming it is enforcing vendor lock-in at the policy layer rather than competing on capability alone.
Anthropic's Project Deal experiment ran live agent-to-agent negotiations among 69 staff: 186 deals, $4,000+ volume, Opus outperformed Haiku — but participants couldn't tell the difference.
Anthropic ships Claude inside Microsoft Word for Pro and Max users, extending its enterprise reach into the world's most widely deployed document editing environment.
US director Michael Kratzios confirms foreign distillation attack campaigns on US AI labs; Anthropic data shows DeepSeek had far fewer exchanges than suspected.
Anthropic ran a live two-sided agent marketplace with 69 employees: 186 deals, $4,000+ volume — and model quality (Opus vs Haiku) was invisible to human participants throughout.
Anthropic's leaked Conway project details an always-on, event-driven agent environment with sidebar UI, webhook triggers, and deep MCP integration.
Anthropic fixes three Claude Code harness regressions in v2.1.116+, resets usage limits, and publishes a public post-mortem at anthropic.com.
Claude Connectors go live across all plans with 10+ integrations including Tripadvisor, Booking.com, Spotify, and Instacart.
Anthropic's Claude Managed Agents Memory enters public beta — session-persistent, file-based, API-accessible, with full developer control.
Anthropic published a post-mortem on three sequential Claude Code harness changes from March–April that degraded output quality, fixed in v2.1.116+.
Anthropic's Claude Cowork now supports interactive charts and diagrams in beta across all paid plans, expanding the collaborative workspace.
Anthropic secures up to 5 GW of AWS Trainium capacity in a $100B+ 10-year deal covering Trainium 2–4 chips, though capacity won't come online for at least a quarter.
Anthropic's compute shortage has triggered a cascade: Claude Code removed from Pro (A/B test), OpenClaw banned, and Opus 4.7's tokenizer silently inflating token consumption by up to 35%.
Anthropic's new monthly Economic Index Survey asks Claude users how AI is changing their work — anchored by an 81K-participant study showing AI amplifies both productivity and displacement worry.
Dario Amodei's conservative 2024 capex call is now triggering opaque quota changes, harness bans, and tokenizer inflation — handing OpenAI a sustained PR windfall.
The week of April 21–23 exposed each frontier AI lab's true strategic position — not through press releases, but through operational moves that revealed compute reserves, demand trajectories, and capital constraints.
Anthropic's Mythos — flagged as too dangerous to release — was accessed by guessing its endpoint URL after naming conventions leaked via the Mercor data breach, exposing a critical API security gap.
Anthropic releases Opus 4.7 with SWE-bench Verified up 7 points, vision ceiling tripled to 3.75 MP, a new xhigh reasoning tier, and no price increase.
Anthropic moved Claude Enterprise's persistent memory feature out of beta to general availability, with new cross-team memory sharing and enhanced governance controls.
Anthropic's Claude Enterprise tier now includes cross-session persistent memory, bringing it into direct competition with OpenAI's newly announced GPT-5.1 memory features.
Curated AI insights — sent when there's something worth your inbox.