Today on the wire
Anthropic launched Claude Fable 5, topping the Artificial Analysis Intelligence Index roughly five points above GPT-5.5, while Microsoft AI debuted its MAI family including the 35B-active MAI-Thinking-1. A social engineering attack using an AI agent to compromise open source maintainers also drew attention, alongside a new executive order mandating pre-release testing of frontier models.
This week, on video
All digestsThe Weekly Digest · June 7, 2026
10.3 MB4 models, 5 papers from the week, featuring Gemma 4 12B: A unified, encoder-free multimodal model; Key capabilities and innovations; Thousand Token Wood: shipping a multi-agent economy on a 3B model.
Recent dispatches
Archive- 117 signals
Google released Gemma 4 12B, an open-weight unified multimodal model handling text and images in a single encoder-free architecture. China announced a $295 billion AI data center investment as the US-China AI race intensifies. Researchers disclosed Crescendo attacks, a multi-turn context poisoning technique that bypasses per-message defenses to hijack AI agents.
- 101 signals
Apple unveiled a new AI architecture built around Google Gemini, deepening the partnership across on-device and cloud intelligence. Xiaomi released MiMo-v2.5-Pro-UltraSpeed, a 1T-parameter model claiming 1000 tokens per second. Anthropic published an analysis asking why AI has advanced faster in coding than biology.
- 93 signals
A software engineer's viral Hacker News post arguing that LLMs are systematically eroding their career has sparked broad debate over AI displacement timelines. On the infrastructure side, the Texas grid flagged voltage stability risks from data center and crypto loads, while llama.cpp merged Gemma4 multi-token prediction support, expanding local inference capabilities.
- 55 signals
Google signed a $920M/month compute deal with xAI and SpaceX, the largest cross-company infrastructure arrangement on record. CZ Biohub released a foundation world model for protein biology, and DeepSeek V4 Flash began running locally via llama.cpp with custom 3-bit quantization. Meta also disclosed thousands of Instagram accounts were compromised through a password reset flaw in its AI chatbot.
- 39 signals
A new paper proves transformers are formally succinct and that key verification problems are EXPSPACE-complete, establishing that rigorous LLM verification is provably intractable. Separately, NVIDIA demonstrated sub-5-second cold-starts for a 120B model on Kubernetes, and Xiaohongshu released dots.tts, a new open text-to-speech model.
- 43 signals
A new pretraining method lets recurrent networks train without sequential recurrence, potentially enabling faster, more parallelizable RNN training. OpenAI expanded GPT-5.5 deployment and introduced a "Dreaming" memory architecture for ChatGPT, while Ollama added support for frontier models including Kimi-K2.6, GLM-5.1, and DeepSeek.
- 73 signals
Google released Gemma 4 12B, a unified multimodal model handling text and images in a single encoder-free architecture. OpenAI also shipped open-weight reasoning models gpt-oss-120b and gpt-oss-20b, and Uber imposed a $1,500 monthly cap on AI coding tools, signaling tighter enterprise cost controls.