Top story
Gemma 4 12B: A Unified, Encoder-Free Multimodal Model. Google releases Gemma 4 12B, an open-weight encoder-free unified multimodal model that processes text, images, and other modalities in a single architecture. Google DeepMind
Research
TRACE: Unified Rollout Budget Allocation for Agentic RL - Introduces a framework for optimally allocating rollout budgets across tasks to improve agentic reinforcement learning efficiency. arXiv
EEVEE: Test-Time Prompt Learning for Self-Improving Agents - Proposes test-time prompt learning methods enabling autonomous agents to self-improve in real-world environments. arXiv
Late-Layer Fusion for Multimodal LLMs - Introduces dual-path vision token routing to handle visual token saturation in multimodal LLMs. Hugging Face
Provenance-Grounded Gating for Synthetic Post-Training Data - Proposes gating and adaptive recovery mechanisms to improve quality of synthetic training data. arXiv
Tools
Apple CoreAI: On-Device Inference for Apple Silicon - Apple quietly announced CoreAI at WWDC, a new on-device inference framework going far beyond CoreML's capabilities, though MLX performance comparison remains unknown. Reddit
North Mini Code: Cohere's First Developer Model - Cohere releases North Mini Code, their first model specifically designed for developer coding tasks. Hugging Face
Migrating GitHub CI to Hugging Face Jobs - Guide on migrating CI workflows to Hugging Face Jobs for ML-native continuous integration. Hugging Face
Industry
China Plans $295B AI Data Center Buildout - China announces a massive AI data center investment as the US-China AI race intensifies. Reddit
Apple's New AI Models Built With Gemini - Apple's latest AI models leverage Gemini while maintaining privacy-first design principles. CNET
Nextdoor Engineers Use Codex to Ship Faster - Nextdoor engineers share how Codex accelerates their development workflows. OpenAI
Community
Crescendo Attacks Hijack AI Agents - Multi-turn context poisoning bypasses per-message injection defenses to silently hijack AI agents. Reddit
Ephemeral Cards for Agentic Payments - Argues that agentic payment infrastructure should use real-time ephemeral card issuance rather than persistent stored credentials. Reddit
Initial Impressions of Claude Fable 5 - Simon Willison shares first impressions of the Claude Fable 5 model release. Simon Willison