Gemma 4 12B: A Unified, Encoder-Free Multimodal Model - Google releases Gemma 4 12B, an open-weight, encoder-free multimodal model that processes text, images, and other modalities in a single architecture. Google DeepMind

TRACE: Unified Rollout Budget Allocation for Agentic RL - Introduces a framework for optimally allocating rollout budgets across tasks to improve agentic reinforcement learning efficiency. arXiv

EEVEE: Test-Time Prompt Learning for Self-Improving Agents - Proposes test-time prompt learning methods enabling autonomous agents to self-improve in real-world environments. arXiv

Late-Layer Fusion for Multimodal LLMs - Introduces dual-path vision token routing to handle visual token saturation in multimodal LLMs. Hugging Face

Provenance-Grounded Gating for Synthetic Post-Training Data - Proposes gating and adaptive recovery mechanisms to improve the quality of synthetic training data. arXiv

Apple CoreAI: On-Device Inference for Apple Silicon - At WWDC, Apple quietly announced CoreAI, a new on-device inference framework that goes far beyond Core ML's capabilities; how its performance compares with MLX remains unknown. Reddit

North Mini Code: Cohere's First Developer Model - Cohere releases North Mini Code, its first model designed specifically for coding tasks. Hugging Face

Migrating GitHub CI to Hugging Face Jobs - Guide on migrating CI workflows to Hugging Face Jobs for ML-native continuous integration. Hugging Face

China Plans $295B AI Data Center Buildout - China announces a massive AI data center investment as the US-China AI race intensifies. Reddit

Apple's New AI Models Built With Gemini - Apple's latest AI models leverage Gemini while maintaining privacy-first design principles. CNET

Nextdoor Engineers Use Codex to Ship Faster - Nextdoor engineers share how Codex accelerates their development workflows. OpenAI

Crescendo Attacks Hijack AI Agents - Multi-turn context poisoning bypasses per-message injection defenses to silently hijack AI agents. Reddit

Ephemeral Cards for Agentic Payments - Argues that agentic payment infrastructure should use real-time ephemeral card issuance rather than persistent stored credentials. Reddit

Initial Impressions of Claude Fable 5 - Simon Willison shares first impressions of the Claude Fable 5 model release. Simon Willison