The AI Wire

5101 articles — page 8 of 171

DiffUNet^2: Bidirectional Prediction, Probabilistic Generation and Collaborative Visual Discovery for Scientific Data (arxiv.org)

2026-06-03|paper|arxiv

DiffUNet² combines bidirectional prediction, probabilistic generation, and collaborative discovery into a unified diffusion-UNet framework for analyzing scientific imaging data.

FFR: Forward-Forward Learning for Regression (arxiv.org)

2026-06-03|paper|arxiv

Extends Hinton's Forward-Forward algorithm beyond classification by adapting its layer-wise local learning objective to handle continuous-valued regression targets.

Value-Aware Stochastic KV Cache Eviction for Reasoning Models (arxiv.org)

2026-06-03|paper|arxiv

Evicts KV cache entries in reasoning models by weighting eviction decisions according to each token's estimated contribution to the final answer value.

Quadratic integrate-and-fire neurons exhibit less fragmented loss landscapes and outperform leaky integrate-and-fire neurons in spike-based gradient descent (arxiv.org)

2026-06-03|paper|arxiv

Demonstrates that quadratic integrate-and-fire neurons produce smoother loss landscapes than leaky integrate-and-fire neurons, yielding higher accuracy under spike-based backpropagation.

Correcting Neural Operator Spectral Bias via Diffusion Posterior Sampling with Sparse Observations (arxiv.org)

2026-06-03|paper|arxiv

Uses diffusion posterior sampling conditioned on sparse observations to correct the spectral bias of neural operators that over-smooth high-frequency solution components.

Entropy Is Not Enough: Unlocking Effective Reinforcement Learning for Visual Reasoning via Vision-Anchored Token Selection (arxiv.org)

2026-06-03|paper|arxiv

Replaces entropy-based token selection in visual RL with a vision-anchored selection strategy that ties sampled tokens to grounded visual features, improving reasoning performance.

q0: Primitives for Hyper-Epoch Pretraining (arxiv.org)

2026-06-03|paper|arxiv

Introduces low-level programming primitives that organize pretraining into hyper-epochs, enabling structured curriculum control and efficient reuse of data across large-scale training runs.

FlashbackCL: Mitigating Temporal Forgetting in Federated Learning (arxiv.org)

2026-06-03|paper|arxiv

Addresses catastrophic forgetting of older temporal knowledge in federated continual learning by replaying or anchoring to earlier temporal distributions across clients.

SEAOTTER: Sensor Embedded Autoencoding with One-Time Transcode for Efficient Reconstruction (arxiv.org)

2026-06-03|paper|arxiv

Embeds sensor metadata directly into an autoencoder architecture with a single transcoding step, reducing reconstruction overhead for sensor data.

MLSkip: Data Skipping for ML Filters via Lightweight Metadata (arxiv.org)

2026-06-03|paper|arxiv

Accelerates ML-based data filters by using lightweight metadata to skip irrelevant data blocks before invoking the learned filter, reducing inference cost.

A Pocket Offline Model for Simultaneous Speech Translation as CUNI Submission to IWSLT 2026 (arxiv.org)

2026-06-03|paper|arxiv

Presents a compact, offline-capable simultaneous speech translation model designed for low-resource deployment, submitted as the CUNI system to IWSLT 2026.

VLESA: Vision-Language Embodied Safety Agent for Human Activity Monitoring (arxiv.org)

2026-06-03|paper|arxiv

Deploys a vision-language agent that monitors human activities in real time and flags safety-critical behaviors for embodied or surveillance applications.

Efficient ASR Training with Conversations that Never Happened (arxiv.org)

2026-06-03|paper|arxiv

Trains ASR models using synthetically generated conversational speech that was never recorded, reducing dependence on real conversational audio corpora.

Using Reward Uncertainty to Induce Diverse Behaviour in Reinforcement Learning (arxiv.org)

2026-06-03|paper|arxiv

Reward uncertainty estimates are used to diversify agent policies in RL, encouraging exploration of distinct behavioral modes rather than converging to a single solution.

Self-Refining Agentic Reinforcement Learning for Vision-Conditioned UAV Navigation (arxiv.org)

2026-06-03|paper|arxiv

A UAV navigation system uses agentic RL where the agent iteratively refines its own policy using visual observations without requiring human-labeled correction data.

Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning (arxiv.org)

2026-06-03|paper|arxiv

A steering mechanism controls chain-of-thought length and reasoning paths in LLMs at inference time, trading off computational cost against answer quality.

AlignAtt4LLM: Fast AlignAtt for Decoder-Only LLMs at IWSLT 2026 Simultaneous Speech Translation Task (arxiv.org)

2026-06-03|paper|arxiv

An adapted AlignAtt attention-based simultaneous speech translation method is extended to decoder-only LLM architectures for the IWSLT 2026 shared task.

QUBRIC: Co-Designing Queries and Rubrics for RL Beyond Verifiable Rewards (arxiv.org)

2026-06-03|paper|arxiv

A framework jointly designs evaluation queries and scoring rubrics to generate reward signals for RL in tasks where ground-truth verifiable rewards are unavailable.

Quantifying Faithful Confidence Expression in Large Reasoning Models (arxiv.org)

2026-06-03|paper|arxiv

Metrics are introduced to measure how accurately large reasoning models express calibrated confidence that reflects their actual correctness on reasoning tasks.

Formalizing the Binding Problem (arxiv.org)

2026-06-03|paper|arxiv

A formal computational or mathematical framework is proposed to precisely define the binding problem—how distinct features are combined into unified object representations.

Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories (arxiv.org)

2026-06-03|paper|arxiv

A mechanism is proposed for language models to periodically consolidate and restructure acquired knowledge into long-term memory, analogous to sleep-based memory consolidation.

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill (arxiv.org)

2026-06-03|paper|arxiv

A unified reward model maps heterogeneous evaluation criteria onto a shared agent-skill representation space, enabling consistent scoring across diverse task types.

Language Models Compare Quantities Using Number-specific and Unit-specific Heuristics (arxiv.org)

2026-06-03|paper|arxiv

Behavioral analysis reveals that language models resolve quantity comparisons by applying separate heuristics tied to specific number formats and unit types rather than grounded numerical reasoning.

Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking (arxiv.org)

2026-06-03|paper|arxiv

A large-scale data and architectural scaling approach enables a humanoid robot controller to track diverse motions zero-shot using GPT-style model capacity.

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models (arxiv.org)

2026-06-03|paper|arxiv

Perception tokens that encode imagined spatial views are injected into multimodal language models to improve their reasoning about 3D spatial relationships.

Neuron Populations Exhibit Divergent Selectivity with Scale (arxiv.org)

2026-06-03|paper|arxiv

Larger neural networks develop neuron subpopulations with increasingly specialized and divergent feature selectivity compared to smaller models, revealing scale-dependent representational heterogeneity.

CS336: Language Modeling from Scratch (cs336.stanford.edu)

2026-06-02|news|hackernews

Stanford's CS336 course teaches students to build language models from scratch, covering architecture, training, and implementation fundamentals.

AI Agent Guidelines for CS336 at Stanford (github.com)

2026-06-02|news|hackernews

Guidelines governing how students may use AI agents when completing assignments in Stanford's CS336 language modeling course.

Can the stockmarket swallow Anthropic, SpaceX and OpenAI?(economist.com)

2026-06-02|news|hackernews

Financial analysis examining whether public equity markets have sufficient capacity to absorb IPOs or valuations of Anthropic, SpaceX, and OpenAI.

OpenAI frontier models and Codex are now available on AWS (openai.com)

2026-06-02|news|hackernews

OpenAI's frontier models and Codex coding API are now accessible to developers through Amazon Web Services infrastructure.

← Prev8 / 171Next →