The AI Wire

5101 articles — page 7 of 171

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues (huggingface.co)

2026-06-03|model|huggingface

Ψ-Bench evaluates how well conversational AI systems tailor persuasive dialogue strategies to individual user personas and psychological profiles.

Value-Aware Stochastic KV Cache Eviction for Reasoning Models (huggingface.co)

2026-06-03|model|huggingface

A KV cache eviction policy for reasoning models selectively discards cache entries based on their estimated contribution to output value, reducing memory without degrading reasoning quality.

MERIT: Learning Disentangled Music Representations for Audio Similarity (huggingface.co)

2026-06-03|model|huggingface

MERIT learns disentangled latent representations of music that separate independent attributes such as melody, rhythm, and timbre to improve audio similarity retrieval.

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations (huggingface.co)

2026-06-03|model|huggingface

Linear probes trained to detect deceptive internal states in LLMs are stress-tested for robustness under adversarial pressure, with analysis of how deception organizes geometrically in representation space.

World Models Meet Language Models: On the Complementarity of Concrete and Abstract Reasoning (huggingface.co)

2026-06-03|model|huggingface

Combines world models handling concrete environment dynamics with language models handling abstract reasoning, showing the two approaches are complementary rather than competing.

A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RL (huggingface.co)

2026-06-03|model|huggingface

A local perturbation theory formalizes how policy updates in one domain cause interference in others during multi-domain RL and derives recovery conditions to restore cross-domain performance.

Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces (huggingface.co)

2026-06-03|model|huggingface

Analyzes long chain-of-thought training traces where the final answer is correct but intermediate reasoning steps contain harmful continuations, diagnosing how such traces arise and their training risks.

ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree (huggingface.co)

2026-06-03|model|huggingface

ClawHub analyzes malware signals by reconciling disagreements between VirusTotal verdicts, static analysis findings, and SkillSpector detections to improve security assessments.

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models (huggingface.co)

2026-06-03|model|huggingface

AutoMedBench evaluates agentic AI systems on automated medical research tasks, benchmarking their ability to autonomously conduct and validate biomedical investigations.

TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL (huggingface.co)

2026-06-03|model|huggingface

TRON provides rule-verifiable online environments specifically designed for training visual reasoning agents via reinforcement learning with objectively checkable rewards.

Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling (huggingface.co)

2026-06-03|model|huggingface

A small RL controller guides token sampling decisions of a large language model at test time, improving output quality without retraining the LLM.

Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation (huggingface.co)

2026-06-03|model|huggingface

Decoupled residual denoising separates content and style pathways in a diffusion model to enable unified image-to-image translation across multiple tasks with fewer training examples.

PaddleOCR-VL-1.6: Expanding the Frontier of Document Parsing with Under-Optimized Region Refinement and Progressive Post-Training (huggingface.co)

2026-06-03|model|huggingface

PaddleOCR-VL-1.6 improves document parsing by targeting previously under-optimized layout regions and applying a progressive post-training strategy to boost recognition accuracy.

micropython-wasm 0.1a0 (simonwillison.net)