The AI Wire

5101 articles — page 12 of 171

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation (huggingface.co)

2026-06-01|model|huggingface

Automatically generates reusable AI agent skills by distilling knowledge from human experts, reducing manual skill engineering for complex task pipelines.

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents (huggingface.co)

2026-06-01|model|huggingface

Provides an automated auditing framework that evaluates and surfaces gaps, redundancies, or failures within the open skill ecosystem available to LLM-based agents.

FRAPPE: Full Input, Residual Output Autoencoding with Projection Pursuit Encoder (huggingface.co)

2026-06-01|model|huggingface

An autoencoder architecture that takes full input, produces residual outputs, and uses a projection pursuit encoder to learn compact, disentangled latent representations.

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue (huggingface.co)

2026-06-01|model|huggingface

A zero-shot speech synthesis system that generates expressive, long-form audio for both monologue and multi-speaker dialogue without speaker-specific training data.

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer (huggingface.co)

2026-06-01|model|huggingface

Generates spatially positioned, synchronized audio in a streaming fashion using an autoregressive diffusion transformer that produces multichannel spatial audio in real time.

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios (huggingface.co)

2026-06-01|model|huggingface

Systematically evaluates long-form speech generation systems across diverse scenarios including different speaking styles, domains, and acoustic conditions to expose failure modes.

Frequency-Guided Action Diffusion via Sub-Frequency Manifold Traversal (huggingface.co)

2026-06-01|model|huggingface

Uses frequency-domain decomposition and sub-frequency manifold traversal to guide a diffusion model for generating temporally coherent and smooth action sequences.

The Good, the Bad, and the Ugly of Markov Boundary for Tabular Prediction (huggingface.co)

2026-06-01|model|huggingface

Analyzes when Markov boundary feature selection helps, hurts, or produces mixed results for tabular prediction tasks, clarifying its practical reliability.

AnyMo: Scaling Any-Modality Conditional Motion Generation with Masked Modeling (huggingface.co)

2026-06-01|model|huggingface

Scales human motion generation by conditioning on any combination of input modalities using masked modeling, enabling flexible multimodal control over generated motions.

Count Anything (huggingface.co)

2026-06-01|model|huggingface

A general-purpose counting model that estimates the quantity of arbitrary object categories in images based on open-vocabulary or user-specified targets.

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement (huggingface.co)

2026-06-01|model|huggingface

Uses on-policy data generated during RLHF training to self-supervisedly improve reward model accuracy, addressing reward model degradation caused by policy distribution shift.

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks (huggingface.co)

2026-06-01|model|huggingface

Trains agents on open-ended tasks through self-play where multiple policies co-evolve together, generating increasingly challenging and diverse training signal without human-designed curricula.

Seeing Isn't Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)?(huggingface.co)

2026-06-01|model|huggingface

Evaluates whether vision-language models can reliably abstain from answering spatial questions they lack sufficient visual information to answer correctly, diagnosing failure modes.

Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents (huggingface.co)

2026-06-01|model|huggingface

Introduces a benchmark and synthetic trajectory generation method for training GUI agents to recover from their own policy-induced errors during task execution.

pydantic-monty investigation (simonwillison.net)

2026-06-01|news|blog/Simon Willison

An investigation into issues or behavior observed in the pydantic-monty library, likely examining bugs, unexpected functionality, or security concerns.

The solution might be cancelling my AI subscription (simonwillison.net)

2026-06-01|news|blog/Simon Willison

A personal account arguing that cancelling an AI subscription was the right practical or financial decision, weighing real utility against cost.

datasette 1.0a32 (simonwillison.net)

2026-06-01|news|blog/Simon Willison

Release notes for version 1.0a32 of Datasette, the open-source tool for exploring and publishing SQLite databases, detailing new features or fixes.

May 2026 newsletter (simonwillison.net)

2026-06-01|news|blog/Simon Willison

A monthly newsletter from May 2026 summarizing recent developments, projects, or curated content relevant to the author's focus area.

Weekend trivia: your process' memory is a file (lcamtuf.substack.com)

2026-06-01|news|blog/lcamtuf (Michal Zalewski)

Explains that a running process's memory is exposed as a file on disk via interfaces like /proc/pid/mem, illustrating Unix's everything-is-a-file design.

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action (huggingface.co)

2026-06-01|news|blog/Hugging Face Blog

NVIDIA releases Cosmos 3, an open multimodal model designed to support physical AI systems by integrating reasoning and action planning across modalities.

The History of "Prisencolinensinainciusol"(dirkdeklein.net)

2026-06-01|news|hackernews

Traces the origin and cultural journey of Adriano Celentano's 1972 nonsense-lyric song deliberately composed to mimic American English sounds without meaning.

Rubin Tracks Skyscraper-Size Asteroids and Failed Supernovas (quantamagazine.org)

2026-06-01|news|hackernews

The Vera Rubin Observatory has detected both very large near-Earth asteroids and failed supernova candidates (stars that collapse without a visible explosion).

If LLMs Have Human-Like Attributes, Then So Does Age of Empires II (arxiv.org)

2026-06-01|paper|arxiv

Argues that criteria used to attribute human-like properties to LLMs are so broad they would equally apply to Age of Empires II, exposing the criteria as flawed.

On the Relationship Between Activation Outliers and Feature Death in Sparse Autoencoders (arxiv.org)

2026-06-01|paper|arxiv

Identifies a mechanistic link between large activation outliers in sparse autoencoders and the phenomenon where learned features permanently stop firing during training.

Separating Secrets from Placeholders: A Hybrid CNN-CodeBERT Framework for Three-Class Credential Leakage Detection (arxiv.org)

2026-06-01|paper|arxiv

A hybrid CNN and CodeBERT model classifies source code tokens into three categories: real secrets, placeholder credentials, and non-credentials, to reduce false-positive leak alerts.

UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception (arxiv.org)

2026-06-01|paper|arxiv

Extends speech-focused audio tokenizers with general audio perception capabilities so a single tokenizer handles diverse audio types without losing speech semantic quality.

Chem-PerturBridge: a harmonized compendium of small molecule perturbation transcriptomic effects (arxiv.org)

2026-06-01|paper|arxiv

Integrates and harmonizes transcriptomic response data from multiple small-molecule perturbation experiments into a unified, consistently formatted compendium for downstream analysis.

Value Functions as Supermartingale Certificates (arxiv.org)

2026-06-01|paper|arxiv

Establishes that reinforcement learning value functions satisfying supermartingale conditions serve as formal safety and stability certificates for stochastic dynamical systems.

Discovering Thermodynamically Admissible Dissipation Potentials via Grammar-Based Symbolic Regression (arxiv.org)

2026-06-01|paper|arxiv

Uses grammar-constrained symbolic regression to automatically infer dissipation potential functions that are guaranteed to satisfy thermodynamic admissibility constraints from data.

Feature-Optimized Vision for Adaptive 3D Scene Reconstruction (arxiv.org)

2026-06-01|paper|arxiv

Optimizes visual features extracted from input images to adaptively guide 3D scene reconstruction, improving quality under varying scene conditions.

← Prev12 / 171Next →