The AI Wire

25 articles tagged "cs-RO" — page 1 of 1

Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics (arxiv.org)

2026-02-25|paper|arXiv

Visual reinforcement learning is appealing for robotics but expensive -- off-policy methods are sample-efficient yet slow; on-policy methods parallelize well but waste samples. Recent work has shown t...

cs-RO cs-CV cs-LG

Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

Recent advances in deep reinforcement learning (RL) have achieved strong results on high-dimensional control tasks, but applying RL to reachability problems raises a fundamental mismatch: reachability...

cs-LG cs-RO math-OC

Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

We present Lifelong Scalable Multi-Agent Realistic Testbed (LSMART), an open-source simulator to evaluate any Multi-Agent Path Finding (MAPF) algorithm in a Fleet Management System (FMS) with Automate...

cs-RO cs-AI

Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment (arxiv.org)

2026-02-13|paper|arXiv

The long-standing vision of general-purpose robots hinges on their ability to understand and act upon natural language instructions. Vision-Language-Action (VLA) models have made remarkable progress t...

cs-RO cs-AI eess-SY

YOR: Your Own Mobile Manipulator for Generalizable Robotics [TOP LAB](arxiv.org)

2026-02-12|paper|arXiv

Recent advances in robot learning have generated significant interest in capable platforms that may eventually approach human-level competence. This interest, combined with the commoditization of actu...

cs-RO cs-LG

SAGE: Scalable Agentic 3D Scene Generation for Embodied AI (arxiv.org)

2026-02-11|paper|arXiv

Real-world data collection for embodied agents remains costly and unsafe, calling for scalable, realistic, and simulator-ready 3D environments. However, existing scene-generation systems often rely on...

cs-CV cs-RO

Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models [TOP LAB](arxiv.org)

2026-02-10|paper|arXiv

The prevalent paradigm in robot learning attempts to generalize across environments, embodiments, and tasks with language prompts at runtime. A fundamental tension limits this approach: language is of...

cs-RO cs-LG

SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time (arxiv.org)

2026-01-04|paper|arXiv

We present SpaceTimePilot, a video diffusion model that disentangles space and time for controllable generative rendering. Given a monocular video, SpaceTimePilot can independently alter the camera vi...

cs-CV cs-AI cs-RO

SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time (arxiv.org)

2026-01-03|paper|arXiv

cs-CV cs-AI cs-RO

SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time (arxiv.org)

2026-01-02|paper|arXiv

cs-CV cs-AI cs-RO

SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time (arxiv.org)

2026-01-01|paper|arXiv

cs-CV cs-AI cs-RO

Atomic Action Slicing: Planner-Aligned Options for Generalist VLA Agents [TOP LAB](arxiv.org)

2025-12-15|paper|arXiv

<think>

cs-LG cs-AI cs-RO

Closing the Train-Test Gap in World Models for Gradient-Based Planning [TOP LAB](arxiv.org)

2025-12-11|paper|arXiv

<think>

cs-LG cs-RO

Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks [TOP LAB](arxiv.org)

2025-12-09|paper|arXiv

<think>

cs-RO cs-LG

Training-Time Action Conditioning for Efficient Real-Time Chunking (arxiv.org)

2025-12-08|paper|arXiv

<think>

cs-RO cs-AI

Autonomous Reinforcement Learning Robot Control with Intel's Loihi 2 Neuromorphic Hardware [TOP LAB](arxiv.org)

2025-12-04|paper|arXiv

<think>

cs-RO cs-AI cs-LG

EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI (arxiv.org)

2025-12-02|paper|arXiv

<think>

cs-RO cs-AI cs-CV

Data-Centric Visual Development for Self-Driving Labs (arxiv.org)

2025-12-02|paper|arXiv

<think>

cs-CV cs-RO

TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos (arxiv.org)

2025-11-29|paper|arXiv

Learning new robot tasks on new platforms and in new scenes from only a handful of demonstrations remains challenging. While videos of other embodiments - humans and different robots - are abundant, d...

cs-RO cs-CV cs-LG

TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos (arxiv.org)

2025-11-28|paper|arXiv

<think>

cs-RO cs-CV cs-LG

In-N-On: Scaling Egocentric Manipulation with in-the-wild and on-task Data (arxiv.org)

2025-11-20|paper|arXiv

<think>

cs-RO cs-AI cs-CV

$π^{*}_{0.6}$: a VLA That Learns From Experience [TOP LAB](arxiv.org)

2025-11-19|paper|arXiv

<think>