The AI Wire

513 articles tagged "c" — page 3 of 18

Knowledge-Embedded Latent Projection for Robust Representation Learning (arxiv.org)

2026-02-19|paper|arXiv

Latent space models are widely used for analyzing high-dimensional discrete data matrices, such as patient-feature matrices in electronic health records (EHRs), by capturing complex dependence structu...

cs-LG math-ST stat-ME

Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

Recent advances in deep reinforcement learning (RL) have achieved strong results on high-dimensional control tasks, but applying RL to reachability problems raises a fundamental mismatch: reachability...

cs-LG cs-RO math-OC

Beyond Match Maximization and Fairness: Retention-Optimized Two-Sided Matching [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

On two-sided matching platforms such as online dating and recruiting, recommendation algorithms often aim to maximize the total number of matches. However, this objective creates an imbalance, where s...

cs-LG

Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

This paper presents the first demonstration of a viable, ultra-fast, radiation-hard machine learning (ML) application on FPGAs, which could be used in future high-energy physics experiments. We presen...

hep-ex cs-LG

Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

We present Lifelong Scalable Multi-Agent Realistic Testbed (LSMART), an open-source simulator to evaluate any Multi-Agent Path Finding (MAPF) algorithm in a Fleet Management System (FMS) with Automate...

cs-RO cs-AI

Ensemble-size-dependence of deep-learning post-processing methods that minimize an (un)fair score: motivating examples and a proof-of-concept solution (arxiv.org)

2026-02-18|paper|arXiv

Fair scores reward ensemble forecast members that behave like samples from the same distribution as the verifying observations. They are therefore an attractive choice as loss functions to train data-...

physics-ao-ph cs-LG

all-mpnet-base-v2 (huggingface.co)

2026-02-14|model|HuggingFace (None)

**all-mpnet-base-v2** is a sentence transformer model designed for generating high-quality sentence embeddings, particularly excelling in semantic textual similarity tasks.[1][3][4]

sentence-transformers pytorch onnx safetensors

mobilenetv3_small_100.lamb_in1k (huggingface.co)

2026-02-13|model|HuggingFace (None)

### 1. Organization/Researcher

timm pytorch safetensors image-classification

T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization [TOP LAB](arxiv.org)

2026-02-13|paper|arXiv

Diffusion large language models (DLLMs) have the potential to enable fast text generation by decoding multiple tokens in parallel. However, in practice, their inference efficiency is constrained by th...

cs-CL cs-LG

Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching [TOP LAB](arxiv.org)

2026-02-13|paper|arXiv

We propose UniDFlow, a unified discrete flow-matching framework for multimodal understanding, generation, and editing. It decouples understanding and generation via task-specific low-rank adapters, av...

cs-CV

Amortized Molecular Optimization via Group Relative Policy Optimization [TOP LAB](arxiv.org)

2026-02-13|paper|arXiv

Molecular design encompasses tasks ranging from de-novo design to structural alteration of given molecules or fragments. For the latter, state-of-the-art methods predominantly function as "Instance Op...

cs-LG

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning [TOP LAB](arxiv.org)

2026-02-13|paper|arXiv

Vision-language-action (VLA) models that directly predict multi-step action chunks from current observations face inherent limitations due to constrained scene understanding and weak future anticipati...

cs-CV

Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment (arxiv.org)

2026-02-13|paper|arXiv

The long-standing vision of general-purpose robots hinges on their ability to understand and act upon natural language instructions. Vision-Language-Action (VLA) models have made remarkable progress t...

cs-RO cs-AI eess-SY

YOR: Your Own Mobile Manipulator for Generalizable Robotics [TOP LAB](arxiv.org)

2026-02-12|paper|arXiv

Recent advances in robot learning have generated significant interest in capable platforms that may eventually approach human-level competence. This interest, combined with the commoditization of actu...

cs-RO cs-LG

SCRAPL: Scattering Transform with Random Paths for Machine Learning [TOP LAB](arxiv.org)

2026-02-12|paper|arXiv

The Euclidean distance between wavelet scattering transform coefficients (known as paths) provides informative gradients for perceptual quality assessment of deep inverse problems in computer vision, ...

cs-SD cs-LG eess-AS

GameDevBench: Evaluating Agentic Capabilities Through Game Development [TOP LAB](arxiv.org)

2026-02-12|paper|arXiv

Despite rapid progress on coding agents, progress on their multimodal counterparts has lagged behind. A key challenge is the scarcity of evaluation testbeds that combine the complexity of software dev...

cs-AI cs-CL cs-SE

ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression [TOP LAB](arxiv.org)

2026-02-12|paper|arXiv

We present ROCKET, a training-free model compression method that achieves state-of-the-art performance in comparison with factorization, structured-sparsification and dynamic compression baselines. Op...

cs-LG cs-AI cs-CL

SurfPhase: 3D Interfacial Dynamics in Two-Phase Flows from Sparse Videos (arxiv.org)

2026-02-12|paper|arXiv

Interfacial dynamics in two-phase flows govern momentum, heat, and mass transfer, yet remain difficult to measure experimentally. Classical techniques face intrinsic limitations near moving interfaces...

cs-CV

A Unified Assessment of the Poverty of the Stimulus Argument for Neural Language Models [TOP LAB](arxiv.org)

2026-02-11|paper|arXiv

How can children acquire native-level syntax from limited input? According to the Poverty of the Stimulus Hypothesis (PoSH), the linguistic input children receive is insufficient to explain certain ge...

cs-CL cs-AI

Learning to Detect Baked Goods with Limited Supervision [TOP LAB](arxiv.org)

2026-02-11|paper|arXiv

Monitoring leftover products provides valuable insights that can be used to optimize future production. This is especially important for German bakeries because freshly baked goods have a very short s...

cs-CV

Biases in the Blind Spot: Detecting What LLMs Fail to Mention (arxiv.org)

2026-02-11|paper|arXiv

Large Language Models (LLMs) often provide chain-of-thought (CoT) reasoning traces that appear plausible, but may hide internal biases. We call these *unverbalized biases*. Monitoring models via their...

cs-LG cs-AI

SAGE: Scalable Agentic 3D Scene Generation for Embodied AI (arxiv.org)

2026-02-11|paper|arXiv

Real-world data collection for embodied agents remains costly and unsafe, calling for scalable, realistic, and simulator-ready 3D environments. However, existing scene-generation systems often rely on...

cs-CV cs-RO

Quantum Multiple Rotation Averaging (arxiv.org)

2026-02-11|paper|arXiv

Multiple rotation averaging (MRA) is a fundamental optimization problem in 3D vision and robotics that aims to recover globally consistent absolute rotations from noisy relative measurements. Establis...

cs-CV

Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models [TOP LAB](arxiv.org)

2026-02-10|paper|arXiv

The prevalent paradigm in robot learning attempts to generalize across environments, embodiments, and tasks with language prompts at runtime. A fundamental tension limits this approach: language is of...

cs-RO cs-LG

CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute [TOP LAB](arxiv.org)

2026-02-10|paper|arXiv

Large Language Models (LLMs) often rely on test-time scaling via parallel decoding (for example, 512 samples) to boost reasoning accuracy, but this incurs substantial compute. We introduce CoRefine, a...

cs-AI cs-CL

DynamiQ: Accelerating Gradient Synchronization using Compressed Multi-hop All-reduce [TOP LAB](arxiv.org)

2026-02-10|paper|arXiv

Multi-hop all-reduce is the de facto backbone of large model training. As the training scale increases, the network often becomes a bottleneck, motivating reducing the volume of transmitted data. Acco...

cs-LG cs-DC cs-NI

Designing Multi-Robot Ground Video Sensemaking with Public Safety Professionals [TOP LAB](arxiv.org)

2026-02-10|paper|arXiv

Videos from fleets of ground robots can advance public safety by providing scalable situational awareness and reducing professionals' burden. Yet little is known about how to design and integrate mult...

cs-HC cs-CV

Autoregressive Image Generation with Masked Bit Modeling (arxiv.org)

2026-02-10|paper|arXiv

This paper challenges the dominance of continuous pipelines in visual generation. We systematically investigate the performance gap between discrete and continuous methods. Contrary to the belief that...

cs-CV

Automatic Detection and Analysis of Singing Mistakes for Music Pedagogy [TOP LAB](arxiv.org)

2026-02-09|paper|arXiv

The advancement of machine learning in audio analysis has opened new possibilities for technology-enhanced music education. This paper introduces a framework for automatic singing mistake detection in...

eess-AS cs-LG

Visual Word Sense Disambiguation with CLIP through Dual-Channel Text Prompting and Image Augmentations [TOP LAB](arxiv.org)

2026-02-09|paper|arXiv

Ambiguity poses persistent challenges in natural language understanding for large language models (LLMs). To better understand how lexical ambiguity can be resolved through the visual domain, we devel...

cs-CL

← Prev3 / 18Next →