The AI Wire

150 articles tagged "cs-LG" — page 1 of 5

Partial recovery of meter-scale surface weather [TOP LAB](arxiv.org)

2026-02-27|paper|arXiv

Near-surface atmospheric conditions can differ sharply over tens to hundreds of meters due to land cover and topography, yet this variability is absent from current weather analyses and forecasts. It ...

cs-LG cs-CV physics-ao-ph

Dynamic Personality Adaptation in Large Language Models via State Machines [TOP LAB](arxiv.org)

2026-02-26|paper|arXiv

The inability of Large Language Models (LLMs) to modulate their personality expression in response to evolving dialogue dynamics hinders their performance in complex, interactive contexts. We propose ...

cs-CL cs-HC cs-LG

Disease Progression and Subtype Modeling for Combined Discrete and Continuous Input Data [TOP LAB](arxiv.org)

2026-02-26|paper|arXiv

Disease progression modeling provides a robust framework to identify long-term disease trajectories from short-term biomarker data. It is a valuable tool to gain a deeper understanding of diseases wit...

cs-LG

Scaling State-Space Models on Multiple GPUs with Tensor Parallelism [TOP LAB](arxiv.org)

2026-02-25|paper|arXiv

Selective state space models (SSMs) have rapidly become a compelling backbone for large language models, especially for long-context workloads. Yet in deployment, their inference performance is often ...

cs-DC cs-LG

Test-Time Training with KV Binding Is Secretly Linear Attention (arxiv.org)

2026-02-25|paper|arXiv

Test-time training (TTT) with KV binding as sequence modeling layer is commonly interpreted as a form of online meta-learning that memorizes a key-value mapping at test time. However, our analysis rev...

cs-LG cs-AI cs-CV

Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics (arxiv.org)

2026-02-25|paper|arXiv

Visual reinforcement learning is appealing for robotics but expensive -- off-policy methods are sample-efficient yet slow; on-policy methods parallelize well but waste samples. Recent work has shown t...

cs-RO cs-CV cs-LG

Rethinking Chronological Causal Discovery with Signal Processing [TOP LAB](arxiv.org)

2026-02-24|paper|arXiv

Causal discovery problems use a set of observations to deduce causality between variables in the real world, typically to answer questions about biological or physical systems. These observations are ...

eess-SP cs-LG stat-ML

A Very Big Video Reasoning Suite (arxiv.org)

2026-02-24|paper|arXiv

Rapid progress in video models has largely focused on visual quality, leaving their reasoning capabilities underexplored. Video reasoning grounds intelligence in spatiotemporally consistent visual env...

cs-CV cs-AI cs-LG

Clapeyron Neural Networks for Single-Species Vapor-Liquid Equilibria [TOP LAB](arxiv.org)

2026-02-23|paper|arXiv

Machine learning (ML) approaches have shown promising results for predicting molecular properties relevant for chemical process design. However, they are often limited by scarce experimental property ...

physics-chem-ph cs-LG

Comparative Assessment of Multimodal Earth Observation Data for Soil Moisture Estimation [TOP LAB](arxiv.org)

2026-02-23|paper|arXiv

Accurate soil moisture (SM) estimation is critical for precision agriculture, water resources management and climate monitoring. Yet, existing satellite SM products are too coarse (>1km) for farm-leve...

cs-CV cs-LG

Asymptotically Optimal Sequential Testing with Markovian Data [TOP LAB](arxiv.org)

2026-02-20|paper|arXiv

We study one-sided and $α$-correct sequential hypothesis testing for data generated by an ergodic Markov chain. The null hypothesis is that the unknown transition matrix belongs to a prescribed set $P...

math-ST cs-LG stat-ML

Almost Sure Convergence of Differential Temporal Difference Learning for Average Reward Markov Decision Processes [TOP LAB](arxiv.org)

2026-02-19|paper|arXiv

The average reward is a fundamental performance metric in reinforcement learning (RL) focusing on the long-run performance of an agent. Differential temporal difference (TD) learning algorithms are a ...

cs-LG cs-AI

Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects [TOP LAB](arxiv.org)

2026-02-19|paper|arXiv

Generalized additive models (GAMs) offer interpretability through independent univariate feature effects but underfit when interactions are present in data. GA$^2$Ms add selected pairwise interactions...

cs-LG cs-AI

Knowledge-Embedded Latent Projection for Robust Representation Learning (arxiv.org)

2026-02-19|paper|arXiv

Latent space models are widely used for analyzing high-dimensional discrete data matrices, such as patient-feature matrices in electronic health records (EHRs), by capturing complex dependence structu...

cs-LG math-ST stat-ME

Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

Recent advances in deep reinforcement learning (RL) have achieved strong results on high-dimensional control tasks, but applying RL to reachability problems raises a fundamental mismatch: reachability...

cs-LG cs-RO math-OC

Beyond Match Maximization and Fairness: Retention-Optimized Two-Sided Matching [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

On two-sided matching platforms such as online dating and recruiting, recommendation algorithms often aim to maximize the total number of matches. However, this objective creates an imbalance, where s...

cs-LG

Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml [TOP LAB](arxiv.org)

2026-02-18|paper|arXiv

This paper presents the first demonstration of a viable, ultra-fast, radiation-hard machine learning (ML) application on FPGAs, which could be used in future high-energy physics experiments. We presen...

hep-ex cs-LG

Ensemble-size-dependence of deep-learning post-processing methods that minimize an (un)fair score: motivating examples and a proof-of-concept solution (arxiv.org)

2026-02-18|paper|arXiv

Fair scores reward ensemble forecast members that behave like samples from the same distribution as the verifying observations. They are therefore an attractive choice as loss functions to train data-...

physics-ao-ph cs-LG

T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization [TOP LAB](arxiv.org)

2026-02-13|paper|arXiv

Diffusion large language models (DLLMs) have the potential to enable fast text generation by decoding multiple tokens in parallel. However, in practice, their inference efficiency is constrained by th...

cs-CL cs-LG

Amortized Molecular Optimization via Group Relative Policy Optimization [TOP LAB](arxiv.org)

2026-02-13|paper|arXiv

Molecular design encompasses tasks ranging from de-novo design to structural alteration of given molecules or fragments. For the latter, state-of-the-art methods predominantly function as "Instance Op...

cs-LG

YOR: Your Own Mobile Manipulator for Generalizable Robotics [TOP LAB](arxiv.org)

2026-02-12|paper|arXiv

Recent advances in robot learning have generated significant interest in capable platforms that may eventually approach human-level competence. This interest, combined with the commoditization of actu...

cs-RO cs-LG

SCRAPL: Scattering Transform with Random Paths for Machine Learning [TOP LAB](arxiv.org)

2026-02-12|paper|arXiv

The Euclidean distance between wavelet scattering transform coefficients (known as paths) provides informative gradients for perceptual quality assessment of deep inverse problems in computer vision, ...

cs-SD cs-LG eess-AS

ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression [TOP LAB](arxiv.org)

2026-02-12|paper|arXiv

We present ROCKET, a training-free model compression method that achieves state-of-the-art performance in comparison with factorization, structured-sparsification and dynamic compression baselines. Op...

cs-LG cs-AI cs-CL

Biases in the Blind Spot: Detecting What LLMs Fail to Mention (arxiv.org)

2026-02-11|paper|arXiv

Large Language Models (LLMs) often provide chain-of-thought (CoT) reasoning traces that appear plausible, but may hide internal biases. We call these *unverbalized biases*. Monitoring models via their...

cs-LG cs-AI

Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models [TOP LAB](arxiv.org)

2026-02-10|paper|arXiv

The prevalent paradigm in robot learning attempts to generalize across environments, embodiments, and tasks with language prompts at runtime. A fundamental tension limits this approach: language is of...

cs-RO cs-LG

DynamiQ: Accelerating Gradient Synchronization using Compressed Multi-hop All-reduce [TOP LAB](arxiv.org)

2026-02-10|paper|arXiv

Multi-hop all-reduce is the de facto backbone of large model training. As the training scale increases, the network often becomes a bottleneck, motivating reducing the volume of transmitted data. Acco...

cs-LG cs-DC cs-NI

Automatic Detection and Analysis of Singing Mistakes for Music Pedagogy [TOP LAB](arxiv.org)

2026-02-09|paper|arXiv

The advancement of machine learning in audio analysis has opened new possibilities for technology-enhanced music education. This paper introduces a framework for automatic singing mistake detection in...

eess-AS cs-LG

Pseudo-Invertible Neural Networks [TOP LAB](arxiv.org)

2026-02-06|paper|arXiv

The Moore-Penrose Pseudo-inverse (PInv) serves as the fundamental solution for linear systems. In this paper, we propose a natural generalization of PInv to the nonlinear regime in general and to neur...

cs-LG cs-CV

Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps [TOP LAB](arxiv.org)

2026-02-06|paper|arXiv

Flow and diffusion models produce high-quality samples, but adapting them to user preferences or constraints post-training remains costly and brittle, a challenge commonly called reward alignment. We ...

cs-LG cs-AI

Shared LoRA Subspaces for almost Strict Continual Learning (arxiv.org)

2026-02-06|paper|arXiv

Adapting large pretrained models to new tasks efficiently and continually is crucial for real-world deployment but remains challenging due to catastrophic forgetting and the high cost of retraining. W...

cs-LG cs-AI cs-CV
