The AI Wire

LookAroundNet: Extending Temporal Context with Transformers for Clinically Viable EEG Seizure Detection (arxiv.org)

2026-01-12|paper|arXiv

Automated seizure detection from electroencephalography (EEG) remains difficult due to the large variability of seizure dynamics across patients, recording conditions, and clinical settings. We introd...

cs-LG

Detecting Stochasticity in Discrete Signals via Nonparametric Excursion Theorem (arxiv.org)

2026-01-12|paper|arXiv

We develop a practical framework for distinguishing diffusive stochastic processes from deterministic signals using only a single discrete time series. Our approach is based on classical excursion and...

stat-ML cs-LG eess-SP

On the Definition and Detection of Cherry-Picking in Counterfactual Explanations [TOP LAB](arxiv.org)

2026-01-11|paper|arXiv

Counterfactual explanations are widely used to communicate how inputs must change for a model to alter its prediction. For a single instance, many valid counterfactuals can exist, which leaves open th...

cs-LG cs-AI

Heterogeneous Low-Bandwidth Pre-Training of LLMs (arxiv.org)

2026-01-06|paper|arXiv

Pre-training large language models (LLMs) increasingly requires distributed compute, yet bandwidth constraints make it difficult to scale beyond well-provisioned datacenters-especially when model para...

cs-LG

DatBench: Discriminative, Faithful, and Efficient VLM Evaluations [TOP LAB](arxiv.org)

2026-01-06|paper|arXiv

Empirical evaluation serves as the primary compass guiding research progress in foundation models. Despite a large body of work focused on training frontier vision-language models (VLMs), approaches t...

cs-LG cs-AI

BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models [TOP LAB](arxiv.org)

2026-01-06|paper|arXiv

Vision language foundation models such as CLIP exhibit impressive zero-shot generalization yet remain vulnerable to spurious correlations across visual and textual modalities. Existing debiasing appro...

cs-CV cs-AI cs-LG

FedHypeVAE: Federated Learning with Hypernetwork Generated Conditional VAEs for Differentially Private Embedding Sharing [TOP LAB](arxiv.org)

2026-01-05|paper|arXiv

Federated data sharing promises utility without centralizing raw data, yet existing embedding-level generators struggle under non-IID client heterogeneity and provide limited formal protection against...

cs-LG cs-AI cs-CV

Memory Bank Compression for Continual Adaptation of Large Language Models [TOP LAB](arxiv.org)

2026-01-05|paper|arXiv

Large Language Models (LLMs) have become a mainstay for many everyday applications. However, as data evolve their knowledge quickly becomes outdated. Continual learning aims to update LLMs with new in...

cs-LG cs-CL

Two Deep Learning Approaches for Automated Segmentation of Left Ventricle in Cine Cardiac MRI (arxiv.org)

2026-01-05|paper|arXiv

Left ventricle (LV) segmentation is critical for clinical quantification and diagnosis of cardiac images. In this work, we propose two novel deep learning architectures called LNU-Net and IBU-Net for ...

cs-CV cs-LG

AI tutoring can safely and effectively support students: An exploratory RCT in UK classrooms [TOP LAB](arxiv.org)

2025-12-31|paper|arXiv

One-to-one tutoring is widely considered the gold standard for personalized education, yet it remains prohibitively expensive to scale. To evaluate whether generative AI might help expand access to th...

cs-CY cs-AI cs-LG

From geometry to dynamics: Learning overdamped Langevin dynamics from sparse observations with geometric constraints [TOP LAB](arxiv.org)

2025-12-31|paper|arXiv

How can we learn the laws underlying the dynamics of stochastic systems when their trajectories are sampled sparsely in time? Existing methods either require temporally resolved high-frequency observa...

math-DS cond-mat-stat-mech cs-LG

Stochastic Siamese MAE Pretraining for Longitudinal Medical Images [TOP LAB](arxiv.org)

2025-12-31|paper|arXiv

Temporally aware image representations are crucial for capturing disease progression in 3D volumes of longitudinal medical datasets. However, recent state-of-the-art self-supervised learning approache...

cs-LG cs-CV

AI tutoring can safely and effectively support students: An exploratory RCT in UK classrooms [TOP LAB](arxiv.org)

2025-12-30|paper|arXiv

One-to-one tutoring is widely considered the gold standard for personalized education, yet it remains prohibitively expensive to scale. To evaluate whether generative AI might help expand access to th...

cs-CY cs-AI cs-LG

From geometry to dynamics: Learning overdamped Langevin dynamics from sparse observations with geometric constraints [TOP LAB](arxiv.org)

2025-12-30|paper|arXiv

How can we learn the laws underlying the dynamics of stochastic systems when their trajectories are sampled sparsely in time? Existing methods either require temporally resolved high-frequency observa...

math-DS cond-mat-stat-mech cs-LG

Stochastic Siamese MAE Pretraining for Longitudinal Medical Images [TOP LAB](arxiv.org)

2025-12-30|paper|arXiv

Temporally aware image representations are crucial for capturing disease progression in 3D volumes of longitudinal medical datasets. However, recent state-of-the-art self-supervised learning approache...

cs-LG cs-CV

Learning to Solve PDEs on Neural Shape Representations [TOP LAB](arxiv.org)

2025-12-27|paper|arXiv

Solving partial differential equations (PDEs) on shapes underpins many shape analysis and engineering tasks; yet, prevailing PDE solvers operate on polygonal/triangle meshes while modern 3D assets inc...

cs-LG

Learning to Solve PDEs on Neural Shape Representations [TOP LAB](arxiv.org)

2025-12-26|paper|arXiv

Solving partial differential equations (PDEs) on shapes underpins many shape analysis and engineering tasks; yet, prevailing PDE solvers operate on polygonal/triangle meshes while modern 3D assets inc...

cs-LG

Learning to Solve PDEs on Neural Shape Representations [TOP LAB](arxiv.org)

2025-12-25|paper|arXiv

Solving partial differential equations (PDEs) on shapes underpins many shape analysis and engineering tasks; yet, prevailing PDE solvers operate on polygonal/triangle meshes while modern 3D assets inc...

cs-LG

LongVideoAgent: Multi-Agent Reasoning with Long Videos (arxiv.org)

2025-12-24|paper|arXiv

Recent advances in multimodal LLMs and systems that use tools for long-video QA point to the promise of reasoning over hour-long episodes. However, many methods still compress content into lossy summa...

cs-AI cs-CV cs-LG

Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning (arxiv.org)

2025-12-23|paper|arXiv

We introduce Perception Encoder Audiovisual, PE-AV, a new family of encoders for audio and video understanding trained with scaled contrastive learning. Built on PE, PE-AV makes several key contributi...

cs-SD cs-CV cs-LG

Learning vertical coordinates via automatic differentiation of a dynamical core [TOP LAB](arxiv.org)

2025-12-22|paper|arXiv

Terrain-following coordinates in atmospheric models often imprint their grid structure onto the solution, particularly over steep topography, where distorted coordinate layers can generate spurious ho...

physics-ao-ph cs-LG physics-flu-dyn

Domain-Aware Quantum Circuit for QML [TOP LAB](arxiv.org)

2025-12-22|paper|arXiv

Designing parameterized quantum circuits (PQCs) that are expressive, trainable, and robust to hardware noise is a central challenge for quantum machine learning (QML) on noisy intermediate-scale quant...

quant-ph cs-LG

Tiny Recursive Control: Iterative Reasoning for Efficient Optimal Control [TOP LAB](arxiv.org)

2025-12-21|paper|arXiv

Neural network controllers increasingly demand millions of parameters, and language model approaches push into the billions. For embedded aerospace systems with strict power and latency constraints, t...

cs-LG math-DS

NRGPT: An Energy-based Alternative for GPT [TOP LAB](arxiv.org)

2025-12-21|paper|arXiv

Generative Pre-trained Transformer (GPT) architectures are the most popular design for language modeling. Energy-based modeling is a different paradigm that views inference as a dynamical process oper...

cs-LG

Tiny Recursive Control: Iterative Reasoning for Efficient Optimal Control [TOP LAB](arxiv.org)

2025-12-20|paper|arXiv

Neural network controllers increasingly demand millions of parameters, and language model approaches push into the billions. For embedded aerospace systems with strict power and latency constraints, t...

cs-LG math-DS

NRGPT: An Energy-based Alternative for GPT [TOP LAB](arxiv.org)

2025-12-20|paper|arXiv

Generative Pre-trained Transformer (GPT) architectures are the most popular design for language modeling. Energy-based modeling is a different paradigm that views inference as a dynamical process oper...

cs-LG

Tiny Recursive Control: Iterative Reasoning for Efficient Optimal Control [TOP LAB](arxiv.org)

2025-12-19|paper|arXiv

<think>

cs-LG math-DS

NRGPT: An Energy-based Alternative for GPT [TOP LAB](arxiv.org)

2025-12-19|paper|arXiv

<think>

cs-LG

Early Warning Index for Patient Deteriorations in Hospitals [TOP LAB](arxiv.org)

2025-12-17|paper|arXiv

<think>

cs-LG

LLmFPCA-detect: LLM-powered Multivariate Functional PCA for Anomaly Detection in Sparse Longitudinal Texts [TOP LAB](arxiv.org)

2025-12-17|paper|arXiv

<think>

stat-ML cs-LG