The AI Wire

Subword-Based Comparative Linguistics across 242 Languages Using Wikipedia Glottosets (arxiv.org)

2026-01-28|paper|arXiv

We present a large-scale comparative study of 242 Latin and Cyrillic-script languages using subword-based methodologies. By constructing 'glottosets' from Wikipedia lexicons, we introduce a framework ...

cs-CL cs-AI cs-LG

Scalable Algorithms for Approximate DNF Model Counting [TOP LAB](arxiv.org)

2026-01-16|paper|arXiv

Model counting of Disjunctive Normal Form (DNF) formulas is a critical problem in applications such as probabilistic inference and network reliability. For example, it is often used for query evaluati...

cs-DS cs-AI

Information Access of the Oppressed: A Problem-Posing Framework for Envisioning Emancipatory Information Access Platforms [TOP LAB](arxiv.org)

2026-01-15|paper|arXiv

Online information access (IA) platforms are targets of authoritarian capture. These concerns are particularly serious and urgent today in light of the rising levels of democratic erosion worldwide, t...

cs-CY cs-AI cs-HC

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning (arxiv.org)

2026-01-15|paper|arXiv

Vision-Language-Action (VLA) tasks require reasoning over complex visual scenes and executing adaptive actions in dynamic environments. While recent studies on reasoning VLAs show that explicit chain-...

cs-CV cs-AI cs-LG

Kinship Data Benchmark for Multi-hop Reasoning [TOP LAB](arxiv.org)

2026-01-13|paper|arXiv

Large language models (LLMs) are increasingly evaluated on their ability to perform multi-hop reasoning, i.e., to combine multiple pieces of information into a coherent inference. We introduce Kinship...

cs-CL cs-AI

AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs (arxiv.org)

2026-01-12|paper|arXiv

Large language models (LLMs) exhibit complementary strengths arising from differences in pretraining data, model architectures, and decoding behaviors. Inference-time ensembling provides a practical w...

cs-CL cs-AI

fairface_age_image_detection (huggingface.co)

2026-01-11|model|HuggingFace

transformers safetensors vit image-classification

An Empirical Investigation of Robustness in Large Language Models under Tabular Distortions [TOP LAB](arxiv.org)

2026-01-11|paper|arXiv

We investigate how large language models (LLMs) fail when tabular data in an otherwise canonical representation is subjected to semantic and structural distortions. Our findings reveal that LLMs lack ...

cs-AI

On the Definition and Detection of Cherry-Picking in Counterfactual Explanations [TOP LAB](arxiv.org)

2026-01-11|paper|arXiv

Counterfactual explanations are widely used to communicate how inputs must change for a model to alter its prediction. For a single instance, many valid counterfactuals can exist, which leaves open th...

cs-LG cs-AI

DatBench: Discriminative, Faithful, and Efficient VLM Evaluations [TOP LAB](arxiv.org)

2026-01-06|paper|arXiv

Empirical evaluation serves as the primary compass guiding research progress in foundation models. Despite a large body of work focused on training frontier vision-language models (VLMs), approaches t...

cs-LG cs-AI

BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models [TOP LAB](arxiv.org)

2026-01-06|paper|arXiv

Vision language foundation models such as CLIP exhibit impressive zero-shot generalization yet remain vulnerable to spurious correlations across visual and textual modalities. Existing debiasing appro...

cs-CV cs-AI cs-LG

mlflow (github.com)

2026-01-05|tool|GitHub

MLflow is an open-source platform for managing the complete machine learning (ML) lifecycle, including experiment tracking, model packaging, deployment, and governance. Originally developed by Dat...

machine-learning ai ml mlflow

FedHypeVAE: Federated Learning with Hypernetwork Generated Conditional VAEs for Differentially Private Embedding Sharing [TOP LAB](arxiv.org)

2026-01-05|paper|arXiv

Federated data sharing promises utility without centralizing raw data, yet existing embedding-level generators struggle under non-IID client heterogeneity and provide limited formal protection against...

cs-LG cs-AI cs-CV

A Comprehensive Dataset for Human vs. AI Generated Image Detection [TOP LAB](arxiv.org)

2026-01-05|paper|arXiv

Multimodal generative AI systems like Stable Diffusion, DALL-E, and MidJourney have fundamentally changed how synthetic images are created. These tools drive innovation but also enable the spread of m...

cs-CV cs-AI

SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time (arxiv.org)

2026-01-04|paper|arXiv

We present SpaceTimePilot, a video diffusion model that disentangles space and time for controllable generative rendering. Given a monocular video, SpaceTimePilot can independently alter the camera vi...

cs-CV cs-AI cs-RO

AI-Driven Cloud Resource Optimization for Multi-Cluster Environments [TOP LAB](arxiv.org)

2026-01-04|paper|arXiv

Modern cloud-native systems increasingly rely on multi-cluster deployments to support scalability, resilience, and geographic distribution. However, existing resource management approaches remain larg...

cs-DC cs-AI

Video and Language Alignment in 2D Systems for 3D Multi-object Scenes with Multi-Information Derivative-Free Control [TOP LAB](arxiv.org)

2026-01-04|paper|arXiv

Cross-modal systems trained on 2D visual inputs are presented with a dimensional shift when processing 3D scenes. An in-scene camera bridges the dimensionality gap but requires learning a control modu...

cs-CV cs-AI

mlflow (github.com)

2026-01-01|tool|GitHub

MLflow is an open-source platform for managing the complete machine learning (ML) lifecycle, including experiment tracking, model packaging, deployment, and evaluation, with support for traditional ...

machine-learning ai ml mlflow

AI tutoring can safely and effectively support students: An exploratory RCT in UK classrooms [TOP LAB](arxiv.org)

2025-12-31|paper|arXiv

One-to-one tutoring is widely considered the gold standard for personalized education, yet it remains prohibitively expensive to scale. To evaluate whether generative AI might help expand access to th...

cs-CY cs-AI cs-LG

airflow (github.com)

2025-12-27|tool|GitHub

Apache Airflow is an open-source platform for programmatically authoring, scheduling, and monitoring workflows, particularly data pipelines, using Python code to define directed acyclic graphs (DAGs...

airflow apache apache-airflow python