The AI Wire

513 articles tagged "c" — page 5 of 18

Accelerating Scientific Research with Gemini: Case Studies and Common Techniques [TOP LAB](arxiv.org)

2026-02-04|paper|arXiv

Recent advances in large language models (LLMs) have opened new avenues for accelerating scientific research. While models are increasingly capable of assisting with routine tasks, their ability to co...

cs-CL cs-AI

Equilibrium Propagation for Non-Conservative Systems [TOP LAB](arxiv.org)

2026-02-04|paper|arXiv

Equilibrium Propagation (EP) is a physics-inspired learning algorithm that uses stationary states of a dynamical system both for inference and learning. In its original formulation it is limited to co...

cs-LG cs-AI cs-NE

PLATE: Plasticity-Tunable Efficient Adapters for Geometry-Aware Continual Learning (arxiv.org)

2026-02-04|paper|arXiv

We develop a continual learning method for pretrained models that \emph{requires no access to old-task data}, addressing a practical barrier in foundation model adaptation where pretraining distributi...

cs-LG cs-AI

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing (arxiv.org)

2026-02-04|paper|arXiv

Parallel thinking has emerged as a promising paradigm for reasoning, yet it imposes significant computational burdens. Existing efficiency methods primarily rely on local, per-trajectory signals and l...

cs-CL

Investigating Quantum Circuit Designs Using Neuro-Evolution (arxiv.org)

2026-02-04|paper|arXiv

Designing effective quantum circuits remains a central challenge in quantum computing, as circuit structure strongly influences expressivity, trainability, and hardware feasibility. Current approaches...

cs-NE cs-LG

aimet (github.com)

2026-02-04|tool|GitHub

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models....

quantization deep-learning compression open-source

MentisOculi: Revealing the Limits of Reasoning with Mental Imagery [TOP LAB](arxiv.org)

2026-02-03|paper|arXiv

Frontier models are transitioning from multimodal large language models (MLLMs) that merely ingest visual information to unified multimodal models (UMMs) capable of native interleaved generation. This...

cs-AI cs-CV cs-LG

Misconception Diagnosis From Student-Tutor Dialogue: Generate, Retrieve, Rerank [TOP LAB](arxiv.org)

2026-02-03|paper|arXiv

Timely and accurate identification of student misconceptions is key to improving learning outcomes and pre-empting the compounding of student errors. However, this task is highly dependent on the effo...

cs-CL cs-LG

Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning [TOP LAB](arxiv.org)

2026-02-03|paper|arXiv

Improving the reasoning capabilities of large language models (LLMs) typically relies either on the model's ability to sample a correct solution to be reinforced or on the existence of a stronger mode...

cs-LG cs-AI

Personalized Image Generation via Human-in-the-loop Bayesian Optimization [TOP LAB](arxiv.org)

2026-02-03|paper|arXiv

Imagine Alice has a specific image $x^\ast$ in her mind, say, the view of the street in which she grew up during her childhood. To generate that exact image, she guides a generative model with multipl...

cs-CV cs-LG

Reward-free Alignment for Conflicting Objectives (arxiv.org)

2026-02-03|paper|arXiv

Direct alignment methods are increasingly used to align large language models (LLMs) with human preferences. However, many real-world alignment problems involve multiple conflicting objectives, where ...

cs-CL cs-AI cs-LG

budoux (github.com)

2026-02-03|tool|GitHub

No description...

nlp machine-learning python javascript

Are you going to finish that? A Practical Study of the Tokenization Boundary Problem [TOP LAB](arxiv.org)

2026-02-02|paper|arXiv

Language models (LMs) are trained over sequences of tokens, whereas users interact with LMs via text. This mismatch gives rise to the partial token problem, which occurs when a user ends their prompt ...

cs-CL

WiFiPenTester: Advancing Wireless Ethical Hacking with Governed GenAI [TOP LAB](arxiv.org)

2026-02-02|paper|arXiv

Wireless ethical hacking relies heavily on skilled practitioners manually interpreting reconnaissance results and executing complex, time-sensitive sequences of commands to identify vulnerable targets...

cs-CR cs-AI

Chain-of-thought obfuscation learned from output supervision can generalise to unseen tasks [TOP LAB](arxiv.org)

2026-02-02|paper|arXiv

Chain-of-thought (CoT) reasoning provides a significant performance uplift to LLMs by enabling planning, exploration, and deliberation of their actions. CoT is also a powerful tool for monitoring the ...

cs-AI

The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity? [TOP LAB](arxiv.org)

2026-02-02|paper|arXiv

As AI becomes more capable, we entrust it with more general and consequential tasks. The risks from failure grow more severe with increasing task scope. It is therefore important to understand how ext...

cs-AI

VideoGPA: Distilling Geometry Priors for 3D-Consistent Video Generation (arxiv.org)

2026-02-02|paper|arXiv

While recent video diffusion models (VDMs) produce visually impressive results, they fundamentally struggle to maintain 3D structural consistency, often resulting in object deformation or spatial drif...

cs-CV cs-AI cs-LG

World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems [TOP LAB](arxiv.org)

2026-02-01|paper|arXiv

Frontier large language models (LLMs) excel as autonomous agents in many domains, yet they remain untested in complex enterprise systems where hidden workflows create cascading effects across intercon...

cs-AI cs-SE

EditYourself: Audio-Driven Generation and Manipulation of Talking Head Videos with Diffusion Transformers [TOP LAB](arxiv.org)

2026-02-01|paper|arXiv

Current generative video models excel at producing novel content from text and image prompts, but leave a critical gap in editing existing pre-recorded videos, where minor alterations to the spoken sc...

cs-CV cs-GR cs-LG

Investigating Associational Biases in Inter-Model Communication of Large Generative Models [TOP LAB](arxiv.org)

2026-02-01|paper|arXiv

Social bias in generative AI can manifest not only as performance disparities but also as associational bias, whereby models learn and reproduce stereotypical associations between concepts and demogra...

cs-CY cs-AI

CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty [TOP LAB](arxiv.org)

2026-02-01|paper|arXiv

Existing benchmarks for Large Language Model (LLM) agents focus on task completion under idealistic settings but overlook reliability in real-world, user-facing applications. In domains, such as in-ca...

cs-AI

RedSage: A Cybersecurity Generalist LLM (arxiv.org)

2026-02-01|paper|arXiv

Cybersecurity operations demand assistant LLMs that support diverse workflows without exposing sensitive data. Existing solutions either rely on proprietary APIs with privacy risks or on open models l...

cs-CR cs-AI cs-CL

causalml (github.com)

2026-02-01|tool|GitHub

Uplift modeling and causal inference with machine learning algorithms...

incubation machine-learning causal-inference uplift-modeling

DeepSpeed (github.com)

2026-02-01|tool|GitHub

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective....

deep-learning pytorch gpu machine-learning

SERA: Soft-Verified Efficient Repository Agents [TOP LAB](arxiv.org)

2026-01-30|paper|arXiv

Open-weight coding agents should hold a fundamental advantage over closed-source systems: they can be specialized to private codebases, encoding repository-specific information directly in their weigh...

cs-CL cs-LG cs-SE

Persona Prompting as a Lens on LLM Social Reasoning [TOP LAB](arxiv.org)

2026-01-30|paper|arXiv

For socially sensitive tasks like hate speech detection, the quality of explanations from Large Language Models (LLMs) is crucial for factors like user trust and model alignment. While Persona prompti...

cs-CL

Supervised Guidance Training for Infinite-Dimensional Diffusion Models [TOP LAB](arxiv.org)

2026-01-30|paper|arXiv

Score-based diffusion models have recently been extended to infinite-dimensional function spaces, with uses such as inverse problems arising from partial differential equations. In the Bayesian formul...

cs-LG

Agent Benchmarks Fail Public Sector Requirements [TOP LAB](arxiv.org)

2026-01-30|paper|arXiv

Deploying Large Language Model-based agents (LLM agents) in the public sector requires assuring that they meet the stringent legal, procedural, and structural requirements of public-sector institution...

cs-CY cs-AI

Evolutionary Strategies lead to Catastrophic Forgetting in LLMs (arxiv.org)

2026-01-30|paper|arXiv

One of the biggest missing capabilities in current AI systems is the ability to learn continuously after deployment. Implementing such continually learning systems have several challenges, one of whic...

cs-LG cs-AI cs-CL

World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems [TOP LAB](arxiv.org)

2026-01-30|paper|arXiv

cs-AI cs-SE

← Prev5 / 18Next →