The AI Wire

513 articles tagged "c" — page 4 of 18

R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging [TOP LAB](arxiv.org)

2026-02-09|paper|arXiv

Reinforcement Learning from Human Feedback (RLHF) remains indispensable for aligning large language models (LLMs) in subjective domains. To enhance robustness, recent work shifts toward Generative Rew...

cs-CL

Clinical-Prior Guided Multi-Modal Learning with Latent Attention Pooling for Gait-Based Scoliosis Screening [TOP LAB](arxiv.org)

2026-02-09|paper|arXiv

Adolescent Idiopathic Scoliosis (AIS) is a prevalent spinal deformity whose progression can be mitigated through early detection. Conventional screening methods are often subjective, difficult to scal...

cs-CV

MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images (arxiv.org)

2026-02-09|paper|arXiv

Multimodal large language models (MLLMs) have rapidly advanced, yet their adoption in medicine remains limited by gaps in domain coverage, modality alignment, and grounded reasoning. In this work, we ...

cs-CV

Claude Opus 4.6 (Anthropic)(anthropic.com)

2026-02-06|model|Anthropic

First Opus-class model with 1M token context window (beta), adaptive thinking with effort levels, and context compaction for sustained agentic tasks. Agent teams feature enables parallel subtask execution.

llm frontier agentic long-context

Gemini 3 Flash (Google DeepMind)(blog.google)

2026-02-06|model|Google DeepMind

Achieves Gemini 3 Pro-class reasoning at Flash-tier latency and cost. Outperforms 2.5 Pro while being 3x faster at less than 1/4 the cost of 3 Pro. 1M token context, 65K output tokens.

llm frontier multimodal fast

GPT-oss-120B / GPT-oss-20B (OpenAI)(github.com)

2026-02-06|model|GitHub / HuggingFace

OpenAI's first open-weight LLMs since GPT-2 (2019). Apache 2.0 license. Trained with RL and distillation from o3 and frontier internal models. GPT-oss-120B runs on single 80GB GPU; 20B runs on 16GB edge devices.

open-source apache-2 reasoning edge

Google Antigravity IDE (developers.googleblog.com)

2026-02-06|tool|Google

- Two interfaces: Editor View (synchronous coding) and Manager View (orchestrate parallel agents across workspaces)

ide agentic google free

Claude Cowork Plugins (Open Source)(anthropic.com)

2026-02-06|tool|Anthropic

- 11 open-source plugins bundling skills, connectors, slash commands, sub-agents

anthropic plugins open-source productivity

Kimi K2.5 (Moonshot AI)(kimi.com)

2026-02-06|model|HuggingFace / Moonshot AI

Native multimodal model trained on 15T tokens mixing visual and textual data from the start. Agent Swarm technology coordinates up to 100 specialized agents simultaneously, reducing execution time by 4.5x for complex workflows.

llm open-source multimodal moe

NVIDIA Cosmos Reason 2 / Isaac GR00T N1.6 (nvidianews.nvidia.com)

2026-02-06|model|NVIDIA (HuggingFace)

Cosmos Reason 2 is an open reasoning VLM enabling machines to see, understand, and act in the physical world. GR00T N1.6 is a vision-language-action (VLA) model for humanoid robots integrating egocentric camera streams, robot states, and language instructions into a unified policy.

robotics physical-ai nvidia open-source

Qwen3-Max-Thinking (Alibaba)(qwen.ai)

2026-02-06|model|Alibaba Cloud / Qwen

Flagship reasoning model with adaptive tool-use -- intelligently invokes retrieval and code interpreter on demand during inference. Advanced test-time scaling via RL.

llm reasoning trillion-param tool-use

SkyReels V3 (Skywork AI)(github.com)

2026-02-06|model|GitHub / HuggingFace

First open-source model supporting three video generation modes in one architecture: multi-subject reference image-to-video, audio-driven avatar generation, and video-to-video editing. Intelligent shot-switching for minute-level durations.

open-source video-gen multimodal audio-driven

Fundamental NEXUS (Fundamental)(fundamental.tech)

2026-02-06|model|Fundamental (startup)

Creates a new model category: the "Large Tabular Model" (LTM). Trained on billions of tabular datasets to natively understand non-linear relationships in structured data, bypassing traditional ETL pipelines.

new-category tabular enterprise structured-data

Polymathic AI Walrus & AION-1 (polymathic-ai.org)

2026-02-06|model|HuggingFace / GitHub

Foundation models trained on physics data, not text. Walrus learns across 19 fluid dynamics scenarios and 63 physical fields. AION-1 integrates 39 data modalities from astronomical surveys (200M+ observations, ~100TB data).

science foundation-model physics astronomy

Block Goose (github.com)

2026-02-06|tool|GitHub (Apache 2.0)

- Runs entirely local, works with any LLM

agent open-source mcp local

FastMCP 2.0 (github.com)

2026-02-06|tool|GitHub (jlowin / now in official MCP Python SDK)

- "FastAPI of MCP" -- decorator-based server building

mcp python framework infrastructure

Context7 (github.com)

2026-02-06|tool|GitHub (Upstash)

- Fetches current, version-specific documentation in real-time

mcp documentation code-accuracy developer-tools

Sarvam Vision (Sarvam AI)(sarvam.ai)

2026-02-06|model|Sarvam AI

Multilingual document intelligence model supporting all 22 official Indian languages with OCR, visual language understanding, and semantic document parsing. Uses state-space architecture rather than transformer.

multimodal ocr multilingual document-intelligence

Pseudo-Invertible Neural Networks [TOP LAB](arxiv.org)

2026-02-06|paper|arXiv

The Moore-Penrose Pseudo-inverse (PInv) serves as the fundamental solution for linear systems. In this paper, we propose a natural generalization of PInv to the nonlinear regime in general and to neur...

cs-LG cs-CV

A Systematic Evaluation of Large Language Models for PTSD Severity Estimation: The Role of Contextual Knowledge and Modeling Strategies [TOP LAB](arxiv.org)

2026-02-06|paper|arXiv

Large language models (LLMs) are increasingly being used in a zero-shot fashion to assess mental health conditions, yet we have limited knowledge on what factors affect their accuracy. In this study, ...

cs-CL

Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps [TOP LAB](arxiv.org)

2026-02-06|paper|arXiv

Flow and diffusion models produce high-quality samples, but adapting them to user preferences or constraints post-training remains costly and brittle, a challenge commonly called reward alignment. We ...

cs-LG cs-AI

Shared LoRA Subspaces for almost Strict Continual Learning (arxiv.org)

2026-02-06|paper|arXiv

Adapting large pretrained models to new tasks efficiently and continually is crucial for real-world deployment but remains challenging due to catastrophic forgetting and the high cost of retraining. W...

cs-LG cs-AI cs-CV

Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning (arxiv.org)

2026-02-06|paper|arXiv

Multi-image spatial reasoning remains challenging for current multimodal large language models (MLLMs). While single-view perception is inherently 2D, reasoning over multiple views requires building a...

cs-CV

Continue v2026 (github.com)

2026-02-06|tool|GitHub (20K+ stars)

- Model-agnostic (any LLM -- local or cloud)

open-source coding model-agnostic enterprise

EvalAI (github.com)

2026-02-06|tool|GitHub

- Evaluating state of the art in AI

ai machine-learning django angularjs

Fluid Representations in Reasoning Models [TOP LAB](arxiv.org)

2026-02-05|paper|arXiv

Reasoning language models, which generate long chains of thought, dramatically outperform non-reasoning language models on abstract problems. However, the internal model mechanisms that allow this sup...

cs-AI

Beyond Rewards in Reinforcement Learning for Cyber Defence [TOP LAB](arxiv.org)

2026-02-05|paper|arXiv

Recent years have seen an explosion of interest in autonomous cyber defence agents trained to defend computer networks using deep reinforcement learning. These agents are typically trained in cyber gy...

cs-LG cs-AI

Reinforced Attention Learning (arxiv.org)

2026-02-05|paper|arXiv

Post-training with Reinforcement Learning (RL) has substantially improved reasoning in Large Language Models (LLMs) via test-time scaling. However, extending this paradigm to Multimodal LLMs (MLLMs) t...

cs-CL cs-CV cs-LG

Protein Autoregressive Modeling via Multiscale Structure Generation (arxiv.org)

2026-02-05|paper|arXiv

We present protein autoregressive modeling (PAR), the first multi-scale autoregressive framework for protein backbone generation via coarse-to-fine next-scale prediction. Using the hierarchical nature...

cs-LG cs-AI q-bio-BM

Contrastive Continual Learning for Model Adaptability in Internet of Things (arxiv.org)

2026-02-05|paper|arXiv

Internet of Things (IoT) deployments operate in nonstationary, dynamic environments where factors such as sensor drift, evolving user behavior, and heterogeneous user privacy requirements can affect a...

cs-LG cs-AI

← Prev4 / 18Next →