>
178 articles tagged "cs-CV" — page 2 of 6
Reinforced Attention Learning(arxiv.org)
|paper|arXiv

Post-training with Reinforcement Learning (RL) has substantially improved reasoning in Large Language Models (LLMs) via test-time scaling. However, extending this paradigm to Multimodal LLMs (MLLMs) t...

STEP3-VL-10B Technical Report [TOP LAB](arxiv.org)
|paper|arXiv

We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. STEP3-VL-10B is realized t...