146 articles tagged "cs-AI" — page 4 of 5
Revisiting Generalization Across Difficulty Levels: It's Not So Easy(arxiv.org)
|paper|arXiv
We investigate how well large language models (LLMs) generalize across different task difficulties, a key question for effective data curation and evaluation. Existing research is mixed regarding whet...