72 articles tagged "cs-CL" — page 3 of 3
Revisiting Generalization Across Difficulty Levels: It's Not So Easy (arxiv.org)
|paper|arXiv
We investigate how well large language models (LLMs) generalize across different task difficulties, a key question for effective data curation and evaluation. Existing research is mixed regarding whet...
Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining [TOP LAB] (arxiv.org)
|paper|arXiv
Incorporating metadata into Large Language Model (LLM) pretraining has recently emerged as a promising approach to accelerate training. However, prior work highlighted only one useful signal, URLs, leav...