RLP: Reinforcement as a Pretraining Objective Paper β’ 2510.01265 β’ Published Sep 26, 2025 β’ 40 β’ 4
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper β’ 2504.07096 β’ Published Apr 9, 2025 β’ 76 β’ 3