Jun 01, 2026 A Model Trained on 200M Samples Still Collapses — And One Constant Fixes It
Feb 05, 2026 The Surprising Truth About Embeddings Trained with RLVR