2026  2

February  1

Curriculum Pretraining Enables 10-Digit Addition for a 296-Parameter GPT with 99% Accuracy

January  1

Looped-GPT: Looping During Pre-training improves Generalization