Open R1: Update #3
Over the last few weeks, we have focused on reproducing the competitive programming (code reasoning) aspects of the DeepSeek-R1 recipe.
In this post, we are excited to share:
- The construction of CodeForces-CoTs: a dataset of nearly 100k high-quality samples, distilled from R1, with solutions in C++ and Python.
- The IOI benchmark: a new benchmark of challenging problems from the 2024 International Olympiad in Informatics (IOI).
- OlympicCoder: two fine-tuned code models, at 7B and 32B parameters, that outperform closed-source frontier models like Claude 3.7 Sonnet on IOI problems.
Here’s an overview of how the