Open R1: Update #3

image/png

Over the last few weeks, we have focused our efforts on reproducing the competitive programming (code reasoning) aspects of the DeepSeek-R1 recipe.

In this post, we are excited to share:

  • The construction of CodeForces-CoTs: a dataset of nearly 100k high-quality samples distilled from R1 to produce solutions in C++ and Python.
  • The IOI benchmark: a new benchmark of challenging problems from the 2024 International Olympiad in Informatics (IOI).
  • OlympicCoder: two fine-tuned 7B and 32B code models that outperform closed-source frontier models like Claude 3.7 Sonnet on IOI problems.

Here’s an overview of how the

 

 

 

To finish reading, please visit source site