Toward Large Reasoning Models, DeepSeek-R1, Kimi k1.5 and scaling RL, Mind Evolution for search in planning, improving Process Reward Models in math reasoning.
Share this post
AI Research Review 25.01.24 - Reasoning
Share this post
Toward Large Reasoning Models, DeepSeek-R1, Kimi k1.5 and scaling RL, Mind Evolution for search in planning, improving Process Reward Models in math reasoning.