AI knowledge distillation: The key to DeepSeek’s refinement?

Paris Tung, Associate (London)

Post feature

We review a paper entitled ‘DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning’, written by DeepSeek AI on its GitHub and archived on arXiv on 22nd January 2025. The paper has made a splash among data users who cover relevant tech stocks, build AI models or procure sentiment data.