deepseek-r1: incentivizing reasoning capability in llms via reinforcement learning

deepseek v3 open source

deepseek meme gif