本文翻译DeepSeek-R1: Advancing LLM Reasoning with Reinforcement Learning。
Posted by lili on June 22, 2025
Posted by lili on June 21, 2025
Posted by lili on June 20, 2025
Posted by lili on June 20, 2025
本文解释MLA的代码。
Posted by lili on June 19, 2025
Posted by lili on June 18, 2025
Posted by lili on June 18, 2025
本文介绍RoPE的不同代码实现。
Posted by lili on June 13, 2025
Posted by lili on June 1, 2025
本文分析阅读The Llama 3 Herd of Models。
Posted by lili on July 30, 2024