本文翻译DeepSeek Explained 6: All you need to know about Reinforcement Learning in LLM training。
Posted by lili on June 21, 2025
Posted by lili on June 20, 2025
Posted by lili on June 20, 2025
本文解释MLA的代码。
Posted by lili on June 19, 2025
Posted by lili on June 18, 2025
Posted by lili on June 18, 2025
本文介绍RoPE的不同代码实现。
Posted by lili on June 13, 2025
Posted by lili on June 1, 2025
本文分析阅读The Llama 3 Herd of Models。
Posted by lili on July 30, 2024
本文分析阅读Huggingface Whisper的代码。
Posted by lili on May 31, 2024