李理的博客

翻译：DeepSeek-R1: Advancing LLM Reasoning with Reinforcement Learning

本文翻译 DeepSeek-R1: Advancing LLM Reasoning with Reinforcement Learning。

Posted by lili on June 22, 2025

翻译：DeepSeek Explained 6: All you need to know about Reinforcement Learning in LLM training

本文翻译 DeepSeek Explained 6: All you need to know about Reinforcement Learning in LLM training。

Posted by lili on June 21, 2025

翻译：DeepSeek Explained 5: DeepSeek-V3-Base

本文翻译 DeepSeek Explained 5: DeepSeek-V3-Base。

Posted by lili on June 20, 2025

翻译：DeepSeek Explained 4: Multi-Token Prediction

本文翻译 DeepSeek Explained 4: Multi-Token Prediction。

Posted by lili on June 20, 2025

Multi-head Latent Attention代码分析

本文解释MLA的代码。

Posted by lili on June 19, 2025

翻译：DeepSeek-V3 Explained 3: Auxiliary-Loss-Free Load Balancing

本文翻译 DeepSeek-V3 Explained 3: Auxiliary-Loss-Free Load Balancing。

Posted by lili on June 18, 2025

翻译：DeepSeek-V3 Explained 2: DeepSeekMoE

本文翻译 DeepSeek-V3 Explained 2: DeepSeekMoE。

Posted by lili on June 18, 2025

RoPE代码分析

本文介绍RoPE的不同代码实现。

Posted by lili on June 13, 2025

翻译：DeepSeek-V3 Explained 1: Multi-head Latent Attention

本文翻译 DeepSeek-V3 Explained 1: Multi-head Latent Attention。

Posted by lili on June 1, 2025

翻译：The Llama 3 Herd of Models

本文分析阅读The Llama 3 Herd of Models。

Posted by lili on July 30, 2024

FEATURED TAGS

人工智能深度学习 chatbot PyTorch Java BERT git 编程 OCR 汪曾祺语音识别 Kaldi Linux XLNet 情感分析 sentiment analysis 语法纠错 Transformer Tensorflow Huggingface Ubuntu TensorFlow 深度学习框架 Tensor2Tensor 机器翻译微信 wechat automation selenium webdriver pywinauto CentOS GPU Appium t2t 代码阅读中英翻译公众号爬虫 ocr tesseract pytesseract python 默认参数位置参数 VPN JSON Jackson huggingface RoPE PagedAttention vLLM Pre-training LLM CPT weather forecasting graph neural networks qlora quantization transformers cmake pip pipenv conda padding vscode debug source code build deep learning Speech ASR linux pytorch c++ extension Deep Learning DeepSeek Attention MoE cs336 bpe tokenizer

ABOUT ME

读读论文，写写代码。

FRIENDS

Li Li