Everything, neatly stacked by month.

Archive

Jump via the sidebar, or scroll.


2026-04 (32)

Exploring the Transformer Series (30) --- Decoding Speculation 2026-04-11 Exploring the Transformer Series (29) --- DeepSeek MoE 2026-04-11 Exploring the Transformer Series (33) --- DeepSeek MTP 2026-04-11 Exploring the Transformer Series (35) --- Fundamentals of Large Model Quantization 2026-04-11 Exploring the Transformer Series (36) --- Large Model Quantization Scheme 2026-04-11 Exploring the Transformer Series (32) --- Lookahead Decoding 2026-04-11 Exploring the Transformer Series (31) --- Medusa 2026-04-11 Exploring the Transformer Series (34) --- Quantitative Fundamentals 2026-04-11 Exploring the Transformer Series (28) --- DeepSeek MLA 2026-04-09 Exploring the Transformer Series (25) --- KV Cache Optimization for Handling Long Text Sequences 2026-04-09 Exploring the Transformer Series (26) --- KV Cache Optimization: PD Separation or Merging 2026-04-09 Exploring the Transformer Series (24) --- KV Cache Optimization 2026-04-09 Exploring the Transformer Series (23) --- Length Extrapolation 2026-04-09 Exploring the Transformer Series (27) --- MQA & GQA 2026-04-09 Exploring the Transformer Series (22) --- LoRA 2026-04-08 Exploring the Transformer Series (19) --- FlashAttention V2 and its Upgrade 2026-04-07 Exploring the Transformer Series (18) --- FlashAttention 2026-04-07 Exploring the Transformer Series (20) --- KV Cache 2026-04-07 Exploring the Transformer Series (21) --- MoE 2026-04-07 Exploring the Transformer Series (14) --- Residual Networks and Normalization 2026-04-05 Exploring the Transformer Series (16) --- Resource Consumption 2026-04-05 Exploring the Transformer Series (17) --- RoPE 2026-04-05 Exploring the Transformer Series (15) --- Sampling and Output 2026-04-05 Exploring the Transformer Series (13) --- FFN 2026-04-04 Exploring the Transformer Series (11) --- Mask 2026-04-03 Exploring the Transformer Series (12) --- Multi-head Self-Attention 2026-04-03 Exploring the Transformer Series (9) --- Location Encoding Classification 2026-04-02 Exploring the Transformer Series (10) --- Self-Attention 2026-04-02 Exploring the Transformer Series (7) --- Embedding 2026-04-01 Exploring the Transformer Series (8) --- Position Encoding 2026-04-01 Exploring the Transformer Series (6) --- token 2026-04-01 Exploring the Transformer Series (5) --- Training & Reasoning 2026-04-01

2026-03 (18)

Exploring the Transformer Series (3) --- Data Processing 2026-03-31 Exploring the Transformer Series (4) --- Encoder & Decoder 2026-03-31 Exploring the Transformer Series (2) --- Overall Architecture 2026-03-31 Exploring the AI ​​Agent Framework: Deconstructing OpenHands (7) --- Agent 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (9) --- AgentController 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (8) --- CodeActAgent 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (6) --- Event System 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (12) --- Function call 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (5) --- Interaction & Conversation 2026-03-27 Exploring the AI Agent Framework: Deconstructing OpenHands (11) --- Key Runtime Components 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (13) --- Memory 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (14) --- Microagents 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (10) --- Runtime 2026-03-27 Exploring the Transformer Series (1): Attention Mechanism 2026-03-27 Exploring AI Agent Frameworks: Deconstructing OpenHands (2) --- CodeAct Paper 2026-03-26 Exploring the AI Agent Framework: Deconstructing OpenHands (4) --- Services 2026-03-26 Exploring the AI Agent Framework: Deconstructing OpenHands (3) --- Startup 2026-03-26 Exploring AI Agent Frameworks: Deconstructing OpenHands (1) --- Core Concepts 2026-03-25