Archive
Jump via the sidebar, or scroll.
2026-04 (32)
Exploring the Transformer Series (30) --- Decoding Speculation Exploring the Transformer Series (29) --- DeepSeek MoE Exploring the Transformer Series (33) --- DeepSeek MTP Exploring the Transformer Series (35) --- Fundamentals of Large Model Quantization Exploring the Transformer Series (36) --- Large Model Quantization Scheme Exploring the Transformer Series (32) --- Lookahead Decoding Exploring the Transformer Series (31) --- Medusa Exploring the Transformer Series (34) --- Quantitative Fundamentals Exploring the Transformer Series (28) --- DeepSeek MLA Exploring the Transformer Series (25) --- KV Cache Optimization for Handling Long Text Sequences Exploring the Transformer Series (26) --- KV Cache Optimization: PD Separation or Merging Exploring the Transformer Series (24) --- KV Cache Optimization Exploring the Transformer Series (23) --- Length Extrapolation Exploring the Transformer Series (27) --- MQA & GQA Exploring the Transformer Series (22) --- LoRA Exploring the Transformer Series (19) --- FlashAttention V2 and its Upgrade Exploring the Transformer Series (18) --- FlashAttention Exploring the Transformer Series (20) --- KV Cache Exploring the Transformer Series (21) --- MoE Exploring the Transformer Series (14) --- Residual Networks and Normalization Exploring the Transformer Series (16) --- Resource Consumption Exploring the Transformer Series (17) --- RoPE Exploring the Transformer Series (15) --- Sampling and Output Exploring the Transformer Series (13) --- FFN Exploring the Transformer Series (11) --- Mask Exploring the Transformer Series (12) --- Multi-head Self-Attention Exploring the Transformer Series (9) --- Location Encoding Classification Exploring the Transformer Series (10) --- Self-Attention Exploring the Transformer Series (7) --- Embedding Exploring the Transformer Series (8) --- Position Encoding Exploring the Transformer Series (6) --- token Exploring the Transformer Series (5) --- Training & Reasoning
2026-03 (18)
Exploring the Transformer Series (3) --- Data Processing Exploring the Transformer Series (4) --- Encoder & Decoder Exploring the Transformer Series (2) --- Overall Architecture Exploring the AI Agent Framework: Deconstructing OpenHands (7) --- Agent Exploring AI Agent Frameworks: Deconstructing OpenHands (9) --- AgentController Exploring AI Agent Frameworks: Deconstructing OpenHands (8) --- CodeActAgent Exploring AI Agent Frameworks: Deconstructing OpenHands (6) --- Event System Exploring AI Agent Frameworks: Deconstructing OpenHands (12) --- Function call Exploring AI Agent Frameworks: Deconstructing OpenHands (5) --- Interaction & Conversation Exploring the AI Agent Framework: Deconstructing OpenHands (11) --- Key Runtime Components Exploring AI Agent Frameworks: Deconstructing OpenHands (13) --- Memory Exploring AI Agent Frameworks: Deconstructing OpenHands (14) --- Microagents Exploring AI Agent Frameworks: Deconstructing OpenHands (10) --- Runtime Exploring the Transformer Series (1): Attention Mechanism Exploring AI Agent Frameworks: Deconstructing OpenHands (2) --- CodeAct Paper Exploring the AI Agent Framework: Deconstructing OpenHands (4) --- Services Exploring the AI Agent Framework: Deconstructing OpenHands (3) --- Startup Exploring AI Agent Frameworks: Deconstructing OpenHands (1) --- Core Concepts