Exploring the Transformer Series (35) --- Fundamentals of Large Model Quantization
Large model quantization fundamentals: outliers, superweights, massive activations, PTQ, QAT, and common quantization strategies.
Large model quantization fundamentals: outliers, superweights, massive activations, PTQ, QAT, and common quantization strategies.