beuke.org

A personal blog about computer science topics.

Quantization
A Key Technique for GPT Model Compression and Efficiency
Posted on Apr 6 2023 ~ 7 min read
#artificial intelligence  #GPT  #LLM