SqueezeLLM is a post-training quantization framework that incorporates a new method called Dense-and-Sparse Quantization to enable efficient LLM serving. 🔥 📌 Dense-and-Sparse splits weight matrices into two components: A dense component that can be heavily quantized without… https://t.co/IQBfB9vpjR https://t.co/WrGvbhVBi3
— Rohan Paul (@rohanpaul_ai) Dec 29, 2023
from Twitter https://twitter.com/rohanpaul_ai
December 29, 2023 at 03:05PM
via IFTTT
No comments:
Post a Comment