HangGuo/Llama2-70B-QuaRot-OBR-GPTQ-W4A4KV4S50
Text Generation
•
Updated
•
22
•
1
The model collection of paper: Optimal Brain Restoration for Joint Sparsification and Quantization of LLMs. Github: https://github.com/csguoh/OBR