Submitted by Philippe Bich
Papers
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding