papers - a slothCreepTree Collection

slothCreepTree 's Collections

papers

papers

updated 22 days ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 41
SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM

Paper • 2312.03788 • Published Dec 6, 2023 • 1
FlatQuant: Flatness Matters for LLM Quantization

Paper • 2410.09426 • Published Oct 12, 2024 • 16
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving

Paper • 2501.01005 • Published Jan 2 • 1
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29 • 44