Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zucco 's Collections
SSL
VQ
Better LLM datasets
Efficient
MoE
Speed
Transformers
ViT
RAG
Transfer
LLM
Agents

Speed

updated Dec 21, 2023
Upvote
-

  • LLM in a flash: Efficient Large Language Model Inference with Limited Memory

    Paper • 2312.11514 • Published Dec 12, 2023 • 260

  • PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

    Paper • 2312.12456 • Published Dec 16, 2023 • 44
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs