Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhewei Yao's picture
3 3

Zhewei Yao

zheweiyao
21world's profile picture
·

AI & ML interests

None yet

Organizations

Stable Diffusion AI Art's profile picture Inferinite.AI's profile picture

authored a paper about 1 year ago

SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation

Paper • 2410.03960 • Published Oct 4, 2024 • 2
authored 2 papers almost 2 years ago

FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design

Paper • 2401.14112 • Published Jan 25, 2024 • 20

ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks

Paper • 2312.08583 • Published Dec 14, 2023 • 11
authored a paper about 2 years ago

DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention

Paper • 2309.14327 • Published Sep 25, 2023 • 22
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs