- Article: "KV Caching Explained: Optimizing Transformer Inference Efficiency" by not-lain (Jan 30)
- Paper: "Training Dynamics Impact Post-Training Quantization Robustness" (2510.06213, published Oct 7)
- Article: "Prefill and Decode for Concurrent Requests - Optimizing LLM Performance" by tngtech (Apr 16)
- Collection: 🧠 SmolLM3, "Smol, multilingual, long-context reasoner" (14 items, updated Oct 9)
- Article: "Unlocking Longer Generation with Key-Value Cache Quantization" (May 16, 2024)