9

Thierry Herrmann

thierryh

AI & ML interests

deep learning, machine learning

Organizations

None yet

upvoted 2 articles 4 months ago

Article

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

Mar 18, 2024

•

13

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024

•

111

upvoted 3 articles 8 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

•

222

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

•

262

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26

•

177

upvoted 2 articles 9 months ago

Article

Faster Text Generation with Self-Speculative Decoding

Nov 20, 2024

•

63

Article

Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo

Dec 23, 2024

•

51

upvoted 2 articles 10 months ago

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

•

256

Article

From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease

Oct 21, 2022

•

42