Oliver Guhr

oliverguhr

https://www.impact-labs.ai

AI & ML interests

Voice Interfaces, Robotics, Deep Learning

Recent Activity

liked a model about 12 hours ago

maya-research/maya1

liked a model 3 days ago

nineninesix/kani-tts-370m

liked a model 4 days ago

canopylabs/3b-de-ft-research_release

Organizations

upvoted a paper 24 days ago

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

Paper • 2510.13996 • Published 26 days ago • 6

upvoted a paper 5 months ago

GeistBERT: Breathing Life into German NLP

Paper • 2506.11903 • Published Jun 13 • 4

upvoted a collection 7 months ago

Gemma 3 QAT

Collection

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 209

upvoted an article 7 months ago

EuroLLM-9B

Article • and 5 others • Dec 2, 2024 • 137

upvoted an article 9 months ago

Open-R1: Update #1

Article • and 7 others • Feb 2 • 305

upvoted a paper about 1 year ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 53

upvoted a paper over 1 year ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

upvoted 2 collections over 1 year ago

Granite 2.0 Code Models

Collection

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 9 days ago • 201

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 866

upvoted 3 papers over 1 year ago

upvoted 2 papers almost 2 years ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 146

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

upvoted a paper about 2 years ago

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83

upvoted a paper over 2 years ago

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 89
