Omar Sanseviero

osanseviero

https://osanseviero.github.io/hackerllama/

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML. 🦙

Recent Activity

new activity 20 days ago

google/medgemma-4b-it:Fix model parameter count

liked a model about 1 month ago

Qwen/Qwen3-4B-SafeRL

liked a Space about 2 months ago

multimodalart/nano-banana

upvoted a paper about 2 months ago

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24 • 39

upvoted 2 articles 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11 • 162

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4 • 256

upvoted a collection 2 months ago

EmbeddingGemma

3 items • Updated Sep 11 • 98

upvoted a collection 4 months ago

T5Gemma

32 items • Updated Jul 10 • 73

upvoted an article 5 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

Jun 26 • 120

upvoted a paper 5 months ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 37

upvoted a changelog 5 months ago

Changelog

New Inference Providers Dashboard

Jun 5 • 65

upvoted a collection 6 months ago

GRMR V3 Models

An improved set of models for grammar correction. (Chat template should work, no "responding as an LLM" anymore, that kind of stuff). • 6 items • Updated Jun 4 • 10

upvoted a paper 6 months ago

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published May 23 • 59

upvoted an article 6 months ago

Article

The Transformers Library: standardizing model definitions

May 15 • 120

upvoted 2 collections 6 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11 • 344

Gemma 3n Preview

4 items • Updated Jul 10 • 187

upvoted an article 7 months ago

Article

17 Reasons Why Gradio Isn't Just Another UI Library

Apr 16 • 42

upvoted a collection 7 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory. • 29 items • Updated Aug 14 • 32

upvoted a paper 7 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 301

upvoted an article 7 months ago

Article

The Large Language Model Course

Jan 16 • 209

upvoted a collection 8 months ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 210

upvoted an article 8 months ago

Article

Custom Vibe Coding Quest Part 2: 🚙 Fine-Tuning Gemma 3 for Code Reasoning

Apr 1 • 25

upvoted a paper 8 months ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 54