merve's picture

merve PRO

merve

·

https://github.com/merveenoyan/smol-vision

AI & ML interests

I love this website VLMs, vision & co

Recent Activity

liked a model about 16 hours ago

Intellindust/DEIMv2_HGNetv2_N_COCO

upvoted an article 4 days ago

Can Your LLM Think Like a Professional? Introducing ProfBench

upvoted an article 4 days ago

NVIDIA Isaac GR00T in LeRobot

View all activity

Organizations

liked a model about 16 hours ago

Intellindust/DEIMv2_HGNetv2_N_COCO

3.63M • Updated 6 days ago • 7 • 1

liked a model 4 days ago

nvidia/NV-Reason-CXR-3B

Image-Text-to-Text • 4B • Updated 8 days ago • 178 • 9

liked 2 models 5 days ago

nvidia/nemoretriever-table-structure-v1

Object Detection • Updated 7 days ago • 22 • 8

nvidia/nemoretriever-page-elements-v3

Object Detection • Updated 7 days ago • 27 • 13

liked 6 models 7 days ago

mlx-community/granite-4.0-1b-base-4bit

Text Generation • 0.3B • Updated 7 days ago • 54 • 1

mlx-community/granite-4.0-1b-base-5bit

Text Generation • 0.3B • Updated 7 days ago • 46 • 1

lovis93/next-scene-qwen-image-lora-2509

Image-to-Image • Updated 14 days ago • 36.8k • 434

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 1 day ago • 28k • 1.2k

meituan-longcat/LongCat-Video

Text-to-Video • Updated 6 days ago • 1.26k • • 274

valiantcat/Qwen-Image-Edit-MeiTu

Image-to-Image • 20B • Updated 8 days ago • 10.5k • 127

liked 5 Spaces 7 days ago

Qwen Atari

Play Atari games using a vision-language model

LongCat Video

Generate videos from text or images

Qwen Image Edit Next Scene

Fast 4 step inference with Qwen Image Edit 2509

Nanonets-OCR2-3B

Extract text from document images

Qwen3 VL Demo

An interactive demo for the Qwen3-VL family models.

liked 2 models 7 days ago

Marvis-AI/marvis-tts-250m-v0.1

Text-to-Audio • Updated Aug 26 • 1.42k • 67

Marvis-AI/marvis-tts-250m-v0.1-transformers

Text-to-Audio • 0.8B • Updated Sep 4 • 1.02k • 21

liked a Space 7 days ago

Marvis TTS 250M

Demo of Marvis-TTS

liked a Space 9 days ago

FLUX.1 Kontext LoRA the Explorer

edit images with Kontext and LoRAs

liked a Space 11 days ago

DeepSeek OCR Demo

Try out DeepSeek-OCR