Pham Minh Tuan's picture

Pham Minh Tuan

1TuanPham

·

vTuanPham

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

builddotai/Egocentric-10K

liked a model 1 day ago

autoweeb/Qwen-Image-Edit-2509-Photo-to-Anime

liked a model 1 day ago

PleIAs/Baguettotron

View all activity

Organizations

upvoted a collection 13 days ago

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated 14 days ago • 56

upvoted a paper 20 days ago

Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery

Paper • 2510.15869 • Published 26 days ago • 44

upvoted a collection 2 months ago

Qwen3-Next

4 items • Updated Sep 22 • 152

upvoted a collection 3 months ago

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 377

upvoted a collection 5 months ago

Gemma 3n

4 items • Updated Jul 10 • 237

upvoted a collection 6 months ago

Perception Encoder

17 items • Updated Jul 11 • 69

upvoted 2 collections 7 months ago

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 2 days ago • 58

HiDream-I1

A collections of HiDream-I1 models. • 4 items • Updated Apr 8 • 32

upvoted a paper 8 months ago

Gemini Robotics: Bringing AI into the Physical World

Paper • 2503.20020 • Published Mar 25 • 29

upvoted 2 collections 8 months ago

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 97

Gemma 3 Release

28 items • Updated Aug 11 • 533

upvoted a collection 9 months ago

PaliGemma 2 Mix

13 items • Updated Jul 10 • 62

upvoted a paper 9 months ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 55

upvoted 2 collections 10 months ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Jul 21 • 125

Cosmos

The collection of Cosmos models • 31 items • Updated 2 days ago • 298

upvoted a collection 11 months ago

TimesFM Release

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 6 items • Updated Oct 4 • 26

upvoted a paper 11 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

upvoted 3 collections 11 months ago

[MASK] is All You Need

Code, dataset, and pretrained model • 6 items • Updated Feb 6 • 9

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 11 items • Updated Jul 7 • 119

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 187