
oh sehun

sehun

AI & ML interests

None yet

Recent Activity



upvoted 2 papers 2 days ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 6 days ago • 188

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published 5 days ago • 107

upvoted a paper 4 days ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published 8 days ago • 56

liked a Space 4 days ago

Granite 4.0 Nano WebGPU

🛠

In-browser tool calling with IBM Granite-4.0

upvoted a paper 5 days ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published 8 days ago • 81

liked 2 models 5 days ago

KORMo-Team/KORMo-10B-sft

Text Generation • 11B • Updated about 8 hours ago • 2.75k • 112

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

Image-Text-to-Text • 13B • Updated 5 days ago • 3.3k • 39

upvoted an article 5 days ago

Granite 4.0 Nano: Just how small can you go?

Article • and 1 other • 7 days ago • 96

upvoted a paper 7 days ago

ReCode: Unify Plan and Action for Universal Granularity Control

Paper • 2510.23564 • Published 8 days ago • 117

liked a model 7 days ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated 6 days ago • 810k • 1.02k

upvoted a paper 8 days ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published 19 days ago • 44

liked a model 8 days ago

lightonai/LightOnOCR-1B-1025

Image-to-Text • Updated 16 minutes ago • 11.4k • 137

upvoted a paper 8 days ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 13 days ago • 108

liked a model 12 days ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated about 8 hours ago • 2.25M • 2.43k

upvoted 2 papers 12 days ago

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published 13 days ago • 26

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published 13 days ago • 59

upvoted an article 13 days ago

Supercharge your OCR Pipelines with Open Models

Article • 14 days ago • 221

upvoted a paper 14 days ago

LLM-guided Hierarchical Retrieval

Paper • 2510.13217 • Published 20 days ago • 16

upvoted 2 articles 17 days ago

Benchmarking Language Model Performance on 5th Gen Xeon at GCP

Article • Dec 17, 2024 • 7

Welcome GPT OSS, the new open-source model family from OpenAI!

Article • Aug 5 • 503
