2 247 77

oh sehun

sehun

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 minutes ago

Diffusion Language Models are Super Data Learners

upvoted a paper 6 days ago

Scaling Latent Reasoning via Looped Language Models

upvoted a paper 6 days ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

View all activity

Organizations

upvoted a paper 21 minutes ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 3 days ago • 90

upvoted 2 papers 6 days ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 9 days ago • 202

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published 9 days ago • 113

upvoted a paper 7 days ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published 11 days ago • 56

upvoted a paper 8 days ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published 12 days ago • 83

upvoted an article 9 days ago

Article

Granite 4.0 Nano: Just how small can you go?

and 1 other •

11 days ago

• 106

upvoted 2 papers 11 days ago

ReCode: Unify Plan and Action for Universal Granularity Control

Paper • 2510.23564 • Published 11 days ago • 118

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published 22 days ago • 45

upvoted a paper 12 days ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 17 days ago • 110

upvoted 2 papers 16 days ago

Unified Reinforcement and Imitation Learning for Vision-Language Models

Paper • 2510.19307 • Published 17 days ago • 26

LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published 17 days ago • 59

upvoted an article 17 days ago

Article

Supercharge your OCR Pipelines with Open Models

18 days ago

• 225

upvoted a paper 18 days ago

LLM-guided Hierarchical Retrieval

Paper • 2510.13217 • Published 24 days ago • 16

upvoted 3 articles 21 days ago

Article

Benchmarking Language Model Performance on 5th Gen Xeon at GCP

Dec 17, 2024

• 7

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5

• 503

Article

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

23 days ago

• 15

upvoted a paper 21 days ago

Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published 22 days ago • 37

upvoted 3 papers 22 days ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published 23 days ago • 101

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Paper • 2510.14967 • Published 22 days ago • 32

RAG-Anything: All-in-One RAG Framework

Paper • 2510.12323 • Published 25 days ago • 47

oh sehun

AI & ML interests

Recent Activity

Organizations

sehun's activity

Granite 4.0 Nano: Just how small can you go?

Supercharge your OCR Pipelines with Open Models

Benchmarking Language Model Performance on 5th Gen Xeon at GCP

Welcome GPT OSS, the new open-source model family from OpenAI!

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face