Sejong Yang

kingsj0405

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation

upvoted a paper about 2 months ago

Video models are zero-shot learners and reasoners

liked a model about 2 months ago

manycore-research/SpatialGen-1.0

View all activity

Organizations

None yet

upvoted a paper 17 days ago

Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation

Paper • 2510.19592 • Published 18 days ago • 11

upvoted a paper about 2 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24 • 96

liked a model about 2 months ago

manycore-research/SpatialGen-1.0

Image-to-3D • Updated Sep 24 • 29 • 37

liked a model 3 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 3.79M • • 4.12k

upvoted a paper 4 months ago

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Paper • 2507.07990 • Published Jul 10 • 45

liked a model 5 months ago

GSAI-ML/LLaDA-V

Image-Text-to-Text • 8B • Updated Jun 18 • 17.5k • 20

upvoted a paper 6 months ago

UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations

Paper • 2505.08787 • Published May 13 • 14

liked 3 datasets 6 months ago

liked 2 models 7 months ago

trillionlabs/Trillion-LLaVA-7B

Visual Question Answering • 8B • Updated Apr 20 • 1 • 11

trillionlabs/Trillion-7B-preview

Text Generation • 8B • Updated Apr 25 • 227 • 86

upvoted a paper 7 months ago

CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy

Paper • 2504.07959 • Published Apr 10 • 10

upvoted a paper 8 months ago

Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models

Paper • 2503.18446 • Published Mar 24 • 12

liked a dataset 8 months ago

saiyan-world/Goku-MovieGenBench

Viewer • Updated Feb 11 • 1k • 1.37k • 214

upvoted a paper 10 months ago

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Paper • 2501.08326 • Published Jan 14 • 33

liked a model 11 months ago

ByteDance/AnimateDiff-Lightning

Text-to-Video • Updated Jan 6 • 30.4k • 968

liked 2 models 12 months ago

speechbrain/sepformer-wham16k-enhancement

Audio-to-Audio • Updated Feb 25, 2024 • 985 • 32

speechbrain/sepformer-wham

Audio-to-Audio • Updated Feb 19, 2024 • 215 • 44

updated a Space about 1 year ago

Test

🐢

Sejong Yang

AI & ML interests

Recent Activity

Organizations

kingsj0405's activity

Test