Shijie Geng

makitanikaze

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 13 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 14 days ago • 203

upvoted a collection 17 days ago

Self-Correcting Delta Transformer - Adaptive LLMs

Self-Correcting Delta Transformer - DDL provides the Hardware mechanism (The Erazor), NL solves the software problem. • 3 items • Updated 6 days ago • 2

upvoted 3 papers 17 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published 23 days ago • 139

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published 23 days ago • 115

Nested Learning: The Illusion of Deep Learning Architectures

Paper • 2512.24695 • Published 23 days ago • 38

upvoted a paper 20 days ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published 23 days ago • 59

upvoted a paper 21 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 22 days ago • 275

upvoted a paper 23 days ago

LFM2 Technical Report

Paper • 2511.23404 • Published Nov 28, 2025 • 50

upvoted a paper 28 days ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 109

updated a collection about 1 month ago

gui agent

5 items • Updated Dec 19, 2025

upvoted 10 papers about 1 month ago

Step-GUI Technical Report

Paper • 2512.15431 • Published Dec 17, 2025 • 130

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 38

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 96

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20, 2025 • 112

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 128

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 151

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 120

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 253

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 89

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 228