Bingzheng Wei's picture

370 49

Bingzheng Wei

Bingzheng

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

upvoted a paper 4 days ago

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

upvoted a paper 4 days ago

The Principles of Diffusion Models

View all activity

Organizations

None yet

upvoted 6 papers 4 days ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

Paper • 2511.00086 • Published 11 days ago • 40

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published 15 days ago • 80

The Principles of Diffusion Models

Paper • 2510.21890 • Published 16 days ago • 51

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published 13 days ago • 95

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 11 days ago • 202

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published 12 days ago • 96

upvoted 4 papers 11 days ago

AgentFold: Long-Horizon Web Agents with Proactive Context Management

Paper • 2510.24699 • Published 12 days ago • 65

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published 12 days ago • 90

ReCode: Unify Plan and Action for Universal Granularity Control

Paper • 2510.23564 • Published 13 days ago • 118

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published 13 days ago • 172

upvoted a paper 12 days ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published 24 days ago • 45

upvoted 2 papers 13 days ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published 16 days ago • 92

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published 17 days ago • 54

upvoted 5 papers 16 days ago

LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published 18 days ago • 59

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 18 days ago • 110

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published 19 days ago • 82

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published 20 days ago • 117

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published 19 days ago • 107

upvoted a paper 19 days ago

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published 23 days ago • 145

upvoted a paper 20 days ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published 24 days ago • 101