Tianxin Wei

tianxinwei

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

upvoted a paper about 1 month ago

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

upvoted a paper about 1 month ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

View all activity

Organizations

upvoted a paper 4 days ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

Paper • 2511.00086 • Published 9 days ago • 40

upvoted 3 papers about 1 month ago

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

Paper • 2510.06217 • Published Oct 7 • 62

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 54

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26 • 133

updated a dataset about 2 months ago

tianxinwei/Time_Series

Viewer • Updated Sep 22 • 66k • 4

authored 4 papers about 2 months ago

NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning

Paper • 2307.08941 • Published Jul 18, 2023 • 1

Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond

Paper • 2403.10667 • Published Mar 15, 2024 • 1

SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence

Paper • 2502.08767 • Published Feb 12

WAPITI: A Watermark for Finetuned Open-Source LLMs

Paper • 2410.06467 • Published Oct 9, 2024

published a dataset about 2 months ago

tianxinwei/Time_Series

Viewer • Updated Sep 22 • 66k • 4

upvoted a paper 4 months ago

MIRIX: Multi-Agent Memory System for LLM-Based Agents

Paper • 2507.07957 • Published Jul 10 • 76

authored a paper 5 months ago

Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance

Paper • 2506.06444 • Published Jun 6 • 73

upvoted a paper 5 months ago