17 14

Jack Zhang

jackzhang

http://jackz.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Beyond Reasoning Gains: Mitigating General Capabilities Forgetting in Large Reasoning Models

upvoted a paper 26 days ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

commented on a paper 26 days ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

View all activity

Organizations

upvoted a paper 7 days ago

Beyond Reasoning Gains: Mitigating General Capabilities Forgetting in Large Reasoning Models

Paper • 2510.21978 • Published 11 days ago • 14

upvoted 2 papers 26 days ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published 26 days ago • 40

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Paper • 2510.07242 • Published 27 days ago • 30

upvoted a paper 2 months ago

Jointly Reinforcing Diversity and Quality in Language Model Generations

Paper • 2509.02534 • Published Sep 2 • 24

upvoted 2 papers 6 months ago

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

Paper • 2505.02363 • Published May 5 • 7

Certified Mitigation of Worst-Case LLM Copyright Infringement

Paper • 2504.16046 • Published Apr 22 • 13

upvoted a paper 7 months ago

WikiVideo: Article Generation from Multiple Videos

Paper • 2504.00939 • Published Apr 1 • 37

upvoted a paper 8 months ago

Optimizing Decomposition for Optimal Claim Verification

Paper • 2503.15354 • Published Mar 19 • 18

upvoted an article 8 months ago

Article

MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression

•

Feb 4

• 18

upvoted a paper 8 months ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published Feb 25 • 28

upvoted 2 papers 11 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 157

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published Dec 17, 2024 • 35

upvoted 2 papers about 1 year ago

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

Paper • 2410.08968 • Published Oct 11, 2024 • 13

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

Paper • 2410.01044 • Published Oct 1, 2024 • 36

Jack Zhang

AI & ML interests

Recent Activity

Organizations

jackzhang's activity

MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression