Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6 • 116
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published Oct 8 • 29
A failed experiment: Infini-Attention, and why we should keep trying? Article • Published Aug 14, 2024 • 70
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 165
OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Article • By nvidia and 3 others • Published Jul 18 • 50
Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published Jun 24 • 41
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published Feb 11 • 50
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published Feb 7 • 150
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published Jan 30 • 30
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published Dec 2, 2024 • 83
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes Paper • 2306.13649 • Published Jun 23, 2023 • 26
Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published Nov 25, 2024 • 20