Ryker Chang

happyPydog

happyPydog

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

ibm-granite/granite-3.1-8b-instruct

liked a Space 4 days ago

akhaliq/anycoder

liked a Space 5 days ago

HuggingFaceTB/smol-training-playbook

View all activity

Organizations

None yet

liked a model 2 days ago

ibm-granite/granite-3.1-8b-instruct

Text Generation • 8B • Updated Apr 16 • 37.2k • 164

liked a Space 4 days ago

2.86k

Anycoder

🏢

Generate Gradio app code based on user requests

liked a Space 5 days ago

1.63k

The Smol Training Playbook: The Secrets to Building World-Class LLMs

📝

upvoted 7 papers 24 days ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 467

Agent Learning via Early Experience

Paper • 2510.08558 • Published 29 days ago • 260

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29 • 136

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30 • 525

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 220

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 672

upvoted a paper about 2 months ago

Fast Transformer Decoding: One Write-Head is All You Need

Paper • 1911.02150 • Published Nov 6, 2019 • 9

upvoted 4 articles about 2 months ago

Article

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

Aug 8

• 77

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18

• 87

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11

• 161

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4

• 253

liked a dataset about 2 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 4.86k • 599

upvoted an article 2 months ago

Article

Why SGLang is a Game-Changer for LLM Workflows

•

Jul 7

• 9

liked 2 models 2 months ago

monster-labs/control_v1p_sd15_qrcode_monster

Updated Jul 21, 2023 • 74.1k • 1.42k

google/flan-t5-small

77M • Updated Oct 10, 2023 • 509k • 443

liked a Space 2 months ago

6.65k

MTEB Leaderboard

🥇

Embedding Leaderboard

Ryker Chang

AI & ML interests

Recent Activity

Organizations

happyPydog's activity

Anycoder

The Smol Training Playbook: The Secrets to Building World-Class LLMs

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Welcome EmbeddingGemma, Google's new efficient embedding model

Why SGLang is a Game-Changer for LLM Workflows

MTEB Leaderboard