JP-TL-Bench: Anchored Pairwise LLM Evaluation for Bidirectional Japanese-English Translation Paper • 2601.00223 • Published 7 days ago • 1
The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator Article • Published 22 days ago • 43
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs Paper • 2412.04144 • Published Dec 5, 2024 • 6
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model Paper • 2510.18855 • Published Oct 21, 2025 • 71
Ovis2.5 Collection Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19, 2025 • 57
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20, 2025 • 78
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published Apr 2, 2025 • 87
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Paper • 2504.07096 • Published Apr 9, 2025 • 76
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks Paper • 2502.08235 • Published Feb 12, 2025 • 58
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published Feb 13, 2025 • 148
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models Paper • 2410.07985 • Published Oct 10, 2024 • 32