8 357 94

MoRezaGH

Moreza009

https://github.com/mohammad-gh009

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

The Station: An Open-World Environment for AI-Driven Discovery

upvoted a paper 5 days ago

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

upvoted a paper 5 days ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

View all activity

Organizations

upvoted a paper about 4 hours ago

The Station: An Open-World Environment for AI-Driven Discovery

Paper • 2511.06309 • Published 3 days ago • 31

upvoted 3 papers 5 days ago

upvoted an article 10 days ago

Article

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

•

Jul 16

• 144

liked a Space 11 days ago

2.06k

The Smol Training Playbook: The Secrets to Building World-Class LLMs

📝

Display loss curves for training LLMs

upvoted 14 papers 11 days ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 114

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published 13 days ago • 23

Exploring Conditions for Diffusion models in Robotic Control

Paper • 2510.15510 • Published 26 days ago • 39

AgentFold: Long-Horizon Web Agents with Proactive Context Management

Paper • 2510.24699 • Published 15 days ago • 65

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published 15 days ago • 91

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Paper • 2510.25760 • Published 14 days ago • 16

ODesign: A World Model for Biomolecular Interaction Design

Paper • 2510.22304 • Published 18 days ago • 22

The Principles of Diffusion Models

Paper • 2510.21890 • Published 19 days ago • 55

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published 14 days ago • 44

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published 16 days ago • 83

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published 16 days ago • 95

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published 14 days ago • 207

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published 13 days ago • 102

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published 13 days ago • 103