Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2506.15677

SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence

Paper • 2506.15672 • Published Jun 18 • 15
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Paper • 2506.04180 • Published Jun 4 • 33
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

benjamin-paine/steamboat-willie-14b

Text-to-Video • Updated Mar 20 • 98 • 41
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23
Running on Zero

MCP

320

Self Forcing Wan 2.1

🎥

320

Real-time video generation
nasa-ibm-ai4science/Surya-1.0

Updated Aug 20 • 135 • 94

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8, 2024 • 15
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents

Paper • 2410.22476 • Published Oct 29, 2024 • 27
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30, 2024 • 49
Training-free Regional Prompting for Diffusion Transformers

Paper • 2411.02395 • Published Nov 4, 2024 • 25

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated May 1 • 5.71k • 1.22k
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14 • 15
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct

Text Generation • 8B • Updated Apr 17 • 111 • 15
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 63

Dmitri’s papers

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16 • 30
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 165
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

Paper • 2502.11357 • Published Feb 17 • 11
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17 • 32

SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence

Paper • 2506.15672 • Published Jun 18 • 15
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Paper • 2506.04180 • Published Jun 4 • 33
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated May 1 • 5.71k • 1.22k
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14 • 15
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct

Text Generation • 8B • Updated Apr 17 • 111 • 15
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 63

benjamin-paine/steamboat-willie-14b

Text-to-Video • Updated Mar 20 • 98 • 41
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23
Running on Zero

MCP

320

Self Forcing Wan 2.1

🎥

320

Real-time video generation
nasa-ibm-ai4science/Surya-1.0

Updated Aug 20 • 135 • 94

Dmitri’s papers

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16 • 30
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 165
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

Paper • 2502.11357 • Published Feb 17 • 11
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Paper • 2503.12797 • Published Mar 17 • 32

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8, 2024 • 15
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents

Paper • 2410.22476 • Published Oct 29, 2024 • 27
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30, 2024 • 49
Training-free Regional Prompting for Diffusion Transformers

Paper • 2411.02395 • Published Nov 4, 2024 • 25

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs