Daniel Fox

FlameF0X

https://flamefox.site/

AI & ML interests

Pre-training text generator. (Brother, I'm 17)

Recent Activity

New activity in aquif-ai/aquif-Grounding-7B 1 day ago

Model Id issue

#1 opened 1 day ago by FlameF0X

upvoted a paper 3 days ago

Robot Learning from a Physical World Model

Paper • 2511.07416 • Published 4 days ago • 25

updated a model 3 days ago

FlameF0X/i3-80m

Text Generation • 82.8M • Updated 3 days ago • 190 • 5

upvoted a paper 4 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published 6 days ago • 88

liked a model 4 days ago

dllm-collection/ModernBERT-large-chat-v0

0.4B • Updated 11 days ago • 617 • 8

upvoted a collection 4 days ago

BERT Chat

Collection

BERTs that chat • 2 items • Updated 10 days ago • 8

upvoted 2 papers 4 days ago

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention

Paper • 2510.04212 • Published Oct 5 • 23

MemMamba: Rethinking Memory Patterns in State Space Model

Paper • 2510.03279 • Published Sep 28 • 72

published a Space 5 days ago

Pyrhon IDE

🏃

Run and manage Python code with a web interface

updated a Space 7 days ago

Pyrhon IDE

🏃

Run and manage Python code with a web interface

upvoted a changelog 7 days ago

Hugging Face App on Okta Integration Network

Changelog • 7 days ago • 43

New activity in aquif-ai/aquif-3.5-Plus-30B-A3B 8 days ago

Personal report

#1 opened 9 days ago by Elsephire

upvoted a changelog 8 days ago

Hugging Face Docs for Humans and AI Agents

Changelog • 9 days ago • 46

New activity in aquif-ai/aquif-3.5-Max-42B-A3B 8 days ago

Is there any way I can work with you?

#5 opened 9 days ago by marcosstable

liked a dataset 9 days ago

codelion/dclm-baseline-1B

Viewer • Updated 13 days ago • 774k • 952 • 2

upvoted an article 10 days ago

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Article • 12 days ago

upvoted a paper 14 days ago

Pretraining Large Language Models with NVFP4

Paper • 2509.25149 • Published Sep 29 • 14

upvoted a collection 14 days ago

i3-architecture

Collection

5 items • Updated 15 days ago • 1

liked a dataset 15 days ago

MohamedRashad/midjourney-detailed-prompts

Viewer • Updated Apr 24, 2024 • 3.05k • 292 • 75

upvoted a paper 15 days ago

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 96
