FlameF0X/i3-tiny
Text Generation
Note: The models are listed in the default order set by Hugging Face, so the latest model appears at the bottom.
Note The first i3-architecture LM.
Note Our first usable i3 model (meaning we added Transformers support and supporting code for it).
Note Smol stable text generator that took over 14 hours to pre-train :)
--- Changes ---
Trained on over 1T tokens
LoRPt layers
Note SOTA model. Pre-trained in around 2 to 4 hours, compared with over 14 hours for the previous version.
--- Changes ---
Trained on over 3T tokens
Other details are available in the model card.
Try i3-80m, a SOTA efficient-training LM architecture.
Note Our first Space for i3-80m.