Welcome to i3-lab
"Chase the SOTA pipeline, not the MMLU slop."
i3-lab is dedicated to extreme efficiency in LLM architecture. We develop the i3 model family: state-of-the-art architectures designed to reach, in hours on consumer-grade hardware (such as the NVIDIA Quadro P100), performance levels that typically require days on massive GPU clusters.
i3: High-Efficiency Training
We specialize in hybrid architectures, specifically RWKV-Attention, to bypass the quadratic scaling bottlenecks of traditional Transformers.
- Fast Iteration: Trainable in hours, not weeks.
- Accessible SOTA: High performance on legacy/mid-range hardware.
- Open Research: Push the boundaries of what is possible with limited compute.
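The efficiency claim above can be made concrete with a toy, single-channel sketch. This is hypothetical illustrative code, not the open-i3 implementation: it contrasts standard softmax attention, where every query scores every key (O(T²) in sequence length), with an RWKV-style recurrence, where a decayed running state summarizes the whole history in a single pass (O(T)). The `decay` parameter here is a fixed stand-in for RWKV's learned per-channel time-decay.

```python
import math

def softmax_attention(q, k, v):
    """Quadratic attention: each query scores every key, so cost
    grows as O(T^2) with sequence length T."""
    out = []
    for qi in q:
        scores = [math.exp(qi * kj) for kj in k]
        z = sum(scores)
        out.append(sum(s * vj for s, vj in zip(scores, v)) / z)
    return out

def rwkv_style_mix(k, v, decay=0.9):
    """Linear-time recurrence: a decayed running numerator/denominator
    pair replaces the full key/value history, so cost is O(T).
    This mirrors the spirit of RWKV's wkv mixing, not its exact form."""
    num = 0.0  # running decayed sum of exp(k) * v
    den = 0.0  # running decayed normalizer, sum of exp(k)
    out = []
    for kt, vt in zip(k, v):
        w = math.exp(kt)
        num = decay * num + w * vt
        den = decay * den + w
        out.append(num / den)
    return out
```

With `decay=1.0` and uniform keys the recurrence reduces to a running mean of the values, which makes the linear-state behavior easy to sanity-check by hand.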
Quick Links
- Source Code: FlameF0X/open-i3
- Community: Join our Discord
Roadmap / TODO
We are currently scaling our architecture through the following milestones:
- i3-Ethan-it — Specialized instruction-tuned variant.
- i3-1B — Our first major scale-up.
- i3-7B-A1.6B — Mixture of Experts / Sparsity testing.
Usage & Attribution
The open-i3 codebase is licensed under Apache 2.0. We believe in open source, and we value attribution.
If you use our architecture (RWKV-Attention) or our weights, you are required, per Sections 4(b) and 4(d) of the license, to:
- Carry prominent notices of any modifications.
- Include a readable copy of the attribution notices from our NOTICE file.
You must include the attribution link found in the open-i3 GitHub in your documentation or model card.
Made with ❤️ and DETERMINATION by Daniel.