Avey 1 Research Preview
1.5B preview models trained on 100B tokens of FineWeb, and an instruct-tuned version on smoltalk.
avey-ai/avey1-1.5B-base-preview-100BT • Text Generation • 2B • Updated Jun 16, 2025 • 2 • 7
avey-ai/avey1-1.5B-it-preview-100BT • Text Generation • 2B • Updated Jun 16, 2025
Don't Pay Attention • Paper • 2506.11305 • Published Jun 12, 2025 • 8

Don't Pay Attention
All model checkpoints trained for the Don't Pay Attention paper.
avey-ai/avey1-dpa-0.1B-100BT • Text Generation • 0.2B • Updated Jun 16, 2025
avey-ai/avey1-dpa-0.1B-95BT • Text Generation • 0.2B • Updated Jun 16, 2025
avey-ai/avey1-dpa-0.1B-90BT • Text Generation • 0.2B • Updated Jun 16, 2025
avey-ai/avey1-dpa-0.5B-100BT • Text Generation • 0.5B • Updated Jun 16, 2025