k050506koch/GPT3-dev-125m-0104 • Text Generation • 0.1B
This collection contains my GPT-3 Small implementations. All models here share the same architecture and are the same model at different training stages.