Update README.md
Browse files
README.md
CHANGED
|
@@ -13,6 +13,12 @@ This model has 10 layers, 10 heads and 640 embeddings, with a context window of
|
|
| 13 |
It was able to achieve a training loss of 2.3256 and validation loss of 2.3651.
|
| 14 |
Supervised fine-tuning should be performed before use.
|
| 15 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 16 |
Note: GPT-Usenet uses MBOX syntax.
|
| 17 |
```
|
| 18 |
uucp:
|
|
|
|
| 13 |
It was able to achieve a training loss of 2.3256 and validation loss of 2.3651.
|
| 14 |
Supervised fine-tuning should be performed before use.
|
| 15 |
|
| 16 |
+
Training Information:
|
| 17 |
+
| Metric |Value|
|
| 18 |
+
|---------------------------------|----:|
|
| 19 |
+
|Training Loss |2.3256|
|
| 20 |
+
|Validation Loss |2.3651|
|
| 21 |
+
|
| 22 |
Note: GPT-Usenet uses MBOX syntax.
|
| 23 |
```
|
| 24 |
uucp:
|