ZhiguangHan
/

mt5-small-task2-dataset2

text2text-generation

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

ZhiguangHan commited on Dec 18, 2023

Commit

f661e6f

·

1 Parent(s): 87c8c50

End of training

Files changed (1) hide show

README.md +18 -15

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4228
 - Accuracy: 0.32
 ## Model description
@@ -43,29 +43,32 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 12
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 6.2681        | 1.0   | 250  | 1.1622          | 0.018    |
-| 1.7716        | 2.0   | 500  | 0.8039          | 0.066    |
-| 1.1424        | 3.0   | 750  | 0.6418          | 0.14     |
-| 0.8879        | 4.0   | 1000 | 0.5569          | 0.204    |
-| 0.7728        | 5.0   | 1250 | 0.5065          | 0.256    |
-| 0.6886        | 6.0   | 1500 | 0.4821          | 0.268    |
-| 0.6422        | 7.0   | 1750 | 0.4619          | 0.282    |
-| 0.6137        | 8.0   | 2000 | 0.4500          | 0.298    |
-| 0.5837        | 9.0   | 2250 | 0.4314          | 0.3      |
-| 0.5711        | 10.0  | 2500 | 0.4256          | 0.316    |
-| 0.5596        | 11.0  | 2750 | 0.4228          | 0.316    |
-| 0.5458        | 12.0  | 3000 | 0.4228          | 0.32     |
 ### Framework versions
 - Transformers 4.35.2
-- Pytorch 2.1.0+cu118
 - Datasets 2.15.0
 - Tokenizers 0.15.0

 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4462
 - Accuracy: 0.32
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 6.6376        | 1.0   | 250  | 1.2577          | 0.004    |
+| 1.6709        | 2.0   | 500  | 0.8265          | 0.088    |
+| 1.0413        | 3.0   | 750  | 0.6782          | 0.144    |
+| 0.8324        | 4.0   | 1000 | 0.5901          | 0.222    |
+| 0.7187        | 5.0   | 1250 | 0.5476          | 0.246    |
+| 0.6556        | 6.0   | 1500 | 0.5215          | 0.276    |
+| 0.6089        | 7.0   | 1750 | 0.5028          | 0.274    |
+| 0.5736        | 8.0   | 2000 | 0.4930          | 0.304    |
+| 0.5385        | 9.0   | 2250 | 0.4695          | 0.296    |
+| 0.5195        | 10.0  | 2500 | 0.4650          | 0.304    |
+| 0.5073        | 11.0  | 2750 | 0.4571          | 0.304    |
+| 0.4895        | 12.0  | 3000 | 0.4491          | 0.306    |
+| 0.4836        | 13.0  | 3250 | 0.4495          | 0.316    |
+| 0.4745        | 14.0  | 3500 | 0.4460          | 0.318    |
+| 0.4736        | 15.0  | 3750 | 0.4462          | 0.32     |
 ### Framework versions
 - Transformers 4.35.2
+- Pytorch 2.1.0+cu121
 - Datasets 2.15.0
 - Tokenizers 0.15.0