ZhiguangHan commited on
Commit
d38ef26
·
1 Parent(s): db5bcc3

End of training

Browse files
Files changed (2) hide show
  1. README.md +24 -18
  2. model.safetensors +1 -1
README.md CHANGED
@@ -17,8 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.3592
21
- - Accuracy: 0.14
 
 
 
22
 
23
  ## Model description
24
 
@@ -43,29 +46,32 @@ The following hyperparameters were used during training:
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
- - num_epochs: 12
47
 
48
  ### Training results
49
 
50
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
51
- |:-------------:|:-----:|:----:|:---------------:|:--------:|
52
- | 1.6139 | 1.0 | 250 | 1.3916 | 0.102 |
53
- | 1.5289 | 2.0 | 500 | 1.4550 | 0.108 |
54
- | 1.4823 | 3.0 | 750 | 1.3630 | 0.132 |
55
- | 1.4372 | 4.0 | 1000 | 1.3930 | 0.116 |
56
- | 1.4563 | 5.0 | 1250 | 1.3857 | 0.124 |
57
- | 1.4347 | 6.0 | 1500 | 1.3708 | 0.124 |
58
- | 1.4303 | 7.0 | 1750 | 1.3856 | 0.136 |
59
- | 1.4072 | 8.0 | 2000 | 1.3595 | 0.136 |
60
- | 1.4045 | 9.0 | 2250 | 1.3677 | 0.13 |
61
- | 1.3861 | 10.0 | 2500 | 1.3511 | 0.13 |
62
- | 1.376 | 11.0 | 2750 | 1.3543 | 0.136 |
63
- | 1.3699 | 12.0 | 3000 | 1.3592 | 0.14 |
 
 
 
64
 
65
 
66
  ### Framework versions
67
 
68
  - Transformers 4.35.2
69
- - Pytorch 2.1.0+cu118
70
  - Datasets 2.15.0
71
  - Tokenizers 0.15.0
 
17
 
18
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.3494
21
+ - Accuracy: 0.156
22
+ - Mse: 1.4726
23
+ - Log-distance: 0.6559
24
+ - S Score: 0.5092
25
 
26
  ## Model description
27
 
 
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 15
50
 
51
  ### Training results
52
 
53
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | Mse | Log-distance | S Score |
54
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------------:|:-------:|
55
+ | 12.3227 | 1.0 | 250 | 3.0911 | 0.126 | 1.6773 | 0.5862 | 0.5608 |
56
+ | 3.0171 | 2.0 | 500 | 1.8496 | 0.126 | 1.6805 | 0.5886 | 0.5608 |
57
+ | 2.1379 | 3.0 | 750 | 1.4488 | 0.126 | 1.6773 | 0.5862 | 0.5608 |
58
+ | 1.7896 | 4.0 | 1000 | 1.4309 | 0.126 | 1.6773 | 0.5862 | 0.5608 |
59
+ | 1.6843 | 5.0 | 1250 | 1.3863 | 0.136 | 1.5477 | 0.5660 | 0.5764 |
60
+ | 1.6196 | 6.0 | 1500 | 1.3676 | 0.142 | 1.4865 | 0.6943 | 0.4700 |
61
+ | 1.5812 | 7.0 | 1750 | 1.3518 | 0.14 | 1.4748 | 0.6894 | 0.4728 |
62
+ | 1.5336 | 8.0 | 2000 | 1.3538 | 0.148 | 1.6125 | 0.7828 | 0.4220 |
63
+ | 1.5106 | 9.0 | 2250 | 1.3468 | 0.172 | 1.4330 | 0.6204 | 0.5484 |
64
+ | 1.486 | 10.0 | 2500 | 1.3519 | 0.16 | 1.4487 | 0.6414 | 0.5268 |
65
+ | 1.4524 | 11.0 | 2750 | 1.3465 | 0.156 | 1.3796 | 0.5703 | 0.5720 |
66
+ | 1.4614 | 12.0 | 3000 | 1.3494 | 0.162 | 1.4250 | 0.6270 | 0.5316 |
67
+ | 1.4525 | 13.0 | 3250 | 1.3589 | 0.146 | 1.4602 | 0.6592 | 0.5068 |
68
+ | 1.4379 | 14.0 | 3500 | 1.3505 | 0.154 | 1.4722 | 0.6524 | 0.5128 |
69
+ | 1.4397 | 15.0 | 3750 | 1.3494 | 0.156 | 1.4726 | 0.6559 | 0.5092 |
70
 
71
 
72
  ### Framework versions
73
 
74
  - Transformers 4.35.2
75
+ - Pytorch 2.1.0+cu121
76
  - Datasets 2.15.0
77
  - Tokenizers 0.15.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:95ff5071ccf89e56545f1c5e637c4e7b407d2ae465d7e015bddefaf8ca4947e7
3
  size 1200729512
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ea04b9c5babeac3bb13499104805e2f68c1a266928adac16c10f1f8f26e2344e
3
  size 1200729512