ErNewdev0 commited on
Commit
07bee0d
·
verified ·
1 Parent(s): 4432c09

Model save

Browse files
Files changed (3) hide show
  1. README.md +7 -7
  2. generation_config.json +1 -1
  3. model.safetensors +1 -1
README.md CHANGED
@@ -12,9 +12,9 @@ should probably proofread and complete it, then remove this comment. -->
12
 
13
  # nusa-beta-0001
14
 
15
- This model was trained from scratch on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 1.7441
18
 
19
  ## Model description
20
 
@@ -36,7 +36,7 @@ The following hyperparameters were used during training:
36
  - learning_rate: 5e-05
37
  - train_batch_size: 8
38
  - eval_batch_size: 8
39
- - seed: 50
40
  - gradient_accumulation_steps: 2
41
  - total_train_batch_size: 16
42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
@@ -49,12 +49,12 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
- | 0.6651 | 10.0 | 100 | 1.7441 |
53
 
54
 
55
  ### Framework versions
56
 
57
- - Transformers 4.51.3
58
- - Pytorch 2.6.0+cu124
59
  - Datasets 3.5.1
60
- - Tokenizers 0.21.1
 
12
 
13
  # nusa-beta-0001
14
 
15
+ This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 6.8389
18
 
19
  ## Model description
20
 
 
36
  - learning_rate: 5e-05
37
  - train_batch_size: 8
38
  - eval_batch_size: 8
39
+ - seed: 42
40
  - gradient_accumulation_steps: 2
41
  - total_train_batch_size: 16
42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | 6.9648 | 10.0 | 100 | 6.8389 |
53
 
54
 
55
  ### Framework versions
56
 
57
+ - Transformers 4.48.3
58
+ - Pytorch 2.5.1+cu124
59
  - Datasets 3.5.1
60
+ - Tokenizers 0.21.0
generation_config.json CHANGED
@@ -3,5 +3,5 @@
3
  "bos_token_id": 50256,
4
  "eos_token_id": 50256,
5
  "pad_token_id": 50257,
6
- "transformers_version": "4.51.3"
7
  }
 
3
  "bos_token_id": 50256,
4
  "eos_token_id": 50256,
5
  "pad_token_id": 50257,
6
+ "transformers_version": "4.48.3"
7
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:588c83f7a8a88eaacddd8d35882d3a1a5ccda135146cb83a1a0b42aece235610
3
  size 120188792
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c0f8411e2bb6670d9ef4d678701e36ff3445ceeb507e898bd91c4dcde7797e38
3
  size 120188792