mpasila
/

JP-EN-Translator-2K-steps-7B

Text Generation

text-generation-inference

Model card Files Files and versions

mpasila commited on Mar 28, 2024

Commit

4077f4c

·

verified ·

1 Parent(s): 0e1cf77

Update README.md

Files changed (1) hide show

README.md +20 -0

README.md CHANGED Viewed

@@ -10,7 +10,27 @@ tags:
 - trl
 - sft
 base_model: augmxnt/shisa-base-7b-v1
 ---
 # Uploaded  model

 - trl
 - sft
 base_model: augmxnt/shisa-base-7b-v1
+datasets:
+- NilanE/ParallelFiction-Ja_En-100k
+- mpasila/ParallelFiction-Ja_En-100k-alpaca
 ---
+Experimental model, may not perform that well. Dataset used is [a modified](https://huggingface.co/datasets/mpasila/ParallelFiction-Ja_En-100k-alpaca) version of [NilanE/ParallelFiction-Ja_En-100k](https://huggingface.co/datasets/NilanE/ParallelFiction-Ja_En-100k).
+After training with an 8k context length it didn't appear to improve performance much at all. Not sure if I should keep training it (which is costly) or if I should fix some issues with the dataset (like it starting with Ch or Chapter) or I go back to finetuning Finnish models.
+### Prompt format: Alpaca
+```
+Below is a translation task, paired with an input that provides further context. Write a response that appropriately completes the request.
+### Instruction:
+{}
+### Input:
+{}
+### Response:
+{}
+```
 # Uploaded  model