atsuki-yamaguchi commited on
Commit
59f7d98
·
verified ·
1 Parent(s): 90f0c1b

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +55 -0
README.md ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ license: apache-2.0
4
+ datasets:
5
+ - allenai/MADLAD-400
6
+ language:
7
+ - ig
8
+ base_model:
9
+ - allenai/OLMo-2-1124-7B-Instruct
10
+ ---
11
+ # OLMo 2 1124 7B Instruct for Igbo: SSU-Wanda (Calibration with 128 samples)
12
+
13
+ This model is built on top of OLMo 2 1124 7B Instruct adapted for Igbo using 200M target language tokens sampled from MADLAD-400. The model is adapted using the SSU-Wanda approach but calibrated with 128 samples instead of 500 samples.
14
+
15
+ ## Model Description
16
+
17
+ - **Language:** Igbo
18
+ - **License:** Apache 2.0
19
+ - **Fine-tuned from model:** [allenai/OLMo-2-1124-7B-Instruct](https://huggingface.co/allenai/OLMo-2-1124-7B-Instruct)
20
+
21
+
22
+ ## Model Sources
23
+
24
+ - **Repository:** https://github.com/gucci-j/ssu
25
+ - **Paper:** https://arxiv.org/abs/2512.04844
26
+
27
+
28
+ ## How to Get Started with the Model
29
+ Use the code below to get started with the model.
30
+ ```python
31
+ from transformers import AutoTokenizer, AutoModelForCausalLM
32
+
33
+ model = AutoModelForCausalLM.from_pretrained(
34
+ "ssu-project/OLMo-2-1124-7B-Instruct-ig-ssu_128"
35
+ )
36
+ tokenizer = AutoTokenizer.from_pretrained(
37
+ "ssu-project/OLMo-2-1124-7B-Instruct-ig-ssu_128"
38
+ )
39
+ ```
40
+
41
+
42
+ ## Citation
43
+ ```
44
+ @misc{yamaguchi2025mitigatingcatastrophicforgettingtarget,
45
+ title={Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates},
46
+ author={Atsuki Yamaguchi and Terufumi Morishita and Aline Villavicencio and Nikolaos Aletras},
47
+ year={2025},
48
+ eprint={2512.04844},
49
+ archivePrefix={arXiv},
50
+ primaryClass={cs.CL},
51
+ url={https://arxiv.org/abs/2512.04844},
52
+ }
53
+ ```
54
+
55
+