ka1tovo commited on
Commit
803d8f8
·
verified ·
1 Parent(s): e4094ef

Upload 2 files

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +29 -30
  3. distill_orpo.png +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ distill_orpo.png filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,30 +1,29 @@
1
- ---
2
- license: mit
3
- ---
4
- ## 数据集
5
- ### 下载
6
- ```bash
7
- git lfs install
8
- git clone https://huggingface.co/datasets/juneup/PKU-SafeRLHF-orpo
9
- ```
10
- **若不想克隆大型文件:
11
- ```bash
12
- GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/datasets/juneup/PKU-SafeRLHF-orpo
13
- ```
14
-
15
- ## 模型
16
-
17
- ### 下载
18
- ```bash
19
- git lfs install
20
- git clone https://huggingface.co/juneup/internlm2.5_7b_distill_orpo
21
- ```
22
- **若不想克隆大型文件
23
- ```bash
24
- GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/juneup/internlm2.5_7b_distill_orpo
25
- ```
26
-
27
- ### 在Ollama下载
28
- ```bash
29
- ollama run Juneup/internlm2.5_7b_distill:orpo_q4_k_m
30
- ```
 
1
+ # internlm2.5_7b_distill_orpo
2
+ ## 架构图
3
+ <div align="center">
4
+ <img src="distill_orpo.png" width="800"/>
5
+ </div>
6
+
7
+ ## 基座模型
8
+ https://huggingface.co/Slipstream-Max/Emollm-InternLM2.5-7B-chat-GGUF-fp16
9
+
10
+ ## 数据集
11
+ ### 数据集组成
12
+ PKU-SafeRLHF(https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF-single-dimension)经处理后最终数据集为(https://huggingface.co/datasets/juneup/PKU-SafeRLHF-orpo)
13
+
14
+ ### 训练方式
15
+ orpo、λ=0.2,lr=5e-6
16
+
17
+ ## 下载模型
18
+ ```bash
19
+ git lfs install
20
+ git clone https://huggingface.co/juneup/internlm2.5_7b_distill_orpo
21
+ ```
22
+ 若不想克隆大型文件
23
+ ```bash
24
+ GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/juneup/internlm2.5_7b_distill_orpo
25
+ ```
26
+
27
+ ### 在Ollama下载
28
+ ```bash
29
+ ollama run Juneup/internlm2.5_7b_distill:orpo_q4_k_m
 
distill_orpo.png ADDED

Git LFS Details

  • SHA256: 465379617711259d6f4bde680576232c84532445f29483d446046661b9aeab78
  • Pointer size: 131 Bytes
  • Size of remote file: 324 kB