YOYO-AI commited on
Commit
a36ea9e
·
verified ·
1 Parent(s): 61f0be7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -37,4 +37,4 @@ tokenizer_source: base
37
  name: Qwen3-30B-A3B-YOYO-AutoThink-preview
38
  ```
39
  ## *Step2: Expert replacement*
40
- *Inspired by this [paper](https://arxiv.org/abs/2506.14794) , we use the following regular expression:`^model\.layers\.\d+\.mlp\.experts\.\d+\.(down_proj|gate_proj|up_proj)\.weight$`for expert replacement — all experts in Qwen3-30B-A3B-YOYO-AutoThink-preview that match the regex are replaced with those from Qwen3-30B-A3B-Thinking-2507.*
 
37
  name: Qwen3-30B-A3B-YOYO-AutoThink-preview
38
  ```
39
  ## *Step2: Expert replacement*
40
+ *Inspired by this [paper](https://arxiv.org/abs/2506.14794) , we use the following regular expression: `^model\.layers\.\d+\.mlp\.experts\.\d+\.(down_proj|gate_proj|up_proj)\.weight$` for expert replacement — all experts in Qwen3-30B-A3B-YOYO-AutoThink-preview that match the regex are replaced with those from Qwen3-30B-A3B-Thinking-2507.*