Update README.md
Browse files
README.md
CHANGED
|
@@ -37,4 +37,4 @@ tokenizer_source: base
|
|
| 37 |
name: Qwen3-30B-A3B-YOYO-AutoThink-preview
|
| 38 |
```
|
| 39 |
## *Step2: Expert replacement*
|
| 40 |
-
*Inspired by this [paper](https://arxiv.org/abs/2506.14794) , we use the following regular expression
|
|
|
|
| 37 |
name: Qwen3-30B-A3B-YOYO-AutoThink-preview
|
| 38 |
```
|
| 39 |
## *Step2: Expert replacement*
|
| 40 |
+
*Inspired by this [paper](https://arxiv.org/abs/2506.14794) , we use the following regular expression: `^model\.layers\.\d+\.mlp\.experts\.\d+\.(down_proj|gate_proj|up_proj)\.weight$` for expert replacement — all experts in Qwen3-30B-A3B-YOYO-AutoThink-preview that match the regex are replaced with those from Qwen3-30B-A3B-Thinking-2507.*
|