What are G0 / G1 / G1a2 / G1b?

The fields like G0a / G1a / G1a2 in RWKV model names indicate versions of the training data. In terms of data quality, the ranking is: G1b > G1a3 > G1a2 > G1a > G1 > G0a2 > G0.

The RWKV7-G1a model is an advanced version of RWKV7-G1 that was further trained with 1T (1 trillion tokens) of high-quality inference and instruction data. RWKV7-G1a2 was produced by continuing to add more data and training on top of RWKV7-G1a. And so on.

More high-quality data will be added later to form the G1b dataset, and RWKV7-G1b series models will also be trained and open-sourced.

What is the difference between the RWKV7-G series and the World series?

The RWKV7-G series supports an inference mode, which can be activated using the following format:

User: USER_PROMPT

Assistant: <think

How to choose the best model?

Look at the date in the model name — for the same parameter size, a newer model is better!

For example, for the same 1.5B model, a G1a2 version released on 251005 will definitely be superior to a G1 version released on 250429.

For the 0.1B and 0.4B models, we recommend using FP16/Q8_0 quantization. Otherwise, the models may fail to complete tasks due to precision loss caused by quantization.


G0/G1/G1a2/G1b 是什么?

RWKV 模型名称中的 G0a/G1a/G1a2 等字段是训练数据的版本,数据质量排序:G1b > G1a3 > G1a2 > G1a > G1 > G0a2 > G0 。

RWKV7-G1a 模型是在 RWKV7-G1 模型的基础上继续训练了 1T 优质推理和指令数据的进阶版,RWKV7-G1a2 则是在 RWKV7-G1a 模型的基础上继续添加数据训练,以此类推。

后续会继续添加优质数据形成 G1b 数据集,也会训练并开源 RWKV7-G1b 系列模型。

RWKV7-G 系列和 World 系列有什么区别?

RWKV7-G 系列模型支持推理模式,可通过以下格式开启推理模式:

User: USER_PROMPT

Assistant: <think

如何选择最好的模型?

看模型名称中的日期,相同的参数,模型越新越好!

比如同样是 1.5B 模型,发布于 251005 的 G1a2 版本必定优于 250429 的 G1 版本 。

对于 0.1B 和 0.4B 模型,我们建议使用 FP16/Q8_0 量化类型。否则模型可能因量化带来的精度损失而无法完成任务。

Downloads last month
221
GGUF
Model size
13B params
Architecture
rwkv7
Hardware compatibility
Log In to view the estimation

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support