Model idea
It'd be cool if you fine-tuned Qwen/Qwen3-4B-Thinking-2507 on all these datasets.
Hello @Enderchef ,
I started the fine-tuning process; it shouldn't take long. I will reply here once it's published.
Hey @Enderchef ,
The model is ready; please let me know if it matches your expectations.
Download it here.
Would you share the source code for how you fine-tuned it?
Eh, so this isn't RLHF tuning, just SFT?
Or did you go with plain SFT because you're using a reasoning model as the base?
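For context on what the SFT side of this usually involves, here is a minimal data-formatting sketch. It assumes Qwen3's ChatML-style layout with the chain of thought wrapped in `<think>` tags, and the dataset field names (`prompt`, `reasoning`, `answer`) are illustrative assumptions, not the actual schema used for this model:

```python
def to_sft_text(example: dict) -> str:
    """Render one training example in a Qwen3-style chat layout, with the
    chain of thought wrapped in <think> tags inside the assistant turn."""
    return (
        f"<|im_start|>user\n{example['prompt']}<|im_end|>\n"
        f"<|im_start|>assistant\n"
        f"<think>\n{example['reasoning']}\n</think>\n\n"
        f"{example['answer']}<|im_end|>"
    )

sample = {
    "prompt": "What is 2 + 2?",
    "reasoning": "Adding 2 and 2 gives 4.",
    "answer": "4",
}
text = to_sft_text(sample)
print(text)
```

Strings like these would then be tokenized and fed to a standard SFT trainer; the point is that the reasoning trace has to be rendered in the base model's own thinking format before training.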
Here is something you must keep in mind: if the behavior captured in the dataset differs from the base model's, you can't just apply SFT as-is.

I've tested all of your models trained on the 2.5 Pro datasets. Do they think? Yes, but they only think 'like' Gemini thinks; they don't actually follow how Gemini thinks. If the reasoning format is different, the training has to account for that, and the RLHF stage has to differ as well: you need to set up specific objectives, such as how you design the reward, which reasoning path is chosen, and which is rejected.
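A minimal sketch of the chosen/rejected setup described above, in a DPO-style layout. The field names, the `<think>` tag convention, and the toy reward are all illustrative assumptions, not the actual training objective:

```python
def make_preference_pair(prompt: str, good: str, bad: str, answer: str) -> dict:
    """Pair a completion that follows the target reasoning format (chosen)
    with one that does not (rejected)."""
    return {
        "prompt": prompt,
        "chosen": f"<think>\n{good}\n</think>\n\n{answer}",
        "rejected": f"{bad}\n\n{answer}",  # skips the expected think block
    }

def format_reward(completion: str) -> float:
    """Toy reward: 1.0 only when the completion opens with a closed
    <think> block, i.e. it follows the target reasoning format."""
    has_format = completion.startswith("<think>") and "</think>" in completion
    return 1.0 if has_format else 0.0

pair = make_preference_pair(
    "Why is the sky blue?",
    "Shorter wavelengths scatter more strongly in the atmosphere.",
    "The sky reflects the ocean.",
    "Because of Rayleigh scattering of blue light.",
)
print(format_reward(pair["chosen"]), format_reward(pair["rejected"]))  # 1.0 0.0
```

In a real preference-tuning run the reward or the chosen/rejected split would encode much more than format (correctness, style of the reasoning trace, etc.), but this is the basic shape of "which think path is chosen and which is rejected."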