Really impressive

#1
by mradermacher - opened

I have rarely seen a model card that lists so many features which all turn out to be spot-on, including, unfortunately, the memory usage. I've used this extensively with a 32k context. It doesn't seem to degrade over time, it follows orders very well, and so on.

The only issues I had are that it isn't too imaginative and that it is kind of caught in, let's say, "limited length response mode": it's hard to get it to provide passages longer than a few paragraphs, and when it does, the quality degrades badly. But other than that, I don't think I've been impressed this much by a <70B model before.

Thanks for your work!

Good day, and thank you for your positive feedback. I am very pleased to hear that someone of your importance in the community liked my model. I'd also like to take this opportunity to thank you for your work; your quants are always needed.

As for truthfulness, the point is that I make merges primarily to use them myself, so I see no point in embellishing the advantages or hiding the disadvantages; I would only be deceiving myself. I test the model in real use across different scenarios, and only then do I write a model card based on the impressions I recorded during testing. Of course, this is not an objective method, but in my opinion it works better than numerous benchmarks.

Gemma 3 was a very good release. The model is extremely smart for its size, although the original is very censored, which was partially fixed by finetunes. I hope that the future Gemma 4, other releases, and their finetunes (if there are any) will be even better, giving people with limited amounts of VRAM and RAM an experience comparable to using large models.

Thank you again for your feedback, I am glad that our opinions about this model coincide.
