Smol-reason

Sweaterdog 's Collections

updated Aug 7

My first ever usage of GRPO fine tuning techniques, information learned from this model will be used on future Andy models.