CrPO
Collection
Creative Preference Optimization
•
24 items
•
Updated
This is a CrPO-sft-llama-3.1-8b-instruct model preference-finetuned on the MuCE-Pref dataset from the Creative Preference Optimization paper. This model is optimized for high output diversity and quality.
@misc{ismayilzada2025creativepreferenceoptimization,
title={Creative Preference Optimization},
author={Mete Ismayilzada and Antonio Laverghetta Jr. and Simone A. Luchini and Reet Patel and Antoine Bosselut and Lonneke van der Plas and Roger E. Beaty},
year={2025},
eprint={2505.14442},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.14442},
}
Base model
meta-llama/Llama-3.1-8B