Post
400
Introducing JFLEG-JA, a new Japanese language error correction benchmark with 1,335 sentences, each paired with 4 high-quality human corrections π
Inspired by the English JFLEG dataset, this dataset covers diverse error types, including particle mistakes, kanji mix-ups, incorrect contextual verb, adjective, and literary technique usage.
You can use this for evaluating LLMs, few-shot learning, error analysis, or fine-tuning correction systems.
ronantakizawa/jfleg-japanese
#japanese #evals #benchmark
Inspired by the English JFLEG dataset, this dataset covers diverse error types, including particle mistakes, kanji mix-ups, incorrect contextual verb, adjective, and literary technique usage.
You can use this for evaluating LLMs, few-shot learning, error analysis, or fine-tuning correction systems.
ronantakizawa/jfleg-japanese
#japanese #evals #benchmark