Update README.md
README.md
CHANGED
@@ -26,7 +26,7 @@ pipeline_tag: text-generation
 
 #### EZO × PHI-4 × RL - Advancing LLM Training with Deepseek Knowledge
 ##### Overview
-This model is the result of combining
+This model is the result of combining Phi-4 with a reinforcement learning (RL) approach, incorporating insights from the latest research on Deepseek R1. By leveraging a novel training methodology, we successfully improved both Japanese and English capabilities while maintaining a high level of performance across key benchmarks.
 
 ##### Key Features & Improvements
 Enhanced Multilingual Performance: Unlike previous iterations, this model strengthens English capabilities without compromising Japanese proficiency.