ko-wand-136M

ko-wand-136MλŠ” insturctkrμ—μ„œ μ‚¬μ „ν•™μŠ΅ν•œ SLMμž…λ‹ˆλ‹€.

Model Description

maywell/korean_textbooks와 ν•œκ΅­μ–΄ λ§λ­‰μΉ˜λ₯Ό μ΄μš©ν•˜μ—¬ μ‚¬μ „ν•™μŠ΅ λ˜μ—ˆμŠ΅λ‹ˆλ‹€.

Model Info

λ―ΈμŠ€νŠΈλž„ 아킀텍쳐λ₯Ό 기반으둜 μ™„μ „νžˆ 랜덀 κ°€μ€‘μΉ˜λ₯Ό μ‹œμž‘μœΌλ‘œ μ‚¬μ „ν•™μŠ΅ 된 λͺ¨λΈμž…λ‹ˆλ‹€. Instruction νŠœλ‹λ˜μ§€ μ•Šμ•˜μŠ΅λ‹ˆλ‹€.

Training Details

Batch Size Token Seen lr
1024 2.5B 2e-3 (cosine)

License

apache-2.0

Downloads last month
520
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for instructkr/ko-wand-136M

Quantizations
1 model