heegyu
/

kogpt-j-base-24L

Text Generation

Model card Files Files and versions

heegyu commited on Dec 29, 2022

Commit

aba37a8

·

1 Parent(s): 2acea0c

Update README.md

Files changed (1) hide show

README.md +2 -3

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ widget:
 - Learning Rate: 6e-4, Batch Size: 4(x8), Scheudler: Linear, WarmUp: 1000 step
 - adam_beta1=0.9 adam_beta2=0.98, weight_decay=0.01
 - Training Steps: 625000 (3 epoch)
-- 학습 토큰 수: 57.22B (625000step * 3epoch * 1024seq * 8dev * 4batch / 1024^3)
 - 학습 기간: 2022/12/21 ~ 2022/12/25
 ## 학습에 사용한 데이터
@@ -31,12 +31,11 @@ widget:
 - 국립국어원 일상대화 말뭉치(29.5MB)
 - 국립국어원 문어 말뭉치(2.91GB)
 - 국립국어원 구어 말뭉치(1.1GB)
-- 국립국어원 뉴스 말뭉치(14.16GB)
 - 청와대 국민청원(651.8MB)
 - KcBERT Pre-Training Corpus(11.86GB)
 데이터셋 크기는 전처리한 jsonl파일을 기준으로 함.
-총 토큰 수는 약 19B임
 ## 사용 예시
 ```python

 - Learning Rate: 6e-4, Batch Size: 4(x8), Scheudler: Linear, WarmUp: 1000 step
 - adam_beta1=0.9 adam_beta2=0.98, weight_decay=0.01
 - Training Steps: 625000 (3 epoch)
+- 학습 토큰 수: 19.22B (625000step * 1024seq * 8dev * 4batch / 1024^3)
 - 학습 기간: 2022/12/21 ~ 2022/12/25
 ## 학습에 사용한 데이터
 - 국립국어원 일상대화 말뭉치(29.5MB)
 - 국립국어원 문어 말뭉치(2.91GB)
 - 국립국어원 구어 말뭉치(1.1GB)
 - 청와대 국민청원(651.8MB)
 - KcBERT Pre-Training Corpus(11.86GB)
 데이터셋 크기는 전처리한 jsonl파일을 기준으로 함.
+총 토큰 수는 약 6.4B임
 ## 사용 예시
 ```python