safetyllm
/

Llama-2-7b-chat-safety

Generated from Trainer

text-generation-inference

llama-2-7b-chat

Model card Files Files and versions

safetyllm commited on Oct 16, 2023

Commit

564a57d

·

1 Parent(s): f599ee3

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -58,7 +58,10 @@ What's your evaluation based on the above unsafe content guidelines?
 ## Training and evaluation data
-More information needed
 ## Training procedure

 ## Training and evaluation data
+The finetuning is comprised of three steps:
+1. Apply LLaMA-2-70B-chat to generate responses to harmless dataset from Anthropic
+2. Apply LLaMA-2-70B-chat and Chatgpt 3.5 to evaluate the (question, answer) pairs generated in Step 1 to make dataset for finetuning
+3. Apply the evaluation dataset from Step 2 to finetune LLaMA-2-7B-chat model using int8 quantization and Low-Rank Adaptation (LoRA)
 ## Training procedure