Commit 8f5cd82
Parent(s): 3bc5cf3
Update README.md
README.md CHANGED

@@ -14,6 +14,10 @@ Sheared-LLaMA-1.3B is a model pruned and further pre-trained from [meta-llama/Ll
 model = AutoModelForCausalLM.from_pretrained("princeton-nlp/Sheared-LLaMA-1.3B")
 ```
 
+- Smaller-scale
+- Same vocabulary as LLaMA1 and LLaMA2
+- Derived with 50B tokens by utilizing existing strong LLMs
+
 ## Downstream Tasks
 
 We evaluate on an extensive set of downstream tasks including reasoning, reading comprehension, language modeling and knowledge intensive tasks. Our Sheared-LLaMA models outperform existing large language models.
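The one-line `from_pretrained` call shown in the diff can be expanded into a minimal end-to-end sketch. This is an illustrative example, not part of the commit: it assumes the `transformers` and `torch` packages are installed, and the checkpoint is downloaded from the Hub on first use. The `generate` helper name is hypothetical.

```python
def generate(prompt: str, max_new_tokens: int = 50) -> str:
    """Generate a completion from Sheared-LLaMA-1.3B.

    Imports are deferred so the heavy dependencies (transformers, torch)
    are only required when the function is actually called.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Tokenizer and model share the LLaMA vocabulary, per the README notes.
    tokenizer = AutoTokenizer.from_pretrained("princeton-nlp/Sheared-LLaMA-1.3B")
    model = AutoModelForCausalLM.from_pretrained("princeton-nlp/Sheared-LLaMA-1.3B")

    # Encode the prompt, run greedy generation, and decode the result.
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

Calling `generate("The capital of France is")` would download the weights on first use and return the prompt plus a model-written continuation.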