baileyk committed on
Commit 04f5e7c · verified · 1 Parent(s): 7db0e9e

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -54,8 +54,8 @@ pip install transformers>=4.57.0
 You can use OLMo with the standard HuggingFace transformers library:
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
-olmo = AutoModelForCausalLM.from_pretrained("allenai/Olmo-3-RLZero-IF-7B")
-tokenizer = AutoTokenizer.from_pretrained("allenai/Olmo-3-RLZero-IF-7B")
+olmo = AutoModelForCausalLM.from_pretrained("allenai/Olmo-3-7B-RL-Zero-IF")
+tokenizer = AutoTokenizer.from_pretrained("allenai/Olmo-3-7B-RL-Zero-IF")
 message = ["Language modeling is "]
 inputs = tokenizer(message, return_tensors='pt', return_token_type_ids=False)
 # optional verifying cuda
@@ -68,7 +68,7 @@ print(tokenizer.batch_decode(response, skip_special_tokens=True)[0])
 
 For faster performance, you can quantize the model using the following method:
 ```python
-AutoModelForCausalLM.from_pretrained("allenai/Olmo-3-RLZero-IF-7B",
+AutoModelForCausalLM.from_pretrained("allenai/Olmo-3-7B-RL-Zero-IF",
 torch_dtype=torch.float16,
 load_in_8bit=True) # Requires bitsandbytes
 ```
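
For reference, below is a self-contained sketch of the updated usage with the corrected model ID. The generation call and its sampling arguments (`max_new_tokens`, `do_sample`, `top_p`) are illustrative assumptions and are not shown in the diff above, which only covers the loading, tokenization, and decoding lines.

```python
# Minimal end-to-end sketch of the usage described in the updated README.
# Only the model ID is taken from the diff; the generate() arguments below
# are placeholder assumptions, not values from the README.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

olmo = AutoModelForCausalLM.from_pretrained("allenai/Olmo-3-7B-RL-Zero-IF")
tokenizer = AutoTokenizer.from_pretrained("allenai/Olmo-3-7B-RL-Zero-IF")

message = ["Language modeling is "]
inputs = tokenizer(message, return_tensors="pt", return_token_type_ids=False)

# Optionally move the model and inputs to GPU if CUDA is available.
if torch.cuda.is_available():
    olmo = olmo.to("cuda")
    inputs = {k: v.to("cuda") for k, v in inputs.items()}

# Sample a continuation of the prompt (sampling parameters are illustrative).
response = olmo.generate(**inputs, max_new_tokens=100, do_sample=True, top_p=0.95)
print(tokenizer.batch_decode(response, skip_special_tokens=True)[0])
```

The quantized load shown in the second hunk additionally requires the bitsandbytes package; in recent transformers releases the same setting can also be expressed as `quantization_config=BitsAndBytesConfig(load_in_8bit=True)`.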