---
license: apache-2.0
base_model:
- mistralai/Ministral-3-14B-Instruct-2512
---

# Ministral-3-14B-Instruct-2512-TextOnly

This model is the **text-only component** extracted from the Vision-Language Model [mistralai/Ministral-3-14B-Instruct-2512](https://huggingface.co/mistralai/Ministral-3-14B-Instruct-2512).
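
For reference, extracting the text stack from a VLM checkpoint can look roughly like the sketch below. This is **not** the verified procedure used to produce this repository; it assumes the original model loads via `AutoModelForImageTextToText` and exposes its causal language model (including the output head) as a `language_model` attribute, both of which vary across architectures and `transformers` versions.

```python
# Minimal extraction sketch -- assumptions noted in comments, not the
# verified procedure used to build this repository.
import torch
from transformers import AutoModelForImageTextToText, AutoTokenizer

vlm_id = "mistralai/Ministral-3-14B-Instruct-2512"
out_dir = "Ministral-3-14B-Instruct-2512-TextOnly"

# Load the full vision-language model (weights on CPU, bfloat16).
vlm = AutoModelForImageTextToText.from_pretrained(vlm_id, torch_dtype=torch.bfloat16)

# Assumption: the wrapper exposes the full causal LM as `language_model`.
# In some implementations the LM head lives on the wrapper instead, in
# which case the state dict has to be filtered and remapped by hand.
vlm.language_model.save_pretrained(out_dir)

# Reuse the original tokenizer alongside the extracted weights.
AutoTokenizer.from_pretrained(vlm_id).save_pretrained(out_dir)
```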

## Usage

You can load this model using `AutoModelForCausalLM` as shown below:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Aratako/Ministral-3-14B-Instruct-2512-TextOnly"

# Load the tokenizer and the text-only model onto the GPU.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",
    device_map="cuda",
)

messages = [
    {
        "role": "user",
        "content": "Tell me a joke about computers.",
    },
]

# Render the chat template with a generation prompt and move it to the GPU.
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to("cuda")

output = model.generate(
    input_ids, max_new_tokens=512, pad_token_id=tokenizer.eos_token_id
)

# Decode only the newly generated tokens, skipping the prompt.
decoded_output = tokenizer.decode(
    output[0][len(input_ids[0]):], skip_special_tokens=True
)
print(decoded_output)
```
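
Note that because only the language model weights are included, this checkpoint cannot process images; use the original VLM for multimodal inputs.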

## Original Model Information

This is a weight extraction of the original VLM. For detailed benchmarks, licensing details, and architectural information, please refer to the original model card: **[mistralai/Ministral-3-14B-Instruct-2512](https://huggingface.co/mistralai/Ministral-3-14B-Instruct-2512)**