pandora-s commited on
Commit
6245632
·
verified ·
1 Parent(s): 02f390f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -3
README.md CHANGED
@@ -1,3 +1,55 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: vllm
3
+ language:
4
+ - en
5
+ - fr
6
+ - es
7
+ - de
8
+ - it
9
+ - pt
10
+ - nl
11
+ - zh
12
+ - ja
13
+ - ko
14
+ - ar
15
+ license: apache-2.0
16
+ inference: false
17
+ base_model:
18
+ - mistralai/Ministral-3-3B-Reasoning-2512
19
+ extra_gated_description: >-
20
+ If you want to learn more about how we process your personal data, please read
21
+ our <a href="https://mistral.ai/terms/">Privacy Policy</a>.
22
+ tags:
23
+ - mistral-common
24
+ ---
25
+
26
+ # Ministral 3 3B Reasoning 2512 GGUF
27
+
28
+ The smallest model in the Ministral 3 family, **Ministral 3 3B** is a powerful, efficient tiny language model with vision capabilities.
29
+
30
+ This model includes different quantization levels of the reasoning post-trained version in **GGUF**, trained for reasoning tasks, making it ideal for math, coding and stem related use cases.
31
+
32
+ The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware. Ministral 3 3B can even be deployed locally, fitting in 16GB of VRAM in BF16, and less than 8GB of RAM/VRAM when quantized.
33
+
34
+ ## Key Features
35
+ Ministral 3 3B consists of two main architectural components:
36
+ - **3.4B Language Model**
37
+ - **0.4B Vision Encoder**
38
+
39
+ The Ministral 3 3B Reasoning model offers the following capabilities:
40
+ - **Vision**: Enables the model to analyze images and provide insights based on visual content, in addition to text.
41
+ - **Multilingual**: Supports dozens of languages, including English, French, Spanish, German, Italian, Portuguese, Dutch, Chinese, Japanese, Korean, Arabic.
42
+ - **System Prompt**: Maintains strong adherence and support for system prompts.
43
+ - **Agentic**: Offers best-in-class agentic capabilities with native function calling and JSON outputting.
44
+ - **Reasoning**: Excels at complex, multi-step reasoning and dynamic problem-solving.
45
+ - **Edge-Optimized**: Delivers best-in-class performance at a small scale, deployable anywhere.
46
+ - **Apache 2.0 License**: Open-source license allowing usage and modification for both commercial and non-commercial purposes.
47
+ - **Large Context Window**: Supports a 256k context window.
48
+
49
+ ## Usage
50
+
51
+ ## License
52
+
53
+ This model is licensed under the [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0.txt).
54
+
55
+ *You must not use this model in a manner that infringes, misappropriates, or otherwise violates any third party’s rights, including intellectual property rights.*