File size: 762 Bytes
1f56d03
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---

license: apache-2.0
base_model: Qwen/Qwen2.5-Coder-7B-Instruct
tags:
- gguf
- quantized
- q4_k_m
---


# Qwen2.5-Coder-7B-Instruct-iat-05-GGUF

This is a GGUF quantized version (q4_k_m) of Qwen/Qwen2.5-Coder-7B-Instruct fine-tuned with the 'iat-05' adapter.

## Model Details

- **Base Model:** Qwen/Qwen2.5-Coder-7B-Instruct
- **Adapter:** iat-05
- **Quantization:** q4_k_m
- **Format:** GGUF

## Usage

This model can be used with llama.cpp or any compatible inference engine that supports GGUF format.

```bash

# Example with llama.cpp

./llama-cli -m Qwen2.5-Coder-7B-Instruct-iat-05-q4_k_m.gguf -p "Your prompt here"

```

## Files

- `Qwen2.5-Coder-7B-Instruct-iat-05-q4_k_m.gguf` - Quantized model in GGUF format (q4_k_m)