Update README.md
Browse files
README.md
CHANGED
|
@@ -31,6 +31,8 @@ A multimodal vision-language model specialized for multilingual technical docume
|
|
| 31 |
|
| 32 |
QwenAmann-4B-dse is a 4B parameter vision-language model designed for efficient retrieval of technical documentation. It directly encodes document screenshots into embeddings, preserving all information including text, images, and layout without requiring separate content extraction.
|
| 33 |
|
|
|
|
|
|
|
| 34 |
## Performance
|
| 35 |
|
| 36 |
### ENERGY Benchmark (racineai/Open-VLM-Retrieval-Leaderboard)
|
|
|
|
| 31 |
|
| 32 |
QwenAmann-4B-dse is a 4B parameter vision-language model designed for efficient retrieval of technical documentation. It directly encodes document screenshots into embeddings, preserving all information including text, images, and layout without requiring separate content extraction.
|
| 33 |
|
| 34 |
+

|
| 35 |
+
|
| 36 |
## Performance
|
| 37 |
|
| 38 |
### ENERGY Benchmark (racineai/Open-VLM-Retrieval-Leaderboard)
|