Video-Text-to-Text
Transformers
Safetensors
English
qwen2
text-generation
text-generation-inference
Changli committed on
Commit
4a13d74
·
verified ·
1 Parent(s): b91c40b

Update README.md

Files changed (1)
  1. README.md +9 -1
README.md CHANGED
@@ -18,4 +18,12 @@ library_name: transformers
 
  # video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models
 
- Official model release of [video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models](https://github.com/bytedance/video-SALMONN-2)
+ Official model release of [video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models](https://github.com/bytedance/video-SALMONN-2)
+
+ [Github Link](https://github.com/bytedance/video-SALMONN-2)
+
+ [Paper Link](https://arxiv.org/abs/2506.15220)
+
+ ## Results
+
+ <img width="857" height="510" alt="image" src="https://github.com/user-attachments/assets/aca20b2e-1e68-4b44-a26b-03d5f070b213" />