
Add link to paper

#1 opened by nielsr (HF Staff)

Files changed (1):
  1. README.md (+3 −1)
README.md CHANGED

````diff
@@ -1,13 +1,15 @@
 ---
-license: apache-2.0
 datasets:
 - liuhaotian/LLaVA-CC3M-Pretrain-595K
 library_name: transformers
+license: apache-2.0
 pipeline_tag: image-text-to-text
 ---
 
 # Model Card: LLaVA_MORE-llama_3_1-8B-pretrain
 
+This repository contains the model described in [LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1](https://huggingface.co/papers/2503.15621).
+
 ```LLaVA-MORE``` enhances the well-known LLaVA architecture by integrating the use of LLaMA 3.1 as the language model. We are publicly releasing the checkpoints for stages one and two for the first model with 8B parameters.
 
 In this model space, you will find the stage one (pretrain) weights of LLaVA-MORE LLaMA 3.1 8B.
````