YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Gemma3-Aerial-12B

QLoRA-finetuned Gemma3-12B for generating natural referring expressions in aerial imagery. Distilled from 500 OpenAI o3 samples (~238ร— cheaper than direct o3 usage).

Links

Model Details

Usage

Used in the dataset generation pipeline to enhance rule-based expressions:

# Start vLLM server
vllm serve luisml77/gemma-aerial-12b --port 8000

# Run enhancement (in another terminal)
cd datagen
python pipeline/7_vllm_enhance.py

Full pipeline at GitHub.

Training

python gemma3_lora_finetune.py \
  --enhanced_data_dir enhanced_annotations_o3_dual \
  --output_dir ./gemma-aerial-12b \
  --lora_r 64 --lora_alpha 16

Citation

@article{marnoto2025aeriald,
  title={Generalized Referring Expression Segmentation on Aerial Photos},
  author={Marnoto, Luรญs Pedro Soares},
  journal={IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (J-STARS)},
  year={2025},
  note={Submitted}
}
Downloads last month
23
Safetensors
Model size
13B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Collection including luisml77/gemma-aerial-12b