aerial-d
Collection
Generalized Referring Expression Segmentation on Aerial Photos
โข
5 items
โข
Updated
QLoRA-finetuned Gemma3-12B for generating natural referring expressions in aerial imagery. Distilled from 500 OpenAI o3 samples (~238ร cheaper than direct o3 usage).
google/gemma-3-12b-itUsed in the dataset generation pipeline to enhance rule-based expressions:
# Start vLLM server
vllm serve luisml77/gemma-aerial-12b --port 8000
# Run enhancement (in another terminal)
cd datagen
python pipeline/7_vllm_enhance.py
Full pipeline at GitHub.
python gemma3_lora_finetune.py \
--enhanced_data_dir enhanced_annotations_o3_dual \
--output_dir ./gemma-aerial-12b \
--lora_r 64 --lora_alpha 16
@article{marnoto2025aeriald,
title={Generalized Referring Expression Segmentation on Aerial Photos},
author={Marnoto, Luรญs Pedro Soares},
journal={IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (J-STARS)},
year={2025},
note={Submitted}
}