Models for LaaS - an alzhang Collection

Models for LaaS

updated Jan 27, 2024

A collection of models that we are interested in running, grouped into two categories: (1) text generation models for inference and (2) smaller models that we want to fine-tune (FT).

TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • 1B • Updated Mar 17, 2024 • 3.59M • 1.44k
tiiuae/falcon-7b-instruct

Text Generation • 7B • Updated Oct 12, 2024 • 94.5k • 1.02k
mistralai/Mistral-7B-Instruct-v0.2

Text Generation • 7B • Updated Jul 24 • 3.48M • 3.01k
meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 530 • 4.42k
microsoft/phi-2

Text Generation • 3B • Updated Apr 29, 2024 • 781k • 3.41k
google-t5/t5-small

Translation • 60.5M • Updated Jun 30, 2023 • 2.57M • 503
distilbert/distilgpt2

Text Generation • 88.2M • Updated Feb 19, 2024 • 2.15M • 592

Note: Text generation models.
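
Since these models are listed as candidates for running inference, a rough estimate of how much memory their weights need can help when choosing hardware. The sketch below is only illustrative: it uses the rounded parameter counts shown in the listing (or implied by the model name where the listing shows none), assumes a fixed number of bytes per parameter for each precision, and counts weights only. Activations, KV cache, and framework overhead are not included. The names `MODEL_PARAMS`, `weight_memory_gib`, and `report` are hypothetical helpers, not part of any library.

```python
# Bytes needed to store one parameter at common numeric precisions.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}

# Approximate parameter counts, taken from the listing above.
# These are rounded display values, not exact counts.
MODEL_PARAMS = {
    "TinyLlama/TinyLlama-1.1B-Chat-v1.0": 1.1e9,  # size taken from the model name
    "tiiuae/falcon-7b-instruct": 7e9,
    "mistralai/Mistral-7B-Instruct-v0.2": 7e9,
    "meta-llama/Llama-2-7b": 7e9,                 # assumption: size taken from the model name
    "microsoft/phi-2": 3e9,
    "google-t5/t5-small": 60.5e6,
    "distilbert/distilgpt2": 88.2e6,
}


def weight_memory_gib(num_params, dtype="fp16"):
    """Return the memory needed for the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


def report(dtype="fp16"):
    """Map each model name to its estimated weight memory in GiB (2 decimals)."""
    return {
        name: round(weight_memory_gib(n, dtype), 2)
        for name, n in MODEL_PARAMS.items()
    }


if __name__ == "__main__":
    for name, gib in report("fp16").items():
        print(f"{name:40s} {gib:8.2f} GiB")
```

Calling `report("int8")` or `report("fp32")` gives the same table at a different precision. Treat the numbers as a lower bound on what actually serving a model requires.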
google-bert/bert-base-uncased

Fill-Mask • 0.1B • Updated Feb 19, 2024 • 56.5M • 2.46k
prajjwal1/bert-tiny

Updated Oct 27, 2021 • 13.1M • 132

Note: BERT models, generally intended for fine-tuning. At inference time these are the base encoder models and only perform masked language modeling (MLM).
EfficientNetV2: Smaller Models and Faster Training

Paper • 2104.00298 • Published Apr 1, 2021 • 1

Note: Replace with the relevant models later. Ideally want: EfficientNet (v1, v2), MobileNet, YOLO (v_x), ResNet variants, FPN. Diffusion: SAM (?) or SEEM.