alzhang's Collections
Models for LaaS

updated Jan 27, 2024

A collection of models that we are interested in running, categorized as: (1) text generation models for inference, and (2) smaller models that we want to fine-tune (FT).

  • TinyLlama/TinyLlama-1.1B-Chat-v1.0

    Text Generation • 1B • Updated Mar 17, 2024 • 3.59M • 1.44k

  • tiiuae/falcon-7b-instruct

    Text Generation • 7B • Updated Oct 12, 2024 • 94.5k • 1.02k

  • mistralai/Mistral-7B-Instruct-v0.2

    Text Generation • 7B • Updated Jul 24 • 3.48M • 3.01k

  • meta-llama/Llama-2-7b

    Text Generation • Updated Apr 17, 2024 • 530 • 4.42k

  • microsoft/phi-2

    Text Generation • 3B • Updated Apr 29, 2024 • 781k • 3.41k

  • google-t5/t5-small

    Translation • 60.5M • Updated Jun 30, 2023 • 2.57M • 503

  • distilbert/distilgpt2

    Text Generation • 88.2M • Updated Feb 19, 2024 • 2.15M • 592

    Note: Text generation models.
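    As a rough illustration of running one of the text generation models above for inference, here is a minimal sketch using the `transformers` `pipeline` API. The list name `TEXT_GENERATION_MODELS` and the helper `generate` are hypothetical names introduced for this example, and the choice of `distilbert/distilgpt2` as the default is only an assumption made because it is the smallest decoder-only model in the group. `google-t5/t5-small` is left out because the page labels it as a Translation model, and some repositories (for example `meta-llama/Llama-2-7b`) may require access approval before they can be downloaded.

```python
# Model IDs copied from the text generation group of this collection.
# google-t5/t5-small is omitted: the page lists it under Translation,
# so it may not fit the "text-generation" pipeline task used below.
TEXT_GENERATION_MODELS = [
    "TinyLlama/TinyLlama-1.1B-Chat-v1.0",
    "tiiuae/falcon-7b-instruct",
    "mistralai/Mistral-7B-Instruct-v0.2",
    "meta-llama/Llama-2-7b",
    "microsoft/phi-2",
    "distilbert/distilgpt2",
]


def generate(prompt: str,
             model_id: str = "distilbert/distilgpt2",
             max_new_tokens: int = 32) -> str:
    """Generate a continuation of `prompt` with the given model.

    Hypothetical helper for illustration only. The model weights are
    downloaded from the Hub on first use, so this needs network access
    and enough memory for the chosen model.
    """
    # Imported lazily so this module can be loaded without transformers.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_id)
    outputs = generator(prompt, max_new_tokens=max_new_tokens)
    # The pipeline returns a list of dicts, one per generated sequence.
    return outputs[0]["generated_text"]


if __name__ == "__main__":
    print(generate("Small language models are useful because"))
```

    The 7B models in the list would likely need a GPU or quantization to run comfortably, which this sketch does not cover.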


  • google-bert/bert-base-uncased

    Fill-Mask • 0.1B • Updated Feb 19, 2024 • 56.5M • 2.46k

  • prajjwal1/bert-tiny

    Updated Oct 27, 2021 • 13.1M • 132

    Note: BERT models, intended mainly for fine-tuning. At inference time, they are base encoder models and only perform masked language modeling (MLM).
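    To make the MLM-only inference behavior concrete, the following is a minimal sketch using the `transformers` `fill-mask` pipeline with `google-bert/bert-base-uncased`. The helpers `mask_word` and `top_mask_predictions` are hypothetical names introduced for this example, and `[MASK]` is assumed as the mask token because that is what BERT's uncased tokenizer uses.

```python
def mask_word(sentence: str, word: str, mask_token: str = "[MASK]") -> str:
    """Replace the first occurrence of `word` with the mask token.

    Hypothetical helper for illustration; raises if `word` is absent.
    """
    if word not in sentence:
        raise ValueError(f"{word!r} not found in sentence")
    return sentence.replace(word, mask_token, 1)


def top_mask_predictions(text: str,
                         model_id: str = "google-bert/bert-base-uncased",
                         top_k: int = 3) -> list:
    """Return (token, score) pairs for the single masked position in `text`.

    Downloads the model from the Hub on first use, so it needs network
    access. Assumes `text` contains exactly one mask token.
    """
    # Imported lazily so this module can be loaded without transformers.
    from transformers import pipeline

    unmasker = pipeline("fill-mask", model=model_id)
    results = unmasker(text, top_k=top_k)
    # Each result is a dict with "token_str" and "score" among its keys.
    return [(r["token_str"], r["score"]) for r in results]


if __name__ == "__main__":
    masked = mask_word("Paris is the capital of France.", "capital")
    for token, score in top_mask_predictions(masked):
        print(f"{token}: {score:.3f}")
```

    For any task beyond filling masks (classification, tagging, and so on), a task-specific head would still need to be fine-tuned on top of the encoder.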


  • EfficientNetV2: Smaller Models and Faster Training

    Paper • 2104.00298 • Published Apr 1, 2021 • 1

    Note: Replace with the relevant models later. Ideally want: EfficientNet (v1, v2), MobileNet, YOLO (v_x), ResNet variants, FPN. Diffusion: SAM (?) or SEEM.
