Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon May 9, 2024 • 12
AI PC: Text Generation Text generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. OpenVINO/Mixtral-8x7B-Instruct-v0.1-int8-ov Text Generation • Updated Nov 5, 2024 • 12 • 4 OpenVINO/mixtral-8x7b-instruct-v0.1-int4-ov Text Generation • Updated Nov 5, 2024 • 5 • 4 OpenVINO/phi-2-fp16-ov Text Generation • Updated Nov 5, 2024 • 63 • 1 OpenVINO/phi-2-int8-ov Text Generation • Updated Oct 29, 2024 • 23
AI PC: Audio Classification Audio Classification models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. MIT/ast-finetuned-speech-commands-v2 Audio Classification • 85.4M • Updated Sep 10, 2023 • 1.06k • 17 superb/wav2vec2-base-superb-sid Audio Classification • Updated Nov 4, 2021 • 783 • 21 anton-l/wav2vec2-base-superb-sv Audio Classification • Updated Nov 11, 2022 • 1.12k • 3 anton-l/wav2vec2-base-superb-sd Updated Dec 14, 2021 • 484
MIT/ast-finetuned-speech-commands-v2 Audio Classification • 85.4M • Updated Sep 10, 2023 • 1.06k • 17
AI PC: Feature Extraction NLP models for Feature Extraction that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. BAAI/bge-base-en-v1.5 Feature Extraction • 0.1B • Updated Feb 21, 2024 • 4.62M • • 365 BAAI/bge-large-en-v1.5 Feature Extraction • 0.3B • Updated Feb 21, 2024 • 4.89M • • 593 Contrastive-Tension/BERT-Large-CT-STSb Feature Extraction • Updated May 18, 2021 • 6 DeepPavlov/bert-base-cased-conversational Feature Extraction • Updated Nov 8, 2021 • 397 • 8
AI PC: Image-to-Text Image-to-text models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. google/pix2struct-base Image-to-Text • 0.3B • Updated Dec 24, 2023 • 15.2k • 76 microsoft/trocr-base-handwritten Image-to-Text • 0.3B • Updated Feb 11 • 209k • 454
AI PC: Question Answering LLMs for Question Answering that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. aware-ai/roberta-large-squadv2 Question Answering • Updated May 20, 2021 • 2 deepset/bert-base-cased-squad2 Question Answering • 0.1B • Updated Sep 24, 2024 • 53.2k • • 21 deepset/roberta-base-squad2 Question Answering • 0.1B • Updated Sep 24, 2024 • 717k • • 924 distilbert/distilbert-base-uncased-distilled-squad Question Answering • 66.4M • Updated May 6, 2024 • 99k • • 118
distilbert/distilbert-base-uncased-distilled-squad Question Answering • 66.4M • Updated May 6, 2024 • 99k • • 118
AI PC: Text2Text Generation Text2Text Generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. facebook/blenderbot-400M-distill Updated Mar 30, 2023 • 17.6k • 457 facebook/m2m100_418M Updated Feb 29, 2024 • 852k • 319 facebook/mbart-large-50-many-to-one-mmt Updated Mar 28, 2023 • 5.1k • 67 google/mt5-base Updated Jan 24, 2023 • 70.2k • 253
AI PC: Translation LLMs for translation tasks that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. google-t5/t5-base Translation • 0.2B • Updated Feb 14, 2024 • 1.39M • • 755 google-t5/t5-large Translation • 0.7B • Updated Apr 6, 2023 • 237k • • 225 google-t5/t5-small Translation • 60.5M • Updated Jun 30, 2023 • 2.65M • • 502
Intel Neural Chat Fine-tuned 7B parameter LLM models, one of which made it to the top of the 7B HF LLM Leaderboard Intel/neural-chat-7b-v3-3 Text Generation • 7B • Updated Nov 11, 2024 • 28.3k • • 80 Intel/neural-chat-7b-v3-1 Text Generation • 7B • Updated Sep 9, 2024 • 4.14k • • 546 Intel/neural-chat-7b-v3 Text Generation • 7B • Updated Nov 14, 2024 • 48 • 67 Intel/neural-chat-7b-v3-2 Text Generation • Updated Feb 22, 2024 • 1.19k • 56
Stable Diffusion Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding Paper • 2205.11487 • Published May 23, 2022 • 1 Intel/sd-reference-only Updated Feb 9, 2024 • 1 Intel/sd-1.5-square-quantized Updated Aug 29, 2024 • 6 Intel/sd-1.5-lcm-openvino Text-to-Image • Updated Jul 12, 2024 • 13 • 4
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding Paper • 2205.11487 • Published May 23, 2022 • 1
DPT 3.0 DPT 3.0 (MiDaS) models, leveraging ViT and ViT-hybrid backbones Vision Transformers for Dense Prediction Paper • 2103.13413 • Published Mar 24, 2021 • 1 Intel/dpt-large Depth Estimation • 0.3B • Updated Feb 24, 2024 • 150k • 196 Intel/dpt-hybrid-midas Depth Estimation • Updated Feb 9, 2024 • 782k • 102 Intel/dpt-large-ade Image Segmentation • Updated Mar 25, 2024 • 3.87k • • 11
TVP Text-Visual Prompting Intel/tvp-base Updated Mar 29, 2024 • 78 • 1 Intel/tvp-base-ANet Updated Nov 9, 2023 • 4
LDM3D-VR Suite of diffusion models targeting virtual reality development LDM3D-VR: Latent Diffusion Model for 3D VR Paper • 2311.03226 • Published Nov 6, 2023 • 11 Intel/ldm3d-pano Text-to-3D • Updated Mar 11, 2024 • 43 • 56 Intel/ldm3d-4c Text-to-3D • Updated Mar 1, 2024 • 403 • 42 Intel/ldm3d Text-to-3D • Updated Mar 1, 2024 • 75 • 58
DistilBERT Smaller BERT models for question answering and text classification Intel/distilbert-base-cased-distilled-squad-int8-static-inc Question Answering • Updated Mar 21, 2024 • 2 Intel/distilbert-base-uncased-MRPC-int8-dynamic-inc Text Classification • Updated Mar 21, 2024 • 1 • 1 Intel/distilbert-base-uncased-MRPC-int8-static-inc Text Classification • Updated Mar 22, 2024 • 1 Intel/distilbert-base-uncased-distilled-squad-int8-static-inc Question Answering • Updated Mar 29, 2024 • 2.31k • • 5
Intel/distilbert-base-cased-distilled-squad-int8-static-inc Question Answering • Updated Mar 21, 2024 • 2
Intel/distilbert-base-uncased-MRPC-int8-dynamic-inc Text Classification • Updated Mar 21, 2024 • 1 • 1
Intel/distilbert-base-uncased-distilled-squad-int8-static-inc Question Answering • Updated Mar 29, 2024 • 2.31k • • 5
RoBERTa Intel/roberta-base-mrpc Text Classification • Updated Dec 5, 2022 • 84 • 1 Intel/roberta-base-mrpc-int8-dynamic-inc Text Classification • Updated Dec 28, 2022 • 4 Intel/roberta-base-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 6 Intel/roberta-base-squad2-int8-static-inc Updated Mar 21, 2024 • 3 • 1
DeBERTa DeBERTa is a language model that originates from Meta's RoBERTa model with disentangled attention and enhanced mask decoder. Intel/deberta-v3-base-mrpc Text Classification • Updated May 5, 2023 • 28 Intel/deberta-v3-base-mrpc-int8-dynamic-inc Text Classification • Updated Jun 27, 2023 • 1 Intel/deberta-v3-base-mrpc-int8-static-inc Text Classification • Updated May 25, 2023 • 2
ColBERT Text retrieval model, trained on the Natural Questions dataset Intel/ColBERT-NQ Updated Mar 29, 2024 • 6 • 8 google-research-datasets/natural_questions Viewer • Updated Mar 11, 2024 • 26.3k • 11.8k • 119
MiniLM Fine-tuned version of Microsoft's MiniLM models, trained on the GLUE MRPC dataset. Intel/MiniLM-L12-H384-uncased-mrpc Text Classification • Updated Jun 10, 2022 • 2 • 1 Intel/MiniLM-L12-H384-uncased-mrpc-int8-dynamic-inc Text Classification • Updated Dec 28, 2022 • 2 Intel/MiniLM-L12-H384-uncased-mrpc-int8-qat-inc Text Classification • Updated Oct 6, 2023 Intel/MiniLM-L12-H384-uncased-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 3
MS MARCO Large scale information retrieval corpus that was created based on real user search queries using Bing search engine Intel/msmarco_fid_early_exit Updated Oct 29, 2023 • 3 Intel/msmarco_fid Updated Oct 29, 2023 • 3
T5 Originally from Google: Text-To-Text Transfer Transformer (T5) Intel/t5-small-finetuned-cnn-news-int8-dynamic-inc Updated Oct 6, 2023 • 2 Intel/t5-large-finetuned-xsum-cnn-int8-dynamic-inc Updated Mar 21, 2024 • 2 Intel/t5-base-cnn-dm-int8-dynamic-inc Updated Mar 21, 2024 • 3 Intel/t5-small-xsum-int8-dynamic-inc Updated Mar 21, 2024 • 2.31k • 1
XLNet Original paper: XLNet: Generalized Autoregressive Pretraining for Language Understanding Intel/xlnet-base-cased-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 6 Intel/xlnet-base-cased-mrpc Text Classification • Updated Apr 21, 2022 • 4 • 1
LDM3D collection This collection contains the models, papers, and demo associated with the LDM3D release. Intel/ldm3d Text-to-3D • Updated Mar 1, 2024 • 75 • 58 Intel/ldm3d-sr Text-to-3D • Updated Apr 25, 2024 • 6 • 10 Intel/ldm3d-pano Text-to-3D • Updated Mar 11, 2024 • 43 • 56 Intel/ldm3d-4c Text-to-3D • Updated Mar 1, 2024 • 403 • 42
AI PC: Text-to-Image Text-to-image models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. OpenVINO/stable-diffusion-v1-5-fp16-ov Updated Feb 11 • 3 OpenVINO/stable-diffusion-v1-5-int8-ov Updated Aug 5 • 6 OpenVINO/LCM_Dreamshaper_v7-fp16-ov Updated Feb 11 • 3 OpenVINO/LCM_Dreamshaper_v7-int8-ov Updated Feb 11 • 4
AI PC: Automatic Speech Recognition Automatic Speech Recognition models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. openai/whisper-small Automatic Speech Recognition • 0.2B • Updated Feb 29, 2024 • 2.38M • 475 distil-whisper/distil-medium.en Automatic Speech Recognition • 0.4B • Updated Mar 25, 2024 • 23.4k • 126 facebook/hubert-large-ls960-ft Automatic Speech Recognition • Updated May 24, 2022 • 267k • 76 openai/whisper-base Automatic Speech Recognition • 72.6M • Updated Feb 29, 2024 • 361k • 243
distil-whisper/distil-medium.en Automatic Speech Recognition • 0.4B • Updated Mar 25, 2024 • 23.4k • 126
AI PC: Image Classification Image Classification models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. apple/mobilevit-xx-small Image Classification • Updated Feb 24 • 24.3k • • 19 facebook/convnext-base-224 Image Classification • Updated Jun 13, 2023 • 12.9k • • 9 facebook/levit-256 Image Classification • Updated Jun 1, 2022 • 4 google/mobilenet_v1_1.0_224 Image Classification • Updated May 16, 2023 • 1.09k • 1
AI PC: Masked Language Models Masked language models (MLMs) that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. FacebookAI/roberta-base Fill-Mask • 0.1B • Updated Feb 19, 2024 • 14.3M • • 530 FacebookAI/roberta-large Fill-Mask • 0.4B • Updated Feb 19, 2024 • 17.8M • • 254 FacebookAI/xlm-clm-ende-1024 Fill-Mask • 0.2B • Updated Apr 6, 2023 • 14 FacebookAI/xlm-roberta-base Fill-Mask • 0.3B • Updated Feb 19, 2024 • 8.82M • • 751
AI PC: Text Classification Text Classification LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. Alireza1044/albert-base-v2-sst2 Text Classification • Updated Jul 26, 2021 • 47 BAAI/bge-reranker-base Text Classification • 0.3B • Updated Jun 24, 2024 • 897k • 213 ChrisZeng/electra-large-discriminator-nli-efl-tweeteval Text Classification • Updated Apr 20, 2022 • 5 DeepPavlov/xlm-roberta-large-en-ru-mnli Text Classification • Updated Nov 15, 2021 • 96 • 2
ChrisZeng/electra-large-discriminator-nli-efl-tweeteval Text Classification • Updated Apr 20, 2022 • 5
AI PC: Token Classification Token Classification LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. FacebookAI/xlm-roberta-large-finetuned-conll03-english Token Classification • 0.6B • Updated Feb 19, 2024 • 247k • • 179 Jean-Baptiste/roberta-large-ner-english Token Classification • 0.4B • Updated Mar 22, 2023 • 210k • • 76 dslim/bert-base-NER Token Classification • 0.1B • Updated Oct 8, 2024 • 1.79M • • 662 dslim/bert-large-NER Token Classification • 0.3B • Updated Oct 8, 2024 • 84.1k • • 157
FacebookAI/xlm-roberta-large-finetuned-conll03-english Token Classification • 0.6B • Updated Feb 19, 2024 • 247k • • 179
Jean-Baptiste/roberta-large-ner-english Token Classification • 0.4B • Updated Mar 22, 2023 • 210k • • 76
DPT 3.1 DPT 3.1 (MiDaS) models, leveraging state-of-the-art vision backbones such as BEiT and Swinv2 MiDaS v3.1 -- A Model Zoo for Robust Monocular Relative Depth Estimation Paper • 2307.14460 • Published Jul 26, 2023 • 9 Intel/dpt-beit-large-512 Depth Estimation • 0.3B • Updated Jun 21, 2024 • 715 • 8 Intel/dpt-beit-large-384 Depth Estimation • 0.3B • Updated Jun 21, 2024 • 147 Intel/dpt-beit-base-384 Depth Estimation • 0.1B • Updated Dec 11, 2023 • 784 • 1
MiDaS v3.1 -- A Model Zoo for Robust Monocular Relative Depth Estimation Paper • 2307.14460 • Published Jul 26, 2023 • 9
Whisper Whisper models for automatic speech recognition (ASR) and speech translation, quantized for faster inference speeds. Intel/whisper-base-int8-dynamic-inc Automatic Speech Recognition • Updated Aug 25, 2023 • 2 • 1 Intel/whisper-base-int8-static-inc Automatic Speech Recognition • Updated Aug 25, 2023 • 1 Intel/whisper-base-onnx-int4-inc Automatic Speech Recognition • Updated Oct 16, 2023 • 3 • 9 Intel/whisper-large-int8-dynamic-inc Automatic Speech Recognition • Updated May 18, 2023 • 6 • 2
GPT Series of GPT fine-tuned models Intel/gpt-j-6B-int8-dynamic-inc Text Generation • Updated Apr 19, 2023 • 3 • 16 Intel/gpt-j-6B-int8-static-inc Text Generation • Updated Apr 19, 2023 • 3 • 9 Intel/gpt-j-6B-pytorch-int8-static-inc Text Generation • Updated Jan 18, 2024 • 3 Intel/gpt-j-6b-sparse Text Generation • Updated Dec 7, 2023 • 1
BGE Intel/bge-large-en-v1.5-rag-int8-static Feature Extraction • Updated Feb 19, 2024 • 12 • 2 Intel/bge-base-en-v1.5-rag-int8-static Feature Extraction • Updated Feb 19, 2024 • 2 Intel/bge-small-en-v1.5-rag-int8-static Feature Extraction • Updated Feb 19, 2024 • 27 • 2
BERT BERT models of varying flavors Intel/bert-base-cased-finetuned-sst2-int8-inc Text Classification • Updated Mar 21, 2024 • 2 Intel/bert-base-uncased-CoLA-int8-inc Text Classification • Updated Mar 22, 2024 • 1 Intel/bert-base-uncased-QNLI-int8-inc Text Classification • Updated Mar 22, 2024 • 7 Intel/bert-base-uncased-STS-B-int8-inc Text Classification • Updated Mar 22, 2024 • 6
ALBERT Quantized versions of ALBERT models for language tasks Intel/albert-base-v2-MRPC-int8-inc Text Classification • Updated Mar 22, 2024 • 3 Intel/albert-base-v2-sst2-int8-dynamic-inc Text Classification • Updated Jun 27, 2023 • 3 Intel/albert-base-v2-sst2-int8-static-inc Text Classification • Updated Mar 22, 2024 • 6
CamemBERT Based on Metas's RoBERTa model released in 2019, trained on 138GB of French text. Intel/camembert-base-mrpc Text Classification • Updated Dec 5, 2022 • 1 Intel/camembert-base-mrpc-int8-dynamic-inc Text Classification • Updated Mar 21, 2024 • 2
TinyBERT Question Answering model, trained on the SQuAD 1.1 dataset Intel/dynamic_tinybert Question Answering • Updated Mar 22, 2024 • 1.48k • • 83
BART Adaptations on Meta's BART model Intel/bart-large-mrpc Text Classification • Updated Oct 9, 2023 • 4 Intel/bart-large-mrpc-int8-dynamic-inc Text Classification • Updated Mar 21, 2024 • 2 Intel/bart-large-cnn-int8-dynamic-inc Updated Mar 22, 2024 • 2 • 1
NQ Natural Questions Intel/nq_fid_lfqa_early_exit Updated Oct 29, 2023 • 1 Intel/nq_fid_lfqa Updated Oct 29, 2023 • 2
Electra Intel/electra-small-discriminator-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 2
Intel/electra-small-discriminator-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 2
ViT Originally from Google, Vision Transformer (ViT) Intel/vit-base-patch16-224-int8-static-inc Image Classification • Updated Sep 6, 2022 • 20 • 1
AI PC: Text Generation Text generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. OpenVINO/Mixtral-8x7B-Instruct-v0.1-int8-ov Text Generation • Updated Nov 5, 2024 • 12 • 4 OpenVINO/mixtral-8x7b-instruct-v0.1-int4-ov Text Generation • Updated Nov 5, 2024 • 5 • 4 OpenVINO/phi-2-fp16-ov Text Generation • Updated Nov 5, 2024 • 63 • 1 OpenVINO/phi-2-int8-ov Text Generation • Updated Oct 29, 2024 • 23
AI PC: Text-to-Image Text-to-image models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. OpenVINO/stable-diffusion-v1-5-fp16-ov Updated Feb 11 • 3 OpenVINO/stable-diffusion-v1-5-int8-ov Updated Aug 5 • 6 OpenVINO/LCM_Dreamshaper_v7-fp16-ov Updated Feb 11 • 3 OpenVINO/LCM_Dreamshaper_v7-int8-ov Updated Feb 11 • 4
AI PC: Audio Classification Audio Classification models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. MIT/ast-finetuned-speech-commands-v2 Audio Classification • 85.4M • Updated Sep 10, 2023 • 1.06k • 17 superb/wav2vec2-base-superb-sid Audio Classification • Updated Nov 4, 2021 • 783 • 21 anton-l/wav2vec2-base-superb-sv Audio Classification • Updated Nov 11, 2022 • 1.12k • 3 anton-l/wav2vec2-base-superb-sd Updated Dec 14, 2021 • 484
MIT/ast-finetuned-speech-commands-v2 Audio Classification • 85.4M • Updated Sep 10, 2023 • 1.06k • 17
AI PC: Automatic Speech Recognition Automatic Speech Recognition models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. openai/whisper-small Automatic Speech Recognition • 0.2B • Updated Feb 29, 2024 • 2.38M • 475 distil-whisper/distil-medium.en Automatic Speech Recognition • 0.4B • Updated Mar 25, 2024 • 23.4k • 126 facebook/hubert-large-ls960-ft Automatic Speech Recognition • Updated May 24, 2022 • 267k • 76 openai/whisper-base Automatic Speech Recognition • 72.6M • Updated Feb 29, 2024 • 361k • 243
distil-whisper/distil-medium.en Automatic Speech Recognition • 0.4B • Updated Mar 25, 2024 • 23.4k • 126
AI PC: Feature Extraction NLP models for Feature Extraction that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. BAAI/bge-base-en-v1.5 Feature Extraction • 0.1B • Updated Feb 21, 2024 • 4.62M • • 365 BAAI/bge-large-en-v1.5 Feature Extraction • 0.3B • Updated Feb 21, 2024 • 4.89M • • 593 Contrastive-Tension/BERT-Large-CT-STSb Feature Extraction • Updated May 18, 2021 • 6 DeepPavlov/bert-base-cased-conversational Feature Extraction • Updated Nov 8, 2021 • 397 • 8
AI PC: Image Classification Image Classification models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. apple/mobilevit-xx-small Image Classification • Updated Feb 24 • 24.3k • • 19 facebook/convnext-base-224 Image Classification • Updated Jun 13, 2023 • 12.9k • • 9 facebook/levit-256 Image Classification • Updated Jun 1, 2022 • 4 google/mobilenet_v1_1.0_224 Image Classification • Updated May 16, 2023 • 1.09k • 1
AI PC: Image-to-Text Image-to-text models that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. google/pix2struct-base Image-to-Text • 0.3B • Updated Dec 24, 2023 • 15.2k • 76 microsoft/trocr-base-handwritten Image-to-Text • 0.3B • Updated Feb 11 • 209k • 454
AI PC: Masked Language Models Masked language models (MLMs) that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. FacebookAI/roberta-base Fill-Mask • 0.1B • Updated Feb 19, 2024 • 14.3M • • 530 FacebookAI/roberta-large Fill-Mask • 0.4B • Updated Feb 19, 2024 • 17.8M • • 254 FacebookAI/xlm-clm-ende-1024 Fill-Mask • 0.2B • Updated Apr 6, 2023 • 14 FacebookAI/xlm-roberta-base Fill-Mask • 0.3B • Updated Feb 19, 2024 • 8.82M • • 751
AI PC: Question Answering LLMs for Question Answering that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. aware-ai/roberta-large-squadv2 Question Answering • Updated May 20, 2021 • 2 deepset/bert-base-cased-squad2 Question Answering • 0.1B • Updated Sep 24, 2024 • 53.2k • • 21 deepset/roberta-base-squad2 Question Answering • 0.1B • Updated Sep 24, 2024 • 717k • • 924 distilbert/distilbert-base-uncased-distilled-squad Question Answering • 66.4M • Updated May 6, 2024 • 99k • • 118
distilbert/distilbert-base-uncased-distilled-squad Question Answering • 66.4M • Updated May 6, 2024 • 99k • • 118
AI PC: Text Classification Text Classification LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. Alireza1044/albert-base-v2-sst2 Text Classification • Updated Jul 26, 2021 • 47 BAAI/bge-reranker-base Text Classification • 0.3B • Updated Jun 24, 2024 • 897k • 213 ChrisZeng/electra-large-discriminator-nli-efl-tweeteval Text Classification • Updated Apr 20, 2022 • 5 DeepPavlov/xlm-roberta-large-en-ru-mnli Text Classification • Updated Nov 15, 2021 • 96 • 2
ChrisZeng/electra-large-discriminator-nli-efl-tweeteval Text Classification • Updated Apr 20, 2022 • 5
AI PC: Text2Text Generation Text2Text Generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. facebook/blenderbot-400M-distill Updated Mar 30, 2023 • 17.6k • 457 facebook/m2m100_418M Updated Feb 29, 2024 • 852k • 319 facebook/mbart-large-50-many-to-one-mmt Updated Mar 28, 2023 • 5.1k • 67 google/mt5-base Updated Jan 24, 2023 • 70.2k • 253
AI PC: Token Classification Token Classification LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. FacebookAI/xlm-roberta-large-finetuned-conll03-english Token Classification • 0.6B • Updated Feb 19, 2024 • 247k • • 179 Jean-Baptiste/roberta-large-ner-english Token Classification • 0.4B • Updated Mar 22, 2023 • 210k • • 76 dslim/bert-base-NER Token Classification • 0.1B • Updated Oct 8, 2024 • 1.79M • • 662 dslim/bert-large-NER Token Classification • 0.3B • Updated Oct 8, 2024 • 84.1k • • 157
FacebookAI/xlm-roberta-large-finetuned-conll03-english Token Classification • 0.6B • Updated Feb 19, 2024 • 247k • • 179
Jean-Baptiste/roberta-large-ner-english Token Classification • 0.4B • Updated Mar 22, 2023 • 210k • • 76
AI PC: Translation LLMs for translation tasks that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. google-t5/t5-base Translation • 0.2B • Updated Feb 14, 2024 • 1.39M • • 755 google-t5/t5-large Translation • 0.7B • Updated Apr 6, 2023 • 237k • • 225 google-t5/t5-small Translation • 60.5M • Updated Jun 30, 2023 • 2.65M • • 502
DPT 3.1 DPT 3.1 (MiDaS) models, leveraging state-of-the-art vision backbones such as BEiT and Swinv2 MiDaS v3.1 -- A Model Zoo for Robust Monocular Relative Depth Estimation Paper • 2307.14460 • Published Jul 26, 2023 • 9 Intel/dpt-beit-large-512 Depth Estimation • 0.3B • Updated Jun 21, 2024 • 715 • 8 Intel/dpt-beit-large-384 Depth Estimation • 0.3B • Updated Jun 21, 2024 • 147 Intel/dpt-beit-base-384 Depth Estimation • 0.1B • Updated Dec 11, 2023 • 784 • 1
MiDaS v3.1 -- A Model Zoo for Robust Monocular Relative Depth Estimation Paper • 2307.14460 • Published Jul 26, 2023 • 9
Intel Neural Chat Fine-tuned 7B parameter LLM models, one of which made it to the top of the 7B HF LLM Leaderboard Intel/neural-chat-7b-v3-3 Text Generation • 7B • Updated Nov 11, 2024 • 28.3k • • 80 Intel/neural-chat-7b-v3-1 Text Generation • 7B • Updated Sep 9, 2024 • 4.14k • • 546 Intel/neural-chat-7b-v3 Text Generation • 7B • Updated Nov 14, 2024 • 48 • 67 Intel/neural-chat-7b-v3-2 Text Generation • Updated Feb 22, 2024 • 1.19k • 56
Whisper Whisper models for automatic speech recognition (ASR) and speech translation, quantized for faster inference speeds. Intel/whisper-base-int8-dynamic-inc Automatic Speech Recognition • Updated Aug 25, 2023 • 2 • 1 Intel/whisper-base-int8-static-inc Automatic Speech Recognition • Updated Aug 25, 2023 • 1 Intel/whisper-base-onnx-int4-inc Automatic Speech Recognition • Updated Oct 16, 2023 • 3 • 9 Intel/whisper-large-int8-dynamic-inc Automatic Speech Recognition • Updated May 18, 2023 • 6 • 2
Stable Diffusion Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding Paper • 2205.11487 • Published May 23, 2022 • 1 Intel/sd-reference-only Updated Feb 9, 2024 • 1 Intel/sd-1.5-square-quantized Updated Aug 29, 2024 • 6 Intel/sd-1.5-lcm-openvino Text-to-Image • Updated Jul 12, 2024 • 13 • 4
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding Paper • 2205.11487 • Published May 23, 2022 • 1
GPT Series of GPT fine-tuned models Intel/gpt-j-6B-int8-dynamic-inc Text Generation • Updated Apr 19, 2023 • 3 • 16 Intel/gpt-j-6B-int8-static-inc Text Generation • Updated Apr 19, 2023 • 3 • 9 Intel/gpt-j-6B-pytorch-int8-static-inc Text Generation • Updated Jan 18, 2024 • 3 Intel/gpt-j-6b-sparse Text Generation • Updated Dec 7, 2023 • 1
DPT 3.0 DPT 3.0 (MiDaS) models, leveraging ViT and ViT-hybrid backbones Vision Transformers for Dense Prediction Paper • 2103.13413 • Published Mar 24, 2021 • 1 Intel/dpt-large Depth Estimation • 0.3B • Updated Feb 24, 2024 • 150k • 196 Intel/dpt-hybrid-midas Depth Estimation • Updated Feb 9, 2024 • 782k • 102 Intel/dpt-large-ade Image Segmentation • Updated Mar 25, 2024 • 3.87k • • 11
BGE Intel/bge-large-en-v1.5-rag-int8-static Feature Extraction • Updated Feb 19, 2024 • 12 • 2 Intel/bge-base-en-v1.5-rag-int8-static Feature Extraction • Updated Feb 19, 2024 • 2 Intel/bge-small-en-v1.5-rag-int8-static Feature Extraction • Updated Feb 19, 2024 • 27 • 2
TVP Text-Visual Prompting Intel/tvp-base Updated Mar 29, 2024 • 78 • 1 Intel/tvp-base-ANet Updated Nov 9, 2023 • 4
LDM3D-VR Suite of diffusion models targeting virtual reality development LDM3D-VR: Latent Diffusion Model for 3D VR Paper • 2311.03226 • Published Nov 6, 2023 • 11 Intel/ldm3d-pano Text-to-3D • Updated Mar 11, 2024 • 43 • 56 Intel/ldm3d-4c Text-to-3D • Updated Mar 1, 2024 • 403 • 42 Intel/ldm3d Text-to-3D • Updated Mar 1, 2024 • 75 • 58
BERT BERT models of varying flavors Intel/bert-base-cased-finetuned-sst2-int8-inc Text Classification • Updated Mar 21, 2024 • 2 Intel/bert-base-uncased-CoLA-int8-inc Text Classification • Updated Mar 22, 2024 • 1 Intel/bert-base-uncased-QNLI-int8-inc Text Classification • Updated Mar 22, 2024 • 7 Intel/bert-base-uncased-STS-B-int8-inc Text Classification • Updated Mar 22, 2024 • 6
DistilBERT Smaller BERT models for question answering and text classification Intel/distilbert-base-cased-distilled-squad-int8-static-inc Question Answering • Updated Mar 21, 2024 • 2 Intel/distilbert-base-uncased-MRPC-int8-dynamic-inc Text Classification • Updated Mar 21, 2024 • 1 • 1 Intel/distilbert-base-uncased-MRPC-int8-static-inc Text Classification • Updated Mar 22, 2024 • 1 Intel/distilbert-base-uncased-distilled-squad-int8-static-inc Question Answering • Updated Mar 29, 2024 • 2.31k • • 5
Intel/distilbert-base-cased-distilled-squad-int8-static-inc Question Answering • Updated Mar 21, 2024 • 2
Intel/distilbert-base-uncased-MRPC-int8-dynamic-inc Text Classification • Updated Mar 21, 2024 • 1 • 1
Intel/distilbert-base-uncased-distilled-squad-int8-static-inc Question Answering • Updated Mar 29, 2024 • 2.31k • • 5
ALBERT Quantized versions of ALBERT models for language tasks Intel/albert-base-v2-MRPC-int8-inc Text Classification • Updated Mar 22, 2024 • 3 Intel/albert-base-v2-sst2-int8-dynamic-inc Text Classification • Updated Jun 27, 2023 • 3 Intel/albert-base-v2-sst2-int8-static-inc Text Classification • Updated Mar 22, 2024 • 6
RoBERTa Intel/roberta-base-mrpc Text Classification • Updated Dec 5, 2022 • 84 • 1 Intel/roberta-base-mrpc-int8-dynamic-inc Text Classification • Updated Dec 28, 2022 • 4 Intel/roberta-base-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 6 Intel/roberta-base-squad2-int8-static-inc Updated Mar 21, 2024 • 3 • 1
CamemBERT Based on Metas's RoBERTa model released in 2019, trained on 138GB of French text. Intel/camembert-base-mrpc Text Classification • Updated Dec 5, 2022 • 1 Intel/camembert-base-mrpc-int8-dynamic-inc Text Classification • Updated Mar 21, 2024 • 2
DeBERTa DeBERTa is a language model that originates from Meta's RoBERTa model with disentangled attention and enhanced mask decoder. Intel/deberta-v3-base-mrpc Text Classification • Updated May 5, 2023 • 28 Intel/deberta-v3-base-mrpc-int8-dynamic-inc Text Classification • Updated Jun 27, 2023 • 1 Intel/deberta-v3-base-mrpc-int8-static-inc Text Classification • Updated May 25, 2023 • 2
ColBERT Text retrieval model, trained on the Natural Questions dataset Intel/ColBERT-NQ Updated Mar 29, 2024 • 6 • 8 google-research-datasets/natural_questions Viewer • Updated Mar 11, 2024 • 26.3k • 11.8k • 119
TinyBERT Question Answering model, trained on the SQuAD 1.1 dataset Intel/dynamic_tinybert Question Answering • Updated Mar 22, 2024 • 1.48k • • 83
MiniLM Fine-tuned version of Microsoft's MiniLM models, trained on the GLUE MRPC dataset. Intel/MiniLM-L12-H384-uncased-mrpc Text Classification • Updated Jun 10, 2022 • 2 • 1 Intel/MiniLM-L12-H384-uncased-mrpc-int8-dynamic-inc Text Classification • Updated Dec 28, 2022 • 2 Intel/MiniLM-L12-H384-uncased-mrpc-int8-qat-inc Text Classification • Updated Oct 6, 2023 Intel/MiniLM-L12-H384-uncased-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 3
BART Adaptations on Meta's BART model Intel/bart-large-mrpc Text Classification • Updated Oct 9, 2023 • 4 Intel/bart-large-mrpc-int8-dynamic-inc Text Classification • Updated Mar 21, 2024 • 2 Intel/bart-large-cnn-int8-dynamic-inc Updated Mar 22, 2024 • 2 • 1
NQ Natural Questions Intel/nq_fid_lfqa_early_exit Updated Oct 29, 2023 • 1 Intel/nq_fid_lfqa Updated Oct 29, 2023 • 2
MS MARCO Large scale information retrieval corpus that was created based on real user search queries using Bing search engine Intel/msmarco_fid_early_exit Updated Oct 29, 2023 • 3 Intel/msmarco_fid Updated Oct 29, 2023 • 3
T5 Originally from Google: Text-To-Text Transfer Transformer (T5) Intel/t5-small-finetuned-cnn-news-int8-dynamic-inc Updated Oct 6, 2023 • 2 Intel/t5-large-finetuned-xsum-cnn-int8-dynamic-inc Updated Mar 21, 2024 • 2 Intel/t5-base-cnn-dm-int8-dynamic-inc Updated Mar 21, 2024 • 3 Intel/t5-small-xsum-int8-dynamic-inc Updated Mar 21, 2024 • 2.31k • 1
Electra Intel/electra-small-discriminator-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 2
Intel/electra-small-discriminator-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 2
XLNet Original paper: XLNet: Generalized Autoregressive Pretraining for Language Understanding Intel/xlnet-base-cased-mrpc-int8-static-inc Text Classification • Updated Mar 21, 2024 • 6 Intel/xlnet-base-cased-mrpc Text Classification • Updated Apr 21, 2022 • 4 • 1
ViT Originally from Google, Vision Transformer (ViT) Intel/vit-base-patch16-224-int8-static-inc Image Classification • Updated Sep 6, 2022 • 20 • 1
LDM3D collection This collection contains the models, papers, and demo associated with the LDM3D release. Intel/ldm3d Text-to-3D • Updated Mar 1, 2024 • 75 • 58 Intel/ldm3d-sr Text-to-3D • Updated Apr 25, 2024 • 6 • 10 Intel/ldm3d-pano Text-to-3D • Updated Mar 11, 2024 • 43 • 56 Intel/ldm3d-4c Text-to-3D • Updated Mar 1, 2024 • 403 • 42