INT4 LLMs for vLLM Collection Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! β’ 18 items β’ Updated Sep 26, 2024 β’ 10
Running Featured 1.03k Can You Run It? LLM version π 1.03k Calculate GPU requirements for running LLMs
meta-llama/Meta-Llama-3-8B-Instruct Text Generation β’ 8B β’ Updated Jun 18 β’ 1.33M β’ β’ 4.32k
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-FP16-BinaryClass-WeightedLoss Token Classification β’ 0.3B β’ Updated Jun 1, 2024 β’ 5
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-BinaryClass-WeightedLoss Token Classification β’ 0.3B β’ Updated Jun 1, 2024 β’ 7
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune Token Classification β’ 0.3B β’ Updated Jun 1, 2024 β’ 7
swtb/XLM-RoBERTa-Large-Conll2003-English-NER-Finetune Token Classification β’ 0.6B β’ Updated May 31, 2024 β’ 10