
AI PC: Text Generation
Text generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU.
- Text Generation • Updated • 58 • 4
OpenVINO/mixtral-8x7b-instruct-v0.1-int4-ov
Text Generation • Updated • 35 • 4OpenVINO/phi-2-fp16-ov
Text Generation • Updated • 80 • 1OpenVINO/phi-2-int8-ov
Text Generation • Updated • 24OpenVINO/phi-2-int4-ov
Text Generation • Updated • 11 • 1OpenVINO/mistral-7b-instruct-v0.1-fp16-ov
Text Generation • Updated • 19OpenVINO/mistral-7b-instruct-v0.1-int8-ov
Text Generation • Updated • 131 • 1OpenVINO/mistral-7b-instruct-v0.1-int4-ov
Text Generation • Updated • 150OpenVINO/starcoder2-15b-fp16-ov
Text Generation • Updated • 9OpenVINO/starcoder2-15b-int8-ov
Text Generation • Updated • 8OpenVINO/starcoder2-15b-int4-ov
Text Generation • Updated • 171OpenVINO/neural-chat-7b-v3-3-fp16-ov
Text Generation • Updated • 12OpenVINO/neural-chat-7b-v3-3-int8-ov
Text Generation • Updated • 15 • 1OpenVINO/neural-chat-7b-v3-3-int4-ov
Text Generation • Updated • 33 • 1OpenVINO/mpt-7b-fp16-ov
Text Generation • Updated • 16OpenVINO/mpt-7b-int8-ov
Text Generation • Updated • 7OpenVINO/mpt-7b-int4-ov
Text Generation • Updated • 9OpenVINO/Phi-3-mini-128k-instruct-fp16-ov
Text Generation • Updated • 10OpenVINO/Phi-3-mini-128k-instruct-int8-ov
Text Generation • Updated • 85 • 3OpenVINO/Phi-3-mini-128k-instruct-int4-ov
Text Generation • Updated • 56 • 2OpenVINO/falcon-7b-instruct-fp16-ov
Text Generation • Updated • 6OpenVINO/falcon-7b-instruct-int8-ov
Text Generation • Updated • 8OpenVINO/falcon-7b-instruct-int4-ov
Text Generation • Updated • 5OpenVINO/open_llama_3b_v2-fp16-ov
Text Generation • Updated • 13OpenVINO/open_llama_3b_v2-int8-ov
Text Generation • Updated • 30 • 1OpenVINO/open_llama_3b_v2-int4-ov
Text Generation • Updated • 22OpenVINO/open_llama_7b_v2-fp16-ov
Text Generation • Updated • 16OpenVINO/open_llama_7b_v2-int8-ov
Text Generation • Updated • 22OpenVINO/open_llama_7b_v2-int4-ov
Text Generation • Updated • 16OpenVINO/gpt-j-6b-fp16-ov
Text Generation • Updated • 12OpenVINO/gpt-j-6b-int8-ov
Text Generation • Updated • 15OpenVINO/gpt-j-6b-int4-ov
Text Generation • Updated • 25OpenVINO/RedPajama-INCITE-7B-Chat-fp16-ov
Text Generation • Updated • 8OpenVINO/RedPajama-INCITE-7B-Chat-int8-ov
Text Generation • Updated • 9OpenVINO/RedPajama-INCITE-7B-Chat-int4-ov
Text Generation • Updated • 8OpenVINO/RedPajama-INCITE-7B-Instruct-fp16-ov
Text Generation • Updated • 9OpenVINO/RedPajama-INCITE-7B-Instruct-int8-ov
Text Generation • Updated • 11OpenVINO/RedPajama-INCITE-7B-Instruct-int4-ov
Text Generation • Updated • 7OpenVINO/Mistral-7B-Instruct-v0.2-fp16-ov
Text Generation • Updated • 13OpenVINO/Mistral-7B-Instruct-v0.2-int8-ov
Text Generation • Updated • 15 • 1OpenVINO/Mistral-7B-Instruct-v0.2-int4-ov
Text Generation • Updated • 3.51k • 1OpenVINO/Phi-3-medium-4k-instruct-fp16-ov
Text Generation • Updated • 9OpenVINO/Phi-3-medium-4k-instruct-int8-ov
Text Generation • Updated • 12OpenVINO/Phi-3-medium-4k-instruct-int4-ov
Text Generation • Updated • 11 • 3OpenVINO/pythia-2.8b-fp16-ov
Text Generation • Updated • 8OpenVINO/pythia-2.8b-int8-ov
Text Generation • Updated • 10OpenVINO/pythia-2.8b-int4-ov
Text Generation • Updated • 7OpenVINO/pythia-6.9b-fp16-ov
Text Generation • Updated • 10OpenVINO/pythia-6.9b-int8-ov
Text Generation • Updated • 10OpenVINO/pythia-6.9b-int4-ov
Text Generation • Updated • 6OpenVINO/pythia-1b-int4-ov
Text Generation • Updated • 6OpenVINO/neural-chat-7b-v1-1-fp16-ov
Text Generation • Updated • 10OpenVINO/neural-chat-7b-v1-1-int8-ov
Text Generation • Updated • 9OpenVINO/neural-chat-7b-v1-1-int4-ov
Text Generation • Updated • 12OpenVINO/persimmon-8b-chat-int4-ov
Text Generation • Updated • 8OpenVINO/persimmon-8b-chat-fp16-ov
Text Generation • Updated • 13OpenVINO/persimmon-8b-chat-int8-ov
Text Generation • Updated • 11
AhmedSSoliman/MarianCausalLM
Text Generation • Updated • 24Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
AurelPx/Pegasus-7b-slerp
Text Generation • Updated • 6 • 1Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
BAAI/Aquila-7B
Updated • 2.76k • 18Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
BAAI/Aquila2-7B
Text Generation • Updated • 39 • 6Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
BAAI/AquilaChat-7B
Updated • 3.52k • 48Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
BAAI/AquilaChat2-7B
Text Generation • Updated • 3.56k • 15Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
BigSalmon/GPT2Neo1.3BPoints
Text Generation • Updated • 13Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-1.4b
Text Generation • Updated • 31.5k • 25Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-12b
Text Generation • Updated • 8.78k • 136Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-14m
Text Generation • Updated • 132k • 23Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-160m
Text Generation • Updated • 58.7k • 31Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-1b
Text Generation • Updated • 45.1k • 38Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-2.8b
Text Generation • Updated • 26.9k • 30Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-410m
Text Generation • Updated • 64.6k • 28Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-6.9b
Text Generation • Updated • 42.7k • 55Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
EleutherAI/pythia-70m
Updated • 72.2k • 68Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
HuggingFaceH4/zephyr-7b-beta
Text Generation • Updated • 455k • • 1.71kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Intel/neural-chat-7b-v1-1
Text Generation • Updated • 15 • 23Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Intel/neural-chat-7b-v3-3
Text Generation • Updated • 16.8k • 78Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/CodeQwen1.5-7B-Chat
Text Generation • Updated • 3.86k • 336Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen-1_8B
Text Generation • Updated • 4.82k • 67Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen-1_8B-Chat
Text Generation • Updated • 9.09k • 118Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen-7B
Text Generation • Updated • 11k • 382Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen-7B-Chat
Text Generation • Updated • 20.1k • 774Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen1.5-0.5B
Text Generation • Updated • 71.7k • 163Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen1.5-0.5B-Chat
Text Generation • Updated • 345k • 81Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen1.5-1.8B
Text Generation • Updated • 26.1k • 50Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen1.5-1.8B-Chat
Text Generation • Updated • 23.6k • 51Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen1.5-4B
Text Generation • Updated • 6.94k • 36Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen1.5-4B-Chat
Text Generation • Updated • 21.7k • 41Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen1.5-7B
Text Generation • Updated • 38.9k • 53Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Qwen/Qwen1.5-7B-Chat
Text Generation • Updated • 79.6k • 169Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Salesforce/codegen-2B-multi
Text Generation • Updated • 1.51k • 40Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Salesforce/codegen-350M-mono
Text Generation • Updated • 44.9k • 94Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Salesforce/codegen-6B-multi
Text Generation • Updated • 388 • 20Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Salesforce/codegen2-1B_P
Text Generation • Updated • 1.81k • 41Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Salesforce/codegen2-3_7B_P
Text Generation • Updated • 74 • 15Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
Salesforce/codegen2-7B_P
Text Generation • Updated • 153 • 26Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
WizardLMTeam/WizardMath-7B-V1.1
Text Generation • Updated • 14.2k • • 78Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
X-D-Lab/MindChat-Qwen2-4B
Text Generation • Updated • 55 • 5Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
adept/persimmon-8b-chat
Text Generation • Updated • 2.83k • 42Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
baichuan-inc/Baichuan2-13B-Chat
Text Generation • Updated • 3.39k • 424Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
baichuan-inc/Baichuan2-7B-Base
Text Generation • Updated • 1.3k • 81Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
baichuan-inc/Baichuan2-7B-Chat
Text Generation • Updated • 10.7k • 166Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
bigscience/bloom-560m
Text Generation • Updated • 398k • 355Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
bigscience/bloomz-1b1
Text Generation • Updated • 3.65k • 33Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
bigscience/bloomz-3b
Text Generation • Updated • 4.33k • 80Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
bigscience/bloomz-7b1-mt
Text Generation • Updated • 5.67k • 140Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
facebook/opt-1.3b
Text Generation • Updated • 190k • 170Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
facebook/opt-125m
Text Generation • Updated • 4.52M • 198Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
facebook/opt-13b
Text Generation • Updated • 11.4k • 65Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
facebook/opt-2.7b
Text Generation • Updated • 40.9k • 84Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
facebook/opt-350m
Text Generation • Updated • 114k • 144Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
facebook/opt-6.7b
Text Generation • Updated • 57.8k • 117Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
facebook/opt-iml-1.3b
Text Generation • Updated • 302 • 29Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/codegemma-1.1-2b
Text Generation • Updated • 119 • 17Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/codegemma-1.1-7b-it
Text Generation • Updated • 106 • 49Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/codegemma-2b
Text Generation • Updated • 3.22k • 80Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/codegemma-7b
Text Generation • Updated • 28.7k • 191Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/gemma-1.1-2b-it
Text Generation • Updated • 66.5k • 158Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/gemma-1.1-7b-it
Text Generation • Updated • 12.4k • 272Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/gemma-2b
Text Generation • Updated • 331k • 997Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/gemma-2b-it
Text Generation • Updated • 95.7k • • 741Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/gemma-7b
Text Generation • Updated • 56.3k • 3.17kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
google/gemma-7b-it
Text Generation • Updated • 85.1k • 1.17kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
ibm-granite/granite-3b-code-base-2k
Text Generation • Updated • 1.14k • 37Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
ibm-granite/granite-3b-code-instruct-2k
Text Generation • Updated • 2.2k • 37Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
ibm-granite/granite-8b-code-base-4k
Text Generation • Updated • 56 • 30Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
ibm-granite/granite-8b-code-instruct-4k
Text Generation • Updated • 1.32k • 111Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
internlm/internlm2-1_8b
Text Generation • Updated • 7.91k • 32Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
internlm/internlm2-7b
Text Generation • Updated • 6.99k • 42Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
internlm/internlm2-chat-1_8b
Text Generation • Updated • 5.72k • 32Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
internlm/internlm2-chat-7b
Text Generation • Updated • 13.7k • 82Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
internlm/internlm2-chat-7b-sft
Text Generation • Updated • 5.47k • 6Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
internlm/internlm2-math-7b
Text Generation • Updated • 95 • 27Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
internlm/internlm2-math-base-7b
Text Generation • Updated • 160 • 2Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
ise-uiuc/Magicoder-CL-7B
Text Generation • Updated • 22 • 21Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
ise-uiuc/Magicoder-DS-6.7B
Text Generation • Updated • 709 • 38Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
ise-uiuc/Magicoder-S-CL-7B
Text Generation • Updated • 77 • 44Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
ise-uiuc/Magicoder-S-DS-6.7B
Text Generation • Updated • 731 • 203Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
meta-llama/Llama-2-13b-chat-hf
Text Generation • Updated • 101k • • 1.08kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
meta-llama/Llama-2-13b-hf
Text Generation • Updated • 75.3k • 602Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
meta-llama/Llama-2-7b-chat-hf
Text Generation • Updated • 916k • 4.42kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
meta-llama/Llama-2-7b-hf
Text Generation • Updated • 693k • 2.06kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
meta-llama/Meta-Llama-3-8B
Text Generation • Updated • 2.04M • • 6.17kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
meta-llama/Meta-Llama-3-8B-Instruct
Text Generation • Updated • 1.18M • • 3.97kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
meta-llama/Meta-Llama-Guard-2-8B
Text Generation • Updated • 12.2k • 294Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
microsoft/Phi-3-medium-4k-instruct
Text Generation • Updated • 22.4k • 220Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
microsoft/Phi-3-mini-128k-instruct
Text Generation • Updated • 235k • 1.64kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
microsoft/phi-2
Text Generation • Updated • 556k • 3.33kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 1.5M • • 2.77kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mistralai/Mistral-7B-Instruct-v0.3
Text Generation • Updated • 655k • • 1.73kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mistralai/Mistral-7B-v0.3
Text Generation • Updated • 442k • 480Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mosaicml/mpt-7b
Text Generation • Updated • 25.2k • 1.17kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mosaicml/mpt-7b-8k
Text Generation • Updated • 260 • 26Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mosaicml/mpt-7b-8k-chat
Text Generation • Updated • 24 • 40Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mosaicml/mpt-7b-chat
Text Generation • Updated • 60.5k • 514Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mosaicml/mpt-7b-instruct
Text Generation • Updated • 5.35k • 470Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
mosaicml/mpt-7b-storywriter
Text Generation • Updated • 750 • 836Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
openai-community/gpt2
Text Generation • Updated • 8.07M • 2.74kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
openbmb/MiniCPM-2B-sft-bf16
Text Generation • Updated • 5.96k • 118Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
openchat/openchat-3.6-8b-20240522
Text Generation • Updated • 3.86k • 153Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-2-12b
Text Generation • Updated • 138 • 121Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-2-12b-chat
Text Generation • Updated • 2.33k • 88Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-2-1_6b
Text Generation • Updated • 5.14k • 192Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-2-1_6b-chat
Text Generation • Updated • 2.54k • 33Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-2-zephyr-1_6b
Text Generation • Updated • 8.26k • 185Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-3b-4e1t
Text Generation • Updated • 8.7k • 312Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-base-alpha-3b
Text Generation • Updated • 317 • 82Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-tuned-alpha-7b
Text Generation • Updated • 539 • 360Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stabilityai/stablelm-zephyr-3b
Text Generation • Updated • 10.2k • 255Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
stanford-crfm/BioMedLM
Text Generation • Updated • 13.4k • 423Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
tiiuae/falcon-11B
Text Generation • Updated • 21.5k • 211Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
tiiuae/falcon-7b
Text Generation • Updated • 40k • 1.09kNote This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
tiiuae/falcon-7b-instruct
Text Generation • Updated • 143k • 971Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
togethercomputer/Pythia-Chat-Base-7B
Text Generation • Updated • 114 • 68Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
togethercomputer/RedPajama-INCITE-7B-Base
Text Generation • Updated • 271 • 92Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
togethercomputer/RedPajama-INCITE-7B-Chat
Text Generation • Updated • 118 • 93Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
togethercomputer/RedPajama-INCITE-7B-Instruct
Text Generation • Updated • 385 • 103Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
togethercomputer/RedPajama-INCITE-Chat-3B-v1
Text Generation • Updated • 895 • 152Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation
xverse/XVERSE-7B-Chat
Text Generation • Updated • 17 • 8Note This model was tested with OpenVINO version 2024.1.0, using the OVModelForCausalLM library with INT4 weight compression for the lowest memory footprint. To convert the model to OpenVINO, follow instructions at: https://huggingface.co/docs/optimum/main/en/intel/installation