Quantization from: bhenrym14/airoboros-33b-gpt4-1.4.1-PI-8192-fp16
Converted to the GGML format with: llama.cpp master-6e7cca4 (JUL 15, 2023)
Tested with: koboldcpp 1.35
Example usage:
koboldcpp.exe airoboros-33b-gpt4-1.4.1-PI-8192-ggmlv3.Q2_K.bin --threads 6 --linearrope --contextsize 8192 --stream --smartcontext --unbantokens --noblas
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.