mobiuslabsgmbh/CLIP-ViT-H-14-laion2B-2bit_g16_s128-HQQ • Image Classification • Updated Aug 22 • 12 • 5
mobiuslabsgmbh/Llama-3.1-8B-Instruct_mxfp4_weights_calib_demo • Text Generation • Updated Jun 26 • 12 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_nvfp4_weights_calib_demo • Text Generation • Updated Jun 26 • 10 • 1
mobiuslabsgmbh/Qwen2.5-VL-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit • Image-to-Text • Updated Jun 4 • 7 • 2
mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_16bit • Text Generation • Updated Jun 4 • 16 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_16bit • Text Generation • Updated Jun 4 • 9 • 1
mobiuslabsgmbh/Qwen2.5-7B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit • Text Generation • Updated Jun 4 • 9 • 2
mobiuslabsgmbh/Phi-4-mini-instruct_gemlite-ao_a16w4_gs_128_pack_32bit • Text Generation • Updated Jun 4 • 16 • 1
mobiuslabsgmbh/Llama-3.1-8B-Instruct_gemlite-ao_a16w4_gs_128_pack_32bit • Text Generation • Updated Jun 3 • 16 • 2
mobiuslabsgmbh/Meta-Llama-3-8B-Instruct_4bitgs64_hqq_hf • Text Generation • 5B • Updated May 23 • 9 • 2
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1_4bitgs64_hqq_hf • Text Generation • 25B • Updated Feb 10 • 10 • 1
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1-hf-attn-4bit-moe-2bitgs8-metaoffload-HQQ • Text Generation • Updated Feb 5 • 15 • 19
mobiuslabsgmbh/Mixtral-8x7B-Instruct-v0.1-hf-attn-4bit-moe-2bit-metaoffload-HQQ • Text Generation • Updated Feb 5 • 11 • 16