VPTQ-community
community
AI & ML interests
None defined yet.
VPTQ Llama 3.1 Nemotron 70B Instruct HF without finetune
-
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-65536-woft
11B ⢠Updated ⢠3 ⢠5 -
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-256-woft
9B ⢠Updated ⢠3 -
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-65536-woft
8B ⢠Updated ⢠3 -
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-0-woft
7B ⢠Updated ⢠3
arxiv.org/abs/2409.17066, VPTQ Mistral Large Instruct 2407 without finetune
-
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-65536-woft
17B ⢠Updated ⢠6 ⢠2 -
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-256-woft
13B ⢠Updated ⢠4 -
VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-65536-woft
10B ⢠Updated ⢠5 ⢠1 -
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-0-woft
9B ⢠Updated ⢠6
arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 72B Instruct without finetune
-
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-65536-woft
12B ⢠Updated ⢠3 ⢠1 -
VPTQ-community/Qwen2.5-72B-Instruct-v8-k1024-512-woft
8B ⢠Updated ⢠5 ⢠2 -
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-256-woft
9B ⢠Updated ⢠2 ⢠4 -
VPTQ-community/Qwen2.5-72B-Instruct-v8-k512-512-woft
7B ⢠Updated ⢠3 ⢠1
arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 14B Instruct without finetune
-
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-65536-woft
4B ⢠Updated ⢠4 -
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-256-woft
3B ⢠Updated ⢠4 -
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-0-woft
3B ⢠Updated ⢠3 -
VPTQ-community/Qwen2.5-14B-Instruct-v16-k65536-65536-woft
3B ⢠Updated ⢠4
arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 7B Instruct without finetune
-
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-65536-woft
2B ⢠Updated ⢠6 -
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-256-woft
2B ⢠Updated ⢠9 -
VPTQ-community/Qwen2.5-7B-Instruct-v16-k65536-65536-woft
2B ⢠Updated ⢠8 ⢠1 -
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-0-woft
2B ⢠Updated ⢠6
Reproduced VPTQ Tech Report Baseline
arxiv.org/abs/2409.17066, VPTQ Llama 3.3 70B without finetune
-
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-65536-woft
11B ⢠Updated ⢠7 ⢠1 -
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-256-woft
9B ⢠Updated ⢠3 -
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-0-woft
7B ⢠Updated ⢠4 -
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v16-k65536-65536-woft
8B ⢠Updated ⢠9
arxiv.org/abs/2409.17066, VPTQ Llama 3.1 405B Instruct without finetune
-
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-65536-woft
55B ⢠Updated ⢠2 -
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-256-woft
42B ⢠Updated ⢠2 ⢠1 -
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-65536-woft
31B ⢠Updated ⢠2 ⢠3 -
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k32768-32768-woft
29B ⢠Updated ⢠3 ⢠1
arxiv.org/abs/2409.17066, VPTQ Llama 3.1 70B without finetune
-
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-65536-woft
11B ⢠Updated ⢠6 ⢠2 -
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-256-woft
9B ⢠Updated ⢠2 ⢠1 -
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-65536-woft
8B ⢠Updated ⢠23 -
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft-duplicated
8B ⢠Updated ⢠3 ⢠1
arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 32B Instruct without finetune
-
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-65536-woft
6B ⢠Updated ⢠2 ⢠1 -
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-256-woft
5B ⢠Updated ⢠3 ⢠2 -
VPTQ-community/Qwen2.5-32B-Instruct-v16-k65536-65536-woft
4B ⢠Updated ⢠3 ⢠1 -
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-0-woft
4B ⢠Updated ⢠4
arxiv.org/abs/2409.17066, VPTQ Llama 3.1 8B Instruct without finetune
-
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-65536-woft
2B ⢠Updated ⢠49 -
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-4096-woft
2B ⢠Updated ⢠10 -
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-256-woft
2B ⢠Updated ⢠20 -
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft
2B ⢠Updated ⢠906 ⢠4
Hessian and InvHessian Checkpoints
arxiv.org/abs/2409.17066, VPTQ Llama 3.3 70B without finetune
-
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-65536-woft
11B ⢠Updated ⢠7 ⢠1 -
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-256-woft
9B ⢠Updated ⢠3 -
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v8-k65536-0-woft
7B ⢠Updated ⢠4 -
VPTQ-community/Meta-Llama-3.3-70B-Instruct-v16-k65536-65536-woft
8B ⢠Updated ⢠9
VPTQ Llama 3.1 Nemotron 70B Instruct HF without finetune
-
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-65536-woft
11B ⢠Updated ⢠3 ⢠5 -
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-256-woft
9B ⢠Updated ⢠3 -
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-65536-woft
8B ⢠Updated ⢠3 -
VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-0-woft
7B ⢠Updated ⢠3
arxiv.org/abs/2409.17066, VPTQ Llama 3.1 405B Instruct without finetune
-
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-65536-woft
55B ⢠Updated ⢠2 -
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-256-woft
42B ⢠Updated ⢠2 ⢠1 -
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-65536-woft
31B ⢠Updated ⢠2 ⢠3 -
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k32768-32768-woft
29B ⢠Updated ⢠3 ⢠1
arxiv.org/abs/2409.17066, VPTQ Mistral Large Instruct 2407 without finetune
-
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-65536-woft
17B ⢠Updated ⢠6 ⢠2 -
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-256-woft
13B ⢠Updated ⢠4 -
VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-65536-woft
10B ⢠Updated ⢠5 ⢠1 -
VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-0-woft
9B ⢠Updated ⢠6
arxiv.org/abs/2409.17066, VPTQ Llama 3.1 70B without finetune
-
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-65536-woft
11B ⢠Updated ⢠6 ⢠2 -
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-256-woft
9B ⢠Updated ⢠2 ⢠1 -
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-65536-woft
8B ⢠Updated ⢠23 -
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft-duplicated
8B ⢠Updated ⢠3 ⢠1
arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 72B Instruct without finetune
-
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-65536-woft
12B ⢠Updated ⢠3 ⢠1 -
VPTQ-community/Qwen2.5-72B-Instruct-v8-k1024-512-woft
8B ⢠Updated ⢠5 ⢠2 -
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-256-woft
9B ⢠Updated ⢠2 ⢠4 -
VPTQ-community/Qwen2.5-72B-Instruct-v8-k512-512-woft
7B ⢠Updated ⢠3 ⢠1
arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 32B Instruct without finetune
-
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-65536-woft
6B ⢠Updated ⢠2 ⢠1 -
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-256-woft
5B ⢠Updated ⢠3 ⢠2 -
VPTQ-community/Qwen2.5-32B-Instruct-v16-k65536-65536-woft
4B ⢠Updated ⢠3 ⢠1 -
VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-0-woft
4B ⢠Updated ⢠4
arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 14B Instruct without finetune
-
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-65536-woft
4B ⢠Updated ⢠4 -
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-256-woft
3B ⢠Updated ⢠4 -
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-0-woft
3B ⢠Updated ⢠3 -
VPTQ-community/Qwen2.5-14B-Instruct-v16-k65536-65536-woft
3B ⢠Updated ⢠4
arxiv.org/abs/2409.17066, VPTQ Llama 3.1 8B Instruct without finetune
-
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-65536-woft
2B ⢠Updated ⢠49 -
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-4096-woft
2B ⢠Updated ⢠10 -
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-256-woft
2B ⢠Updated ⢠20 -
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft
2B ⢠Updated ⢠906 ⢠4
arxiv.org/abs/2409.17066, VPTQ Qwen 2.5 7B Instruct without finetune
-
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-65536-woft
2B ⢠Updated ⢠6 -
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-256-woft
2B ⢠Updated ⢠9 -
VPTQ-community/Qwen2.5-7B-Instruct-v16-k65536-65536-woft
2B ⢠Updated ⢠8 ⢠1 -
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-0-woft
2B ⢠Updated ⢠6
Hessian and InvHessian Checkpoints
Reproduced VPTQ Tech Report Baseline