Runtime error Featured 194 Low-bit Quantized Open LLM Leaderboard 🏆 194 Track, rank and evaluate open LLMs and chatbots
INC: Testing Collection A collection of low precision models generated by Intel Neural Compressor including mxfp8, mxfp4 and nvfp4. • 13 items • Updated 9 days ago