Collection of State-of-the-art FP8 Block Quantized Models
NM Testing
company
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
544

nm-testing/SpeculatorLlama3-1-8B-Eagle3-sgl
Updated

nm-testing/Mockup-qwen235-eagle3-fp16-sgl
Updated

nm-testing/Speculator-Qwen3-8B-Eagle3-sgl
Updated

nm-testing/Qwen3-VL-235B-A22B-Instruct-NVFP4
Updated

nm-testing/Mockup-qwen235-eagle3-fp16-speculators-converted
Updated

nm-testing/Llama-3.1-8B-Instruct-FP8-block
Text Generation
•
Updated
•
14

nm-testing/Qwen3-8B-FP8-block
Text Generation
•
Updated

nm-testing/testing-llama3.1.8b-2layer-eagle3
Updated

nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-e2e
Updated
•
144

nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8_channel_weight_static_per_tensor-e2e
Updated
•
203