kernels-community/quantization

  • 2 contributors
History: 50 commits
Latest commit: Build (x86_64), b65f8ab, by danieldk (HF Staff), 1 day ago
  • attention
    Sync to vLLM 20250627 8 days ago
  • build
    Build (x86_64) 1 day ago
  • compressed_tensors
    Sync to vLLM 20250627 8 days ago
  • core
    Sync to vLLM 20250627 8 days ago
  • cutlass_extensions
    Sync to vLLM 20250627 8 days ago
  • cutlass_w8a8
    Sync to vLLM 20250627 8 days ago
  • fp8
    Sync to vLLM 20250627 8 days ago
  • gptq_marlin
    Sync to vLLM 20250627 8 days ago
  • marlin
    Sync to vLLM 20250627 8 days ago
  • tests
    Sync to vLLM 20250627 8 days ago
  • torch-ext
    Fix absolute imports 2 days ago
  • .gitattributes
    1.56 kB
    Build 7 months ago
  • LICENSE
    11.4 kB
    Add cutlass_w8a8 7 months ago
  • README.md
    195 Bytes
    Update README.md (#1) 5 months ago
  • build.toml
    5.96 kB
    Fix undefined symbol on CUDA 11.8 1 day ago
  • cuda_utils.h
    1.41 kB
    Sync on vLLM 20240402 3 months ago
  • dispatch_utils.h
    3.9 kB
    Sync to vLLM 20250627 8 days ago
  • flake.lock
    4.5 kB
    Fix absolute imports 2 days ago
  • flake.nix
    352 Bytes
    Fix absolute imports 2 days ago
  • utils.cuh
    1.84 kB
    Sync on vLLM 20240402 3 months ago
  • vectorization.cuh
    878 Bytes
    Sync to vLLM 20250627 8 days ago
  • vectorization_utils.cuh
    2.61 kB
    Sync to vLLM 20250627 8 days ago