Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
quantization
like
6
Follow
Red Hat AI
1.4k
kernel
License:
apache-2.0
Model card
Files
Files and versions
Community
1
main
quantization
Ctrl+K
Ctrl+K
2 contributors
History:
53 commits
danieldk
HF Staff
Build (aarch64-linux)
150f8c2
about 1 month ago
attention
Sync to vLLM 20250627
about 2 months ago
build
Build (aarch64-linux)
about 1 month ago
compressed_tensors
Sync to vLLM 20250627
about 2 months ago
core
Sync to vLLM 20250627
about 2 months ago
cutlass_extensions
Sync to vLLM 20250627
about 2 months ago
cutlass_w8a8
Sync to vLLM 20250627
about 2 months ago
fp8
Sync to vLLM 20250627
about 2 months ago
gptq_marlin
Sync to vLLM 20250627
about 2 months ago
marlin
Sync to vLLM 20250627
about 2 months ago
tests
Sync to vLLM 20250627
about 2 months ago
torch-ext
Fix absolute imports
about 2 months ago
.gitattributes
Safe
1.56 kB
Build
9 months ago
LICENSE
Safe
11.4 kB
Add cutlass_w8a8
9 months ago
README.md
Safe
195 Bytes
Update README.md (#1)
6 months ago
build.toml
Safe
5.96 kB
Fix undefined symbol on CUDA 11.8
about 2 months ago
cuda_utils.h
Safe
1.41 kB
Sync on vLLM 20240402
4 months ago
dispatch_utils.h
Safe
3.9 kB
Sync to vLLM 20250627
about 2 months ago
flake.lock
Safe
4.5 kB
Prepare for Torch 2.8
about 1 month ago
flake.nix
Safe
345 Bytes
Prepare for Torch 2.8
about 1 month ago
utils.cuh
Safe
1.84 kB
Sync on vLLM 20240402
4 months ago
vectorization.cuh
Safe
878 Bytes
Sync to vLLM 20250627
about 2 months ago
vectorization_utils.cuh
Safe
2.61 kB
Sync to vLLM 20250627
about 2 months ago