RedHatAI/quantization
kernel
License: apache-2.0
4dcf20d
quantization/torch-ext/quantization
103 kB, 2 contributors, History: 6 commits
Latest commit: danieldk (HF Staff), "Fix absolute imports", c516610, 2 months ago
Name                   Scan  Size     Last commit                               Updated
utils                  -     -        Fix absolute imports                      2 months ago
__init__.py            Safe  1.08 kB  Export Marlin and quantization utilities  3 months ago
compressed_tensors.py  Safe  4.5 kB   Fixup platform FP8 data type query        3 months ago
cutlass.py             Safe  2.16 kB  Sync to vLLM 20250627                     3 months ago
marlin.py              Safe  6.29 kB  Sync to vLLM 20250627                     3 months ago
platforms.py           Safe  2.94 kB  Fixup platform FP8 data type query        3 months ago
scalar_type.py         Safe  12.4 kB  Sync to vLLM 20250627                     3 months ago