Models
Datasets
Spaces
Docs
Enterprise
SoraWatermarkRemover
Log In
Sign Up

RedHatAI
/

quantization

Model card Files Files and versions

1.08 GB

2 contributors

History: 29 commits

danieldk's picture

danieldk HF Staff

Add `flake.lock`

6d36a16 8 months ago

build
Build (Torch 2.6) 10 months ago
compressed_tensors
Sync with vLLM 10 months ago
core
Sync with vLLM 10 months ago
cutlass_extensions
Sync with vLLM 10 months ago
cutlass_w8a8
Sync with vLLM 10 months ago
fp8
Sync with vLLM 10 months ago
gptq_marlin
Sync with vLLM 10 months ago
marlin
Add full Marlin support and tests for Marlin/CUTLASS 11 months ago
tests
Add full Marlin support and tests for Marlin/CUTLASS 11 months ago
torch-ext
Update for build.toml changes 9 months ago
.gitattributes

1.56 kB

Build 11 months ago
LICENSE

11.4 kB

Add cutlass_w8a8 11 months ago
README.md

195 Bytes

Update README.md (#1) 9 months ago
build.toml

2.82 kB

Update for build.toml changes 9 months ago
dispatch_utils.h

1.49 kB

Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm` 11 months ago
flake.lock

3.09 kB

Add `flake.lock` 8 months ago
flake.nix

353 Bytes

Update for latest kernel-builder (add revision) 8 months ago
vectorization.cuh

778 Bytes

Sync with vLLM 10 months ago