tencent/Hunyuan-7B-Instruct-FP8

Text Generation

hunyuan_v1_dense

compressed-tensors


manaestras committed on Jul 30

Commit 0b9bc85 · verified · 1 Parent(s): 68fdca0

Upload hf_quant_config.json with huggingface_hub

Files changed (1)

hf_quant_config.json +10 -0

hf_quant_config.json ADDED

	@@ -0,0 +1,10 @@

+{
+    "quantization": {
+        "exclude_modules": [
+            "lm_head",
+            "model.embed_tokens"
+        ],
+        "kv_cache_quant_algo": null,
+        "quant_algo": "FP8"
+    }
+}
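
To make the added config concrete, here is a minimal Python sketch of how a consumer might read it. The JSON is copied from the diff above. The helper names (`load_quant_settings`, `is_excluded`), the exact-or-dotted-prefix matching rule, and the layer name `model.layers.0.mlp.down_proj` are illustrative assumptions, not the behavior of any particular loader.

```python
import json

# Contents copied from the hf_quant_config.json added in this commit.
HF_QUANT_CONFIG = """
{
    "quantization": {
        "exclude_modules": [
            "lm_head",
            "model.embed_tokens"
        ],
        "kv_cache_quant_algo": null,
        "quant_algo": "FP8"
    }
}
"""


def load_quant_settings(text):
    """Parse the JSON text and return its "quantization" section."""
    return json.loads(text)["quantization"]


def is_excluded(module_name, exclude_modules):
    """Hypothetical rule: a module is excluded if its name matches an
    entry exactly or sits underneath it in the dotted module path."""
    return any(
        module_name == entry or module_name.startswith(entry + ".")
        for entry in exclude_modules
    )


settings = load_quant_settings(HF_QUANT_CONFIG)
excluded = settings["exclude_modules"]

print("weight quantization:", settings["quant_algo"])
print("KV cache quantization:", settings["kv_cache_quant_algo"])
for name in ["lm_head", "model.embed_tokens", "model.layers.0.mlp.down_proj"]:
    # The last name is an assumed example of a quantized linear layer.
    status = "kept unquantized" if is_excluded(name, excluded) else "FP8"
    print(name, "->", status)
```

Under these assumptions, `lm_head` and `model.embed_tokens` are left unquantized, every other module is treated as FP8, and the JSON `null` for `kv_cache_quant_algo` parses to `None`, meaning no KV cache quantization algorithm is specified.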