Update README.md
Browse files
README.md
CHANGED
@@ -19,7 +19,7 @@ base_model:
|
|
19 |
- **KV cache quantization:** OCP FP8
|
20 |
- **Calibration Dataset:** [Pile](https://huggingface.co/datasets/mit-han-lab/pile-val-backup)
|
21 |
|
22 |
-
This model
|
23 |
|
24 |
# Model Quantization
|
25 |
|
|
|
19 |
- **KV cache quantization:** OCP FP8
|
20 |
- **Calibration Dataset:** [Pile](https://huggingface.co/datasets/mit-han-lab/pile-val-backup)
|
21 |
|
22 |
+
This model was built with Meta Llama by applying [AMD-Quark](https://quark.docs.amd.com/latest/index.html) for MXFP4 quantization.
|
23 |
|
24 |
# Model Quantization
|
25 |
|