Update README.md
README.md (changed)
@@ -33,7 +33,7 @@ inference:
 - **Model Type:** GGUF quantized (q4_k_m and q8_0)
 - **Base Model:** unsloth/llama-3-8b-bnb-4bit
 - **Quantization Details:**
-  - Methods: q4_k_m
+  - Methods: q4_k_m, q8_0, BF16
 - q4_k_m uses Q6_K for half of attention.wv and feed_forward.w2 tensors
 - Optimized for both speed (q8_0) and quality (q4_k_m)
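For context on how a quant listed above might be consumed, here is a minimal sketch using llama-cpp-python; the file name `llama-3-8b.Q4_K_M.gguf`, the context size, and the generation settings are illustrative assumptions, not details taken from this README.

```python
# Minimal sketch (assumed setup): load the q4_k_m GGUF file with llama-cpp-python
# and run a short chat completion. File name and parameters are illustrative only.
from llama_cpp import Llama

llm = Llama(
    model_path="llama-3-8b.Q4_K_M.gguf",  # assumed local name for the q4_k_m quant
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload all layers to GPU if available; use 0 for CPU-only
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize GGUF quantization in one sentence."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```

The same call works for the q8_0 or BF16 files by pointing `model_path` at the corresponding GGUF.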