Commit 926de46 by jusjinuk (verified) · 1 parent: b2707fd

Create README.md

Files changed (1)
  1. README.md +21 -0
README.md ADDED
@@ -0,0 +1,21 @@
+ ---
+ base_model:
+ - meta-llama/Llama-2-70b-hf
+ base_model_relation: quantized
+ license: llama2
+ ---
+ # Model Card
+
+ - Base model: `meta-llama/Llama-2-70b-hf`
+ - Quantization method: BlockLDLQ with GuidedQuant Hessian
+ - Target bit-width: 2
+ - Backend kernel: QTIP kernel (HYB variant)
+ - Calibration data: RedPajama (1024 sentences / 4096 tokens)
+ - Calibration objective: Next-token prediction
+ - num_groups (for GuidedQuant Hessian): 2
+
+ # How to run
+ - Follow the instructions in https://github.com/snu-mllab/GuidedQuant and https://github.com/Cornell-RelaxML/qtip
+
+ # References
+ - [Model Paper](https://arxiv.org/abs/2505.07004)
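
As a rough sanity check on the settings listed in the model card above, the sketch below estimates the weight storage implied by a 2-bit target and the size of the calibration set. It is a back-of-the-envelope calculation and not part of GuidedQuant or QTIP. Three things are assumptions rather than facts from the card: the nominal parameter count of 70 billion, the simplification that every weight is stored at the target bit-width (ignoring embeddings, any layers left unquantized, and codebook overhead), and the reading of "1024 sentences / 4096 tokens" as 1024 samples of 4096 tokens each. The helper name `weight_gigabytes` is hypothetical.

```python
# Back-of-the-envelope estimate; all constants below are assumptions
# made for illustration, not values reported by the model authors.

NOMINAL_PARAMS = 70_000_000_000  # assumed nominal size of Llama-2-70b
TARGET_BITS = 2                  # "Target bit-width: 2" from the model card
FP16_BITS = 16                   # bit-width of the unquantized base weights


def weight_gigabytes(num_params: int, bits_per_weight: float) -> float:
    """Return the weight storage in GB (10^9 bytes) at a given bit-width.

    Ignores embeddings, unquantized layers, and codebook overhead.
    """
    return num_params * bits_per_weight / 8 / 1e9


fp16_gb = weight_gigabytes(NOMINAL_PARAMS, FP16_BITS)
q2_gb = weight_gigabytes(NOMINAL_PARAMS, TARGET_BITS)

# Assumed reading of the calibration line: 1024 samples x 4096 tokens each.
calib_tokens = 1024 * 4096

print(f"FP16 weights:  ~{fp16_gb:.1f} GB")
print(f"2-bit weights: ~{q2_gb:.1f} GB")
print(f"Calibration tokens (assumed): {calib_tokens:,}")
```

Under these assumptions the quantized weights take roughly one eighth of the FP16 storage. The real checkpoint size will differ, so treat the numbers as an order-of-magnitude guide only.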