Update README.md
README.md CHANGED

@@ -1,6 +1,8 @@
 ---
 library_name: transformers
-tags:
+tags:
+- torchao
+license: mit
 ---
 
 [Phi4-mini](https://huggingface.co/microsoft/Phi-4-mini-instruct) model quantized with [torchao](https://huggingface.co/docs/transformers/main/en/quantization/torchao) int4 weight only quantization, by PyTorch team.
@@ -146,4 +148,4 @@ python benchmarks/benchmark_serving.py --backend vllm --dataset-name sharegpt --
 We can use the same command we used in serving benchmarks to serve the model with vllm
 ```
 vllm serve jerryzh168/phi4-mini-int4wo-hqq --tokenizer microsoft/Phi-4-mini-instruct -O3
-```
+```
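For reference, a minimal sketch of loading the int4 checkpoint referenced in this README directly with transformers (outside of vLLM). The repo and tokenizer IDs are taken from the `vllm serve` command above; torchao needs to be installed, and the dtype and generation settings are illustrative assumptions rather than anything stated in the diff:

```
# Sketch: load the pre-quantized int4 checkpoint with transformers + torchao.
# Repo/tokenizer IDs come from the vllm serve command above; the dtype and
# generation settings below are assumptions for illustration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "jerryzh168/phi4-mini-int4wo-hqq",
    device_map="auto",
    torch_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained("microsoft/Phi-4-mini-instruct")

inputs = tokenizer(
    "What is int4 weight only quantization?", return_tensors="pt"
).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

When served with the `vllm serve` command shown in the diff, the same checkpoint is instead exposed over vLLM's OpenAI-compatible HTTP API.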