Update README.md

README.md

@@ -85,6 +85,9 @@ vllm serve pytorch/Phi-4-mini-instruct-float8dq --tokenizer microsoft/Phi-4-mini
 # Model Quality
 We rely on [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) to evaluate the quality of the quantized model.
 ## baseline
 ```
@@ -120,6 +123,11 @@ lm_eval --model hf --model_args pretrained=pytorch/Phi-4-mini-instruct-float8dq
 # Model Performance
 ## Results (H100 machine)
 | Benchmark                        |                |                          |
 |----------------------------------|----------------|--------------------------|

 # Model Quality
 We rely on [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) to evaluate the quality of the quantized model.
+lm-eval must be installed from source. See the installation instructions:
+https://github.com/EleutherAI/lm-evaluation-harness#install
 ## baseline
 ```
 # Model Performance
+Install the vLLM nightly build to pick up recent changes:
+```
+pip install vllm --pre --extra-index-url https://wheels.vllm.ai/nightly
+```
 ## Results (H100 machine)
 | Benchmark                        |                |                          |
 |----------------------------------|----------------|--------------------------|