Update README.md
Browse files
README.md
CHANGED
|
@@ -50,7 +50,7 @@ print(output[0].outputs[0].text)
|
|
| 50 |
## Serving
|
| 51 |
Then we can serve with the following command:
|
| 52 |
```
|
| 53 |
-
vllm serve pytorch/Phi-4-mini-instruct-
|
| 54 |
```
|
| 55 |
|
| 56 |
# Inference with Transformers
|
|
|
|
| 50 |
## Serving
|
| 51 |
Then we can serve with the following command:
|
| 52 |
```
|
| 53 |
+
vllm serve pytorch/Phi-4-mini-instruct-float8dq --tokenizer microsoft/Phi-4-mini-instruct -O3
|
| 54 |
```
|
| 55 |
|
| 56 |
# Inference with Transformers
|