Text Generation
Transformers
Safetensors
PyTorch
nvidia
conversational
suhara commited on
Commit
786a9ff
·
verified ·
1 Parent(s): 4f47b96

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -247,9 +247,10 @@ print(outputs[0].outputs[0].text)
247
  The snippet below shows how to use this model with vLLM. Use the following [commit](https://github.com/vllm-project/vllm/commit/75531a6c134282f940c86461b3c40996b4136793) and follow these instructions to build and install vLLM in a docker container.
248
 
249
  ```shell
250
- # use full commit hash from the main branch
251
- export VLLM_COMMIT=75531a6c134282f940c86461b3c40996b4136793
252
- uv pip install vllm --extra-index-url https://wheels.vllm.ai/${VLLM_COMMIT}
 
253
  ```
254
 
255
  Now you can run run the server with:
 
247
  The snippet below shows how to use this model with vLLM. Use the following [commit](https://github.com/vllm-project/vllm/commit/75531a6c134282f940c86461b3c40996b4136793) and follow these instructions to build and install vLLM in a docker container.
248
 
249
  ```shell
250
+ git clone https://github.com/vllm-project/vllm.git
251
+ cd vllm
252
+ git checkout bf756321c72340466911b64602e88013d0210c1c
253
+ VLLM_USE_PRECOMPILED=1 pip install -e .
254
  ```
255
 
256
  Now you can run run the server with: