nvidia/NVIDIA-Nemotron-Nano-9B-v2

Text Generation

suhara committed on Aug 18

Commit 786a9ff · verified · 1 Parent(s): 4f47b96

Update README.md

Files changed (1)

README.md +4 -3

@@ -247,9 +247,10 @@ print(outputs[0].outputs[0].text)
 The snippet below shows how to use this model with vLLM. Use the following [commit](https://github.com/vllm-project/vllm/commit/75531a6c134282f940c86461b3c40996b4136793) and follow these instructions to build and install vLLM in a docker container.
 ```shell
-# use full commit hash from the main branch
-export VLLM_COMMIT=75531a6c134282f940c86461b3c40996b4136793
-uv pip install vllm --extra-index-url https://wheels.vllm.ai/${VLLM_COMMIT}
+git clone https://github.com/vllm-project/vllm.git
+cd vllm
+git checkout bf756321c72340466911b64602e88013d0210c1c
+VLLM_USE_PRECOMPILED=1 pip install -e .
 ```
 Now you can run run the server with: