Usage: Serving with SGLang

Download the weights. You can replace /local/grok-2 with any other folder name you prefer.

hf download xai-org/grok-2 --local-dir /local/grok-2

Launch a server. The --tp 8 flag sets the tensor-parallel size to 8, so this command expects 8 GPUs.

python3 -m sglang.launch_server --model /local/grok-2 --tokenizer-path /local/grok-2/tokenizer.tok.json --tp 8 --quantization fp8 --attention-backend triton

Send a test request to the running server.

python3 -m sglang.test.send_one --prompt "Human: What is your name? <|separator|>\n\nAssistant:"
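The prompt in the command above wraps the user message in a fixed pattern: a "Human:" prefix, the <|separator|> token, and an "Assistant:" suffix. A minimal sketch of a shell helper that reproduces this pattern for an arbitrary message is shown below. The function name build_prompt is hypothetical (it is not part of SGLang or the model repository), and the template is inferred only from the single example above, so it covers one user turn and nothing more.

```shell
# Hypothetical helper (not part of SGLang or the Grok-2 repo).
# Wraps a single user message in the prompt pattern used by the example above.
build_prompt() {
  # '\\n' makes printf emit a literal backslash followed by 'n', which matches
  # the characters the shell passes for the double-quoted example prompt.
  printf 'Human: %s <|separator|>\\n\\nAssistant:' "$1"
}

# Print the resulting prompt for inspection.
build_prompt "What is your name?"
echo
```

With the helper defined in the current shell, the request could then be sent as python3 -m sglang.test.send_one --prompt "$(build_prompt "What is your name?")", which passes the same string as the original command.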