Update README.md
# Qwen2.5-VL-72B-Instruct-Pointer-AWQ

Since the official `Qwen/Qwen2.5-VL-72B-Instruct-AWQ` doesn't yet work with tensor parallelism on vLLM, this model fixes the issue and supports `--tensor-parallel-size` of 2, 4, or 8 GPUs. Use `vllm==0.7.3`.
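
As a sketch of how this might be launched (the local model path below is an assumption — substitute this repo's actual id or a local download directory):

```shell
# Serve the AWQ model with tensor parallelism across 4 GPUs.
# Requires vllm==0.7.3 and 4 visible GPUs; path is a placeholder.
vllm serve ./Qwen2.5-VL-72B-Instruct-Pointer-AWQ \
    --tensor-parallel-size 4
```

`--tensor-parallel-size` must match the number of GPUs you want to shard the weights across (2, 4, or 8 here); vLLM detects the AWQ quantization from the checkpoint's config.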

# Qwen2.5-VL-72B-Instruct
<a href="https://chat.qwenlm.ai/" target="_blank" style="margin: 2px;">