Commit 16722b8 (verified) · michaelzhiluo committed · 1 Parent(s): eb8b7fc

Update README.md

Files changed (1):
  1. README.md (+21, -0)
README.md CHANGED
@@ -54,6 +54,27 @@ Discover more about DeepSWE-Preview's development and capabilities in our [techn
 </p>
 </div>
 
+## Usage
+
+See our reproduction script for DeepSWE's [test-time scaling](https://github.com/agentica-project/R2E-Gym/blob/master/reproduction/DEEPSWE_TTS_REPRODUCTION.MD).
+
+## Serving DeepSWE-Verifier
+
+We suggest using vLLM to serve:
+```
+# Stop previous server and start verifier model
+export MAX_CONTEXT_LEN=76800
+vllm serve Qwen/Qwen3-14B \
+  --max-model-len $MAX_CONTEXT_LEN \
+  --hf-overrides '{"max_position_embeddings": '$MAX_CONTEXT_LEN'}' \
+  --enable-lora \
+  --lora-modules verifier=agentica-org/DeepSWE-Preview \
+  --port 8000 \
+  --dtype bfloat16 \
+  --max-lora-rank 64 \
+  --tensor-parallel-size 8
+```
+
 
 ## Training
 
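Once the server is running, vLLM exposes an OpenAI-compatible API, and an adapter registered through `--lora-modules` is selected by its module name (here, `verifier`). A minimal request sketch under those assumptions; the message content below is a placeholder, since the actual verifier prompt format is defined by the reproduction script linked in the Usage section:

```
# Placeholder request against the server started above; "model": "verifier"
# routes the request through the DeepSWE-Preview LoRA adapter.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "verifier",
        "messages": [{"role": "user", "content": "PLACEHOLDER: candidate patch and trajectory to verify"}],
        "temperature": 0
      }'
```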