Update README.md
Browse files
README.md
CHANGED
|
@@ -171,7 +171,7 @@ We report results derived from the Agentless scaffold. Departing from the origin
|
|
| 171 |
"sphinx-doc__sphinx-8475"
|
| 172 |
|
| 173 |
### TAU-bench methodology
|
| 174 |
-
We evaluate TAU-Bench with
|
| 175 |
Our general system prompt is:
|
| 176 |
```
|
| 177 |
- In each round, you need to carefully examine the tools provided to you to determine if any can be used.
|
|
|
|
| 171 |
"sphinx-doc__sphinx-8475"
|
| 172 |
|
| 173 |
### TAU-bench methodology
|
| 174 |
+
We evaluate TAU-Bench with GPT-4.1 as user model and without any custom tools. The maximum number of interaction steps is 40.
|
| 175 |
Our general system prompt is:
|
| 176 |
```
|
| 177 |
- In each round, you need to carefully examine the tools provided to you to determine if any can be used.
|