Update README.md
Browse files
README.md
CHANGED
@@ -131,8 +131,7 @@ curl http://localhost:30000/v1/chat/completions \
|
|
131 |
To achieve optimal performance, we recommend the following settings:
|
132 |
|
133 |
1. **Sampling Parameters**:
|
134 |
-
- We suggest using `Temperature=0.7`, `TopP=0.8`, `TopK=20`, and `MinP=0`.
|
135 |
-
⚠️ For benchmarking scenarios requiring sampling (e.g., AIME), these parameters must be explicitly configured.
|
136 |
|
137 |
2. **Adequate Output Length**: Set output length to 16,384 tokens for general use cases to accommodate complex reasoning tasks in instruct models.
|
138 |
|
|
|
131 |
To achieve optimal performance, we recommend the following settings:
|
132 |
|
133 |
1. **Sampling Parameters**:
|
134 |
+
- We suggest using `Temperature=0.7`, `TopP=0.8`, `TopK=20`, and `MinP=0`. (⚠️ For benchmarking scenarios requiring sampling (e.g., AIME), these parameters must be explicitly configured.)
|
|
|
135 |
|
136 |
2. **Adequate Output Length**: Set output length to 16,384 tokens for general use cases to accommodate complex reasoning tasks in instruct models.
|
137 |
|