Update README.md
Browse files
README.md
CHANGED
@@ -127,6 +127,22 @@ curl http://localhost:30000/v1/chat/completions \
|
|
127 |
|
128 |
```
|
129 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
130 |
## Citation
|
131 |
```bibtex
|
132 |
@article{GroveMoE,
|
|
|
127 |
|
128 |
```
|
129 |
|
130 |
+
## Best Practices for Model Configuration
|
131 |
+
To achieve optimal performance, we recommend the following settings:
|
132 |
+
|
133 |
+
1. **Sampling Parameters**:
|
134 |
+
- We suggest using `Temperature=0.7`, `TopP=0.8`, `TopK=20`, and `MinP=0`.
|
135 |
+
⚠️ For benchmarking scenarios requiring sampling (e.g., AIME), these parameters must be explicitly configured.
|
136 |
+
|
137 |
+
2. **Adequate Output Length**: Set output length to 16,384 tokens for general use cases to accommodate complex reasoning tasks in instruct models.
|
138 |
+
|
139 |
+
3. **Standardize Output Format**: We recommend using prompts to standardize model outputs when benchmarking.
|
140 |
+
- **Math Problems**: Include "Please reason step by step, and put your final answer within \boxed{}." in the prompt.
|
141 |
+
- **Multiple-Choice Questions**: Add the following JSON structure to the prompt to standardize responses: "Please show your choice in the `answer` field with only the choice letter, e.g., `"answer": "C"`."
|
142 |
+
|
143 |
+
|
144 |
+
|
145 |
+
|
146 |
## Citation
|
147 |
```bibtex
|
148 |
@article{GroveMoE,
|