HaoxingChen commited on
Commit
7bc9ca6
·
verified ·
1 Parent(s): 99de339

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md CHANGED
@@ -127,6 +127,22 @@ curl http://localhost:30000/v1/chat/completions \
127
 
128
  ```
129
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
130
  ## Citation
131
  ```bibtex
132
  @article{GroveMoE,
 
127
 
128
  ```
129
 
130
+ ## Best Practices for Model Configuration
131
+ To achieve optimal performance, we recommend the following settings:
132
+
133
+ 1. **Sampling Parameters**:
134
+ - We suggest using `Temperature=0.7`, `TopP=0.8`, `TopK=20`, and `MinP=0`.
135
+ ⚠️ For benchmarking scenarios requiring sampling (e.g., AIME), these parameters must be explicitly configured.
136
+
137
+ 2. **Adequate Output Length**: Set output length to 16,384 tokens for general use cases to accommodate complex reasoning tasks in instruct models.
138
+
139
+ 3. **Standardize Output Format**: We recommend using prompts to standardize model outputs when benchmarking.
140
+ - **Math Problems**: Include "Please reason step by step, and put your final answer within \boxed{}." in the prompt.
141
+ - **Multiple-Choice Questions**: Add the following JSON structure to the prompt to standardize responses: "Please show your choice in the `answer` field with only the choice letter, e.g., `"answer": "C"`."
142
+
143
+
144
+
145
+
146
  ## Citation
147
  ```bibtex
148
  @article{GroveMoE,