inclusionAI
/

GroveMoE-Inst

Text Generation

Model card Files Files and versions Community

HaoxingChen commited on 5 days ago

Commit

7bc9ca6

·

verified ·

1 Parent(s): 99de339

Update README.md

Files changed (1) hide show

README.md +16 -0

README.md CHANGED Viewed

@@ -127,6 +127,22 @@ curl http://localhost:30000/v1/chat/completions \
 ```
 ## Citation
 ```bibtex
 @article{GroveMoE,

 ```
+## Best Practices for Model Configuration
+To achieve optimal performance, we recommend the following settings:
+1. **Sampling Parameters**:
+   - We suggest using `Temperature=0.7`, `TopP=0.8`, `TopK=20`, and `MinP=0`.
+     ⚠️ For benchmarking scenarios requiring sampling (e.g., AIME), these parameters must be explicitly configured.
+2. **Adequate Output Length**: Set output length to 16,384 tokens for general use cases to accommodate complex reasoning tasks in instruct models.
+3. **Standardize Output Format**: We recommend using prompts to standardize model outputs when benchmarking.
+   - **Math Problems**: Include "Please reason step by step, and put your final answer within \boxed{}." in the prompt.
+   - **Multiple-Choice Questions**: Add the following JSON structure to the prompt to standardize responses: "Please show your choice in the `answer` field with only the choice letter, e.g., `"answer": "C"`."
 ## Citation
 ```bibtex
 @article{GroveMoE,