Depth-upscaled SOLAR-10.7B delivers remarkable performance, outperforming models with up to 30B parameters and even surpassing the recent Mixtral 8x7B model. For details, please refer to the experimental table ([link to be updated soon]).
SOLAR-10.7B is an ideal choice for fine-tuning, offering robustness and adaptability. Even simple instruction fine-tuning of the SOLAR-10.7B pre-trained model yields significant performance improvements. [link to be updated soon]
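The README does not spell out the fine-tuning recipe, so the sketch below shows one common, inexpensive approach: attaching LoRA adapters with the `peft` library. The model id `upstage/SOLAR-10.7B-v1.0` and the chosen LoRA hyperparameters are illustrative assumptions, not the recipe used by the authors.

```python
# Illustrative only: LoRA adapter fine-tuning via `peft`; the model id and
# hyperparameters below are assumptions, not the authors' exact recipe.

MODEL_ID = "upstage/SOLAR-10.7B-v1.0"  # assumed Hugging Face checkpoint id


def build_lora_model():
    """Wrap the pre-trained model with low-rank adapters so that only a
    small fraction of the parameters needs to be trained."""
    from peft import LoraConfig, get_peft_model
    from transformers import AutoModelForCausalLM

    base = AutoModelForCausalLM.from_pretrained(MODEL_ID)
    config = LoraConfig(
        r=16,                                 # adapter rank
        lora_alpha=32,                        # adapter scaling factor
        target_modules=["q_proj", "v_proj"],  # attention projections to adapt
        task_type="CAUSAL_LM",
    )
    model = get_peft_model(base, config)
    model.print_trainable_parameters()  # only the adapter weights are trainable
    return model
```

The adapted model can then be trained with any standard causal-LM training loop (e.g. `transformers.Trainer`) on an instruction dataset.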
# **Evaluation Results**

H6 is the average score across the six Hugging Face Open LLM Leaderboard benchmarks (ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, GSM8K).

| Model                                | H6        | Model Size |
|--------------------------------------|-----------|------------|
| **SOLAR-10.7B-Instruct-v1.0**        | **74.20** | **~ 11B**  |
| mistralai/Mixtral-8x7B-Instruct-v0.1 | 72.62     | ~ 46.7B    |
| 01-ai/Yi-34B-200K                    | 70.81     | ~ 34B      |
| 01-ai/Yi-34B                         | 69.42     | ~ 34B      |
| mistralai/Mixtral-8x7B-v0.1          | 68.42     | ~ 46.7B    |
| meta-llama/Llama-2-70b-hf            | 67.87     | ~ 70B      |
| tiiuae/falcon-180B                   | 67.85     | ~ 180B     |
| **SOLAR-10.7B-v1.0**                 | **66.04** | **~ 11B**  |
| Qwen/Qwen-14B                        | 65.86     | ~ 14B      |
| mistralai/Mistral-7B-Instruct-v0.2   | 65.71     | ~ 7B       |
| 01-ai/Yi-34B-Chat                    | 65.32     | ~ 34B      |
| meta-llama/Llama-2-70b-chat-hf       | 62.40     | ~ 70B      |
| mistralai/Mistral-7B-v0.1            | 60.97     | ~ 7B       |
| mistralai/Mistral-7B-Instruct-v0.1   | 54.96     | ~ 7B       |
# **Usage Instructions**
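A minimal inference sketch with Hugging Face `transformers`, assuming the model is hosted on the Hub under the id `upstage/SOLAR-10.7B-v1.0` (substitute the official checkpoint name). Loading in half precision keeps the ~11B parameters at roughly 21 GB of weights instead of ~43 GB in fp32.

```python
# Minimal usage sketch; the model id below is an assumption -- replace it
# with the official Hugging Face checkpoint name.

MODEL_ID = "upstage/SOLAR-10.7B-v1.0"  # assumed Hugging Face checkpoint id


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Load SOLAR-10.7B in half precision and generate a completion."""
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # half precision to fit on a single large GPU
        device_map="auto",          # place layers on the available GPU(s)
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate("Explain depth up-scaling in one sentence."))
```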