Update README.md
Browse files
README.md
CHANGED
|
@@ -32,7 +32,7 @@ We utilized the following datasets:
|
|
| 32 |
| LogiQA2.0 (0-shot) | **38.0** | 36.83 |
|
| 33 |
| BBH CoT (0-shot) | 64.9 | **70.37** |
|
| 34 |
| **Code Benchmarks** | | |
|
| 35 |
-
| HumanEval (pass@1) | 47.9 | **
|
| 36 |
| **Domain Specific (Medical)** | | |
|
| 37 |
| MedQA (0-shot) | **53.6** | 52.87 |
|
| 38 |
| MedMCQA (5-shot) | **51.3** | 50.71 |
|
|
|
|
| 32 |
| LogiQA2.0 (0-shot) | **38.0** | 36.83 |
|
| 33 |
| BBH CoT (0-shot) | 64.9 | **70.37** |
|
| 34 |
| **Code Benchmarks** | | |
|
| 35 |
+
| HumanEval (pass@1) | 47.9 | **60.82** |
|
| 36 |
| **Domain Specific (Medical)** | | |
|
| 37 |
| MedQA (0-shot) | **53.6** | 52.87 |
|
| 38 |
| MedMCQA (5-shot) | **51.3** | 50.71 |
|