Updating evaluation details for RULER (reasoning off) (#6)
Browse files- Updating evaluation details for RULER (reasoning off) (2e9a897917ec654280edb6fb003349f606cf36e7)
Co-authored-by: Ameya Sunil Mahabaleshwarkar <[email protected]>
README.md
CHANGED
@@ -63,7 +63,7 @@ GOVERNING TERMS: This trial service is governed by the [NVIDIA API Trial Terms o
|
|
63 |
|
64 |
### Benchmark Results (Reasoning On)
|
65 |
|
66 |
-
We evaluated our model in
|
67 |
|
68 |
|
69 |
| Benchmark | Qwen3-8B | NVIDIA-Nemotron-Nano-9B-v2 |
|
|
|
63 |
|
64 |
### Benchmark Results (Reasoning On)
|
65 |
|
66 |
+
We evaluated our model in **Reasoning-On** mode across all benchmarks, except RULER, which is evaluated in **Reasoning-Off** mode.
|
67 |
|
68 |
|
69 |
| Benchmark | Qwen3-8B | NVIDIA-Nemotron-Nano-9B-v2 |
|