NousResearch
/

DeepHermes-3-Llama-3-8B-Preview

Model card Files Files and versions

teknium commited on Feb 13

Commit

868505c

·

verified ·

1 Parent(s): 8aeace5

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -52,6 +52,8 @@ The ethos of the Hermes series of models is focused on aligning LLMs to the user
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/O_sgWq4CVPuxuKYqHWkkN.png)
 ## Benchmarks in **Non-Reasoning Mode** against Llama-3.1-8B-Instruct
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/hZCJa8g8smOS9BcQSXAd1.png)

 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/O_sgWq4CVPuxuKYqHWkkN.png)
+*Upper bound determined by measuring the % gained over Hermes 3 3 & 70b by MATH_VERIFY compared to eleuther eval harness, which ranged betweeen 33% and 50% gain in MATH Hard benchmark on retested models by them compared to eval harness reported scores*
 ## Benchmarks in **Non-Reasoning Mode** against Llama-3.1-8B-Instruct
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/hZCJa8g8smOS9BcQSXAd1.png)