Update README.md
README.md CHANGED
@@ -71,10 +71,10 @@ from transformers import AutoTokenizer, AutoModelForCausalLM
 import flash_attn
 import time
 
-tokenizer = AutoTokenizer.from_pretrained("NousResearch/DeepHermes-3-
+tokenizer = AutoTokenizer.from_pretrained("NousResearch/DeepHermes-3-Llama-3-8B-Preview")
 
 model = AutoModelForCausalLM.from_pretrained(
-    "NousResearch/DeepHermes-3-
+    "NousResearch/DeepHermes-3-Llama-3-8B-Preview",
     torch_dtype=torch.float16,
     device_map="auto",
     attn_implementation="flash_attention_2",
@@ -110,10 +110,10 @@ from transformers import AutoTokenizer, AutoModelForCausalLM
 import flash_attn
 import time
 
-tokenizer = AutoTokenizer.from_pretrained("NousResearch/DeepHermes-3-
+tokenizer = AutoTokenizer.from_pretrained("NousResearch/DeepHermes-3-Llama-3-8B-Preview")
 
 model = AutoModelForCausalLM.from_pretrained(
-    "NousResearch/DeepHermes-3-
+    "NousResearch/DeepHermes-3-Llama-3-8B-Preview",
     torch_dtype=torch.float16,
     device_map="auto",
     attn_implementation="flash_attention_2",
@@ -141,7 +141,7 @@ print(f"Response: {response}")
 
 You can also run this model with vLLM, by running the following in your terminal after `pip install vllm`
 
-`vllm serve NousResearch/
+`vllm serve NousResearch/DeepHermes-3-Llama-3-8B-Preview`
 
 You may then use the model over API using the OpenAI library just like you would call OpenAI's API.
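For context, a minimal runnable version of the snippet the first two hunks patch, with the corrected repo id filled in. The prompt, chat-template call, and generation parameters below are illustrative assumptions, not text from the README itself:

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

# Corrected repo id from the diff above.
model_id = "NousResearch/DeepHermes-3-Llama-3-8B-Preview"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
    attn_implementation="flash_attention_2",  # requires `pip install flash-attn`
)

# Illustrative prompt; the README's actual prompt is outside this diff.
messages = [{"role": "user", "content": "Hello, who are you?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Decode only the newly generated tokens, matching the diff's
# `print(f"Response: {response}")` context line.
output = model.generate(input_ids, max_new_tokens=256, do_sample=True, temperature=0.7)
response = tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True)
print(f"Response: {response}")
```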
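Since the README says to query the vLLM server through the OpenAI library, here is a minimal sketch of that call, assuming vLLM's default OpenAI-compatible endpoint on localhost port 8000 (the base URL and placeholder API key are assumptions, not taken from the diff):

```python
from openai import OpenAI

# vLLM exposes an OpenAI-compatible API; port 8000 is its default (assumed here).
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="NousResearch/DeepHermes-3-Llama-3-8B-Preview",
    messages=[{"role": "user", "content": "Hello, who are you?"}],
)
print(response.choices[0].message.content)
```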