cortexso/mixtral

jan-hq committed on Jun 25, 2024

Commit 27cb8d1 (verified)

1 Parent(s): f9a6ce5

Create model.yml

Files changed (1)

model.yml ADDED

+name: mixtral
+model: mixtral:7x8B
+version: 1
+files:
+  - llama_model_path: model.gguf
+# Result Preferences
+top_p: 0.95
+temperature: 0.7
+frequency_penalty: 0
+presence_penalty: 0
+max_tokens: 32768 # Infer from base config.json -> max_position_embeddings
+stream: true # true | false
+# Engine / Model Settings
+ngl: 33 # Infer from base config.json -> num_hidden_layers + 1
+ctx_len: 32768 # Infer from base config.json -> max_position_embeddings
+engine: cortex.llamacpp
+prompt_template: "[INST] {prompt} [/INST]"
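
The "Infer from base config.json" comments describe a derivation that can be sketched in a few lines of Python. This is a minimal illustration, not part of the commit: `derive_settings` and `render_prompt` are hypothetical helper names, and the sketch assumes that `ngl` counts the transformer layers (`num_hidden_layers`) plus one output layer, the usual llama.cpp convention for offloading the whole model to the GPU.

```python
def derive_settings(config: dict) -> dict:
    """Derive model.yml values from a base config.json dictionary.

    Assumption: ngl = num_hidden_layers + 1 (all layers plus the output layer).
    """
    ctx = config["max_position_embeddings"]
    return {
        "ctx_len": ctx,      # context window of the base model
        "max_tokens": ctx,   # upper bound on generated tokens
        "ngl": config["num_hidden_layers"] + 1,
    }


def render_prompt(template: str, prompt: str) -> str:
    """Fill the {prompt} placeholder of a prompt_template string."""
    return template.replace("{prompt}", prompt)


# Values assumed for the Mixtral base config.json.
base_config = {"max_position_embeddings": 32768, "num_hidden_layers": 32}
settings = derive_settings(base_config)
rendered = render_prompt("[INST] {prompt} [/INST]", "Hello")
```

With the assumed base values, the derived settings match the `ctx_len`, `max_tokens`, and `ngl` entries in model.yml.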