osxest committed on
Commit e5cef60 · verified · 1 Parent(s): afaa127

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +41 -0
README.md ADDED
@@ -0,0 +1,41 @@
+ ---
+ base_model: nvidia/OpenCodeReasoning-Nemotron-32B-IOI
+ datasets:
+ - nvidia/OpenCodeReasoning
+ language:
+ - en
+ library_name: transformers
+ license: apache-2.0
+ tags:
+ - nvidia
+ - code
+ - mlx
+ - mlx-my-repo
+ pipeline_tag: text-generation
+ ---
+
+ # osxest/OpenCodeReasoning-Nemotron-32B-IOI-mlx-8Bit
+
+ The model [osxest/OpenCodeReasoning-Nemotron-32B-IOI-mlx-8Bit](https://huggingface.co/osxest/OpenCodeReasoning-Nemotron-32B-IOI-mlx-8Bit) was converted to MLX format from [nvidia/OpenCodeReasoning-Nemotron-32B-IOI](https://huggingface.co/nvidia/OpenCodeReasoning-Nemotron-32B-IOI) using mlx-lm version **0.22.3**.
+
+ ## Use with mlx
+
+ ```bash
+ pip install mlx-lm
+ ```
+
+ ```python
+ from mlx_lm import load, generate
+
+ model, tokenizer = load("osxest/OpenCodeReasoning-Nemotron-32B-IOI-mlx-8Bit")
+
+ prompt="hello"
+
+ if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
+     messages = [{"role": "user", "content": prompt}]
+     prompt = tokenizer.apply_chat_template(
+         messages, tokenize=False, add_generation_prompt=True
+     )
+
+ response = generate(model, tokenizer, prompt=prompt, verbose=True)
+ ```
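
The prompt-preparation step in the snippet above could be factored into a small helper so it can be reused across prompts. This is only a sketch: `build_prompt` and `main` are hypothetical names that do not appear in the README, the example prompt text is made up, and the `load` and `generate` calls simply mirror the ones already shown.

```python
def build_prompt(tokenizer, user_text):
    """Wrap user_text in the tokenizer's chat template when one is defined.

    Falls back to the raw text when the tokenizer has no chat template,
    matching the check used in the README snippet.
    """
    has_template = (
        hasattr(tokenizer, "apply_chat_template")
        and getattr(tokenizer, "chat_template", None) is not None
    )
    if not has_template:
        return user_text
    messages = [{"role": "user", "content": user_text}]
    return tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )


def main():
    # Imported inside main() so build_prompt can be used without mlx-lm installed.
    from mlx_lm import load, generate

    model, tokenizer = load("osxest/OpenCodeReasoning-Nemotron-32B-IOI-mlx-8Bit")
    # Illustrative prompt only; replace with your own task description.
    prompt = build_prompt(tokenizer, "Write a function that reverses a string.")
    return generate(model, tokenizer, prompt=prompt, verbose=True)
```

Calling `main()` downloads the converted weights on first use, so it needs enough disk space and memory for an 8-bit 32B model.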