Derived from [Instella-3B-Instruct](https://huggingface.co/amd/Instella-3B-Instruct)
<em><b>Figure 1:</b> Instella-Math Training Steps</em>
</div>
## Example Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "amd/Instella-3B-Math"

tokenizer = AutoTokenizer.from_pretrained(checkpoint, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto", trust_remote_code=True)

prompt = [{"role": "user", "content": "Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did Natalia sell altogether in April and May? Let's think step by step and output the final answer within \\boxed{}."}]

inputs = tokenizer.apply_chat_template(
    prompt,
    add_generation_prompt=True,
    return_tensors='pt'
)

tokens = model.generate(
    inputs.to(model.device),
    max_new_tokens=1024,
    temperature=0.8,
    do_sample=True
)

print(tokenizer.decode(tokens[0], skip_special_tokens=False))
```
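Because the prompt asks the model to put its final answer inside `\boxed{}`, the answer can be pulled out of the decoded completion with a small regex helper. This is a sketch, not part of the model card: `extract_boxed` and the sample completion below are illustrative assumptions (for this problem, 48 clips in April plus 48 / 2 = 24 in May gives 72).

```python
import re

def extract_boxed(text):
    # Illustrative helper (not part of the model card): return the contents
    # of the last \boxed{...} in a completion, or None if there is none.
    matches = re.findall(r"\\boxed\{([^}]*)\}", text)
    return matches[-1] if matches else None

# Hypothetical completion; a real one would come from tokenizer.decode(...) above.
completion = "In May she sold 48 / 2 = 24 clips, so in total 48 + 24 = \\boxed{72} clips."
print(extract_boxed(completion))  # → 72
```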

# Supervised Finetuning (SFT)

We perform a two-stage supervised fine-tuning process to gradually enhance the reasoning capabilities of the Instella-3B-Instruct model. In the first stage, we use instruction tuning to broaden mathematical coverage. The second stage trains the model to generate in-depth analyses and structured reasoning steps, which are crucial for tackling complex problems such as Olympiad-level math questions.