Update README.md
## Model Details
deepseek-ai/DeepSeek-R1-Distill-Llama-8B quantized to ONNX GenAI INT4 with Microsoft DirectML optimization.<br>
Output is reformatted so that each sentence starts on a new line, to improve readability.<br>
<pre>
...
vNewDecoded = tokenizer_stream.decode(new_token)
...
print(vNewDecoded, end='', flush=True)
vPreviousDecoded = vNewDecoded
...
</pre>
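The excerpt above elides the reformatting logic itself. A minimal standalone sketch of one way it could work is shown below: the decoded-token stream is simulated with a plain Python list, and the boundary rule (start a new line when the previous chunk ended a sentence and the next one begins with a space) is an assumption, as is the helper name `reformat_stream` — neither is part of onnxruntime-genai.

```python
# Sketch of sentence-per-line reformatting over a streamed decode.
# Assumption: the real loop gets each chunk from tokenizer_stream.decode();
# here the stream is a plain list so the sketch runs standalone.

SENTENCE_END = (".", "!", "?")

def reformat_stream(decoded_chunks):
    """Join streamed chunks, starting a new line after each sentence."""
    out = []
    previous = ""  # plays the role of vPreviousDecoded in the README loop
    for chunk in decoded_chunks:
        # Assumed rule: a space that follows a sentence-ending chunk is
        # replaced by a newline, so the next sentence starts a new line.
        if previous.endswith(SENTENCE_END) and chunk.startswith(" "):
            chunk = "\n" + chunk[1:]
        out.append(chunk)
        previous = chunk
    return "".join(out)

print(reformat_stream(["Hello", " world.", " Second", " sentence!"]))
```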
Output will start with CoT/reasoning.<br>
In the tokenizer_config.json, the "unk_token" value is changed from null to ""
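If you need to apply the same fix to your own copy of the file, it is a one-line JSON edit; the sketch below is a minimal way to do it, where `fix_unk_token` is a hypothetical helper, not part of any library.

```python
import json

def fix_unk_token(path="tokenizer_config.json"):
    """Replace a null "unk_token" with an empty string, as described above."""
    with open(path, "r", encoding="utf-8") as f:
        config = json.load(f)
    if config.get("unk_token") is None:
        config["unk_token"] = ""  # null -> "" (the change this model card makes)
    with open(path, "w", encoding="utf-8") as f:
        json.dump(config, f, indent=2)
```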

### Model Description
meta-llama/Meta-Llama-3.1-8B-Instruct quantized to ONNX GenAI INT4 with Microsoft DirectML optimization<br>