Update README.md
README.md CHANGED
@@ -5,12 +5,12 @@ tags:
 - green
 - p8
 - llmware-chat
--
+- onnx
 ---
 
-# llama-3.1-instruct-
+# llama-3.1-instruct-onnx
 
-**llama-3.1-instruct-ov
+**llama-3.1-instruct-ov** is an ONNX int4 quantized version of Llama 3.1 Instruct, providing a very fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
 
 [**llama-3.1-instruct**](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) is a leading open source general foundation model from Meta.
 