|
--- |
|
language: |
|
- en |
|
license: other |
|
tags: |
|
- facebook |
|
- nvidia |
|
- meta |
|
- pytorch |
|
- llama |
|
- llama-3 |
|
- mlx |
|
pipeline_tag: text-generation |
|
license_name: llama3 |
|
license_link: LICENSE |
|
--- |
|
|
|
# mlx-community/Llama3-ChatQA-1.5-8B-4bit |
|
This model was converted to MLX format from [`mlx-community/Llama3-ChatQA-1.5-8B`]() using mlx-lm version **0.12.0**. |
|
|
|
Model added by [Prince Canuma](https://twitter.com/Prince_Canuma). |
|
|
|
Refer to the [original model card](https://huggingface.co/nvidia/Llama3-ChatQA-1.5-8B) for more details on the model. |
|
## Use with mlx |
|
|
|
```bash |
|
pip install mlx-lm |
|
``` |
|
|
|
```python |
|
from mlx_lm import load, generate |
|
|
|
model, tokenizer = load("mlx-community/Llama3-ChatQA-1.5-8B-4bit") |
|
response = generate(model, tokenizer, prompt="hello", verbose=True) |
|
``` |
|
|