
---
language:
- en
license: other
tags:
- facebook
- nvidia
- meta
- pytorch
- llama
- llama-3
- mlx
pipeline_tag: text-generation
license_name: llama3
license_link: LICENSE
---

# mlx-community/Llama3-ChatQA-1.5-8B-4bit
This model was converted to MLX format from `mlx-community/Llama3-ChatQA-1.5-8B` using mlx-lm version 0.12.0.

Model added by [Prince Canuma](https://twitter.com/Prince_Canuma).

Refer to the [original model card](https://huggingface.co/nvidia/Llama3-ChatQA-1.5-8B) for more details on the model.

## Use with mlx

```bash
pip install mlx-lm
```

```python
from mlx_lm import load, generate

# Download (on first use) and load the 4-bit weights and tokenizer.
model, tokenizer = load("mlx-community/Llama3-ChatQA-1.5-8B-4bit")
# verbose=True prints the generated text as it is produced.
response = generate(model, tokenizer, prompt="hello", verbose=True)
```
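
The snippet above sends a bare prompt. ChatQA models are usually prompted with a system line, an optional context passage, and a `User:` / `Assistant:` turn. Below is a minimal sketch of that layout, assuming the format described in the original model card; the helper names `build_chatqa_prompt` and `ask`, the default system text, and the `max_tokens` value are illustrative choices, not part of this repository. Check the original card before relying on the exact wording.

```python
# Assumed system text, modeled on the original ChatQA model card.
DEFAULT_SYSTEM = (
    "This is a chat between a user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's "
    "questions based on the context. The assistant should also indicate when "
    "the answer cannot be found in the context."
)


def build_chatqa_prompt(question, context="", system=DEFAULT_SYSTEM):
    """Join system text, optional context, and the question into one prompt.

    Hypothetical helper: sections are separated by blank lines and the
    prompt ends with "Assistant:" so the model continues from there.
    """
    parts = [f"System: {system}"]
    if context:
        parts.append(context)
    parts.append(f"User: {question}")
    return "\n\n".join(parts) + "\n\nAssistant:"


def ask(question, context="", max_tokens=128):
    """Hypothetical wrapper: load the model and answer a single question."""
    # Imported lazily so the prompt helper can be used without mlx-lm installed.
    from mlx_lm import load, generate

    model, tokenizer = load("mlx-community/Llama3-ChatQA-1.5-8B-4bit")
    prompt = build_chatqa_prompt(question, context)
    return generate(model, tokenizer, prompt=prompt, max_tokens=max_tokens)
```

For example, `ask("Who maintains the project?", context="The project is maintained by Ann.")` would build the prompt and run generation on an Apple silicon machine with `mlx-lm` installed. `ask` reloads the model on every call, so for repeated questions it is better to call `load` once and reuse `model` and `tokenizer`.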