license: mit
language:
  - en
library_name: mlx
pipeline_tag: text-to-speech
tags:
  - nlp
  - tts
  - bark

Model Summary

Bark is a transformer-based text-to-audio model that can generate speech as well as other audio, such as background noise and music.

This is a port of Suno's Bark model to MLX, Apple's machine learning framework. The goal of the port is to explore how fast on-device TTS inference can be made.

This repository contains the Bark weights in npz format suitable for use with Apple's MLX Framework.
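An .npz file is an ordinary zip archive holding one .npy entry per array, so a download can be sanity-checked without loading any weights into memory. The sketch below uses only the Python standard library. The function name list_npz_arrays is hypothetical, and it assumes the weight files end up in the weights/ directory used in the Usage section; the actual file names inside that directory are not assumed.

```python
import zipfile
from pathlib import Path


def list_npz_arrays(npz_path):
    """Return (array_name, stored_size_in_bytes) pairs for one .npz archive.

    Hypothetical helper for illustration; it only reads the zip index.
    """
    with zipfile.ZipFile(npz_path) as archive:
        return [
            # Each entry is named "<array_name>.npy"; its size includes the
            # small .npy header in addition to the raw array data.
            (info.filename.removesuffix(".npy"), info.file_size)
            for info in archive.infolist()
        ]


if __name__ == "__main__":
    # Assumption: weights were downloaded into "weights/" as in the Usage section.
    for npz_file in sorted(Path("weights").glob("*.npz")):
        print(npz_file.name)
        for name, size in list_npz_arrays(npz_file):
            print(f"  {name}: {size / 1e6:.1f} MB")
```

Running the script after the download step prints every array name and its size, which is a quick way to spot a truncated or missing file. It requires Python 3.9 or newer because of str.removesuffix.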

Usage

# Setup
pip install transformers huggingface_hub hf_transfer
git clone https://github.com/j-csc/mlx_bark
cd mlx_bark
pip install -r requirements.txt

# Download model
export HF_HUB_ENABLE_HF_TRANSFER=1
huggingface-cli download --local-dir-use-symlinks False --local-dir weights/ mlx-community/mlx_bark

# Run example (large model)
python model.py --text="Hello world!" --path weights/ --model large
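The same command can be driven from Python, for example to synthesize several prompts in a loop. This is a minimal sketch, not part of the repository: the helper names build_bark_command and run_bark are hypothetical, and the only flags used are the three shown above (--text, --path, --model). "large" is the only --model value this README confirms, so any other value is an assumption to check against model.py.

```python
import subprocess
import sys


def build_bark_command(text, weights_dir="weights/", model="large"):
    """Build the argument list for the model.py invocation shown above.

    Hypothetical helper; it mirrors the README command one-to-one.
    """
    return [
        sys.executable,      # run with the current Python interpreter
        "model.py",
        f"--text={text}",    # passed as a single argument, so no shell quoting is needed
        "--path", weights_dir,
        "--model", model,
    ]


def run_bark(text, **kwargs):
    """Run model.py from inside the cloned mlx_bark directory."""
    # check=True raises CalledProcessError if model.py exits with an error.
    subprocess.run(build_bark_command(text, **kwargs), check=True)
```

Calling run_bark("Hello world!") from the mlx_bark directory is then equivalent to the shell command above. Passing the arguments as a list avoids shell quoting problems when the text contains quotes or exclamation marks.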