Multimodal - MLX

updated 17 days ago

Language models that take vision and/or audio input, hand-picked by the Nexa team.

  • NexaAI/gemma-3n-E4B-it-4bit-MLX

    Image-Text-to-Text • Updated Jul 22 • 77 • 1

  • NexaAI/Qwen2.5-VL-7B-Instruct-4bit-MLX

    Image-Text-to-Text • 2B • Updated Jul 22 • 68

  • NexaAI/SmolVLM-500M-Instruct-8bit-MLX

    Image-Text-to-Text • 0.7B • Updated Jul 22 • 28

  • NexaAI/SmolVLM-Instruct-8bit-MLX

    Image-Text-to-Text • 0.7B • Updated Jul 22 • 35

  • NexaAI/gemma-3-4b-it-8bit-MLX

    Image-Text-to-Text • 2B • Updated Jul 22 • 58 • 1

  • NexaAI/gemma-3n-E2B-it-4bit-MLX

    Image-Text-to-Text • 2B • Updated Jul 22 • 78 • 1

  • NexaAI/Kokoro-82M-bf16-MLX

    Text-to-Speech • Updated 17 days ago • 195 • 2

  • NexaAI/parakeet-tdt-0.6b-v2-MLX

    Automatic Speech Recognition • Updated 17 days ago • 127 • 1

  • NexaAI/whisper-large-v3-turbo-MLX

    Automatic Speech Recognition • Updated 17 days ago • 177