Multimodal - MLX - a nexaml Collection

nexaml 's Collections

Multimodal - GGUF

Multimodal - MLX

Multimodal - MLX

updated Jul 21, 2025

Language Models that takes vision input and/or audio input, hand picked by Nexa Team.

NexaAI/gemma-3n-E4B-it-4bit-MLX

Image-Text-to-Text • Updated Jul 22, 2025 • 20 • 2
NexaAI/Qwen2.5-VL-7B-Instruct-4bit-MLX

Image-Text-to-Text • 2B • Updated Jul 22, 2025 • 31
NexaAI/SmolVLM-500M-Instruct-8bit-MLX

Image-Text-to-Text • 0.7B • Updated Jul 22, 2025 • 10
NexaAI/SmolVLM-Instruct-8bit-MLX

Image-Text-to-Text • 0.7B • Updated Jul 22, 2025 • 10
NexaAI/gemma-3-4b-it-8bit-MLX

Image-Text-to-Text • 2B • Updated Jul 22, 2025 • 85 • 2
NexaAI/gemma-3n-E2B-it-4bit-MLX

Image-Text-to-Text • 2B • Updated Jul 22, 2025 • 16 • 1