UnslothAI has released a 4-bit, MLX-optimized version of its DGEMMA 4-31B model for Apple Silicon. The release promises fast inference on all M-series Macs while using around 20GB of RAM. The model targets multimodal and visual tasks, supports a 256K context length, and offers native function calling.
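
To see how a 4-bit build lands near the quoted 20GB, here is a rough back-of-the-envelope sketch in Python. It assumes "31B" means about 31 billion parameters and counts only raw weight storage; the helper name `quantized_weight_gb` is made up for illustration, and whatever remains of the 20GB figure would have to cover quantization scales, the KV cache, and runtime overhead, none of which the announcement breaks down.

```python
def quantized_weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate raw weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: "31B" is read as 31 billion parameters.
N_PARAMS = 31e9

# Assumption: the "around 20GB" figure from the announcement, taken at face value.
QUOTED_RAM_GB = 20.0

weights_4bit = quantized_weight_gb(N_PARAMS, 4)    # 4-bit quantized weights
weights_fp16 = quantized_weight_gb(N_PARAMS, 16)   # unquantized 16-bit weights, for comparison
headroom = QUOTED_RAM_GB - weights_4bit            # left for scales, KV cache, runtime

print(f"4-bit weights:  {weights_4bit:.1f} GB")
print(f"16-bit weights: {weights_fp16:.1f} GB")
print(f"headroom within the quoted figure: {headroom:.1f} GB")
```

Under these assumptions the weights alone take roughly three quarters of the quoted memory, and the same model at 16 bits would need about four times as much. Actual usage will vary with context length, since the KV cache grows with the number of tokens held in context.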