MLX.zig

A Zig language binding for MLX, Apple's array framework for machine learning on Apple Silicon.

Overview

MLX.zig provides a Zig-native interface to MLX, allowing you to build and run machine learning models on Apple Silicon devices using the Zig programming language. This project demonstrates how to:

Use MLX from Zig without any additional build tools
Implement a transformer-based language model
Handle tokenization and text generation

Features

Pure Zig Build System: No CMake or other external build tools needed
Complete Dependency Management: All C/C++ dependencies (MLX, MLX-C, PCRE2, etc.) resolved through Zig's build system
Working LLM Example: Includes a tokenizer and transformer implementation capable of text generation
Efficient Tokenization: Uses PCRE2 (Perl Compatible Regular Expressions) for fast and reliable text processing
Low-Level MLX Access: Direct bindings to MLX's C API for maximum performance

Prerequisites

Apple Silicon Mac
Zig v0.13.0

Getting Started

Clone the repository:

git clone https://github.com/jaco-bro/MLX.zig.git
cd MLX.zig

Download the Llama-3.2-1B-Instruct model weights (2.47GB) and place it in the project root directory.
Build and run the example demo:

zig build run

This will compile MLX from source and run a simple text generation demo.

Examples

// Load tokenizer
var tokenizer = try Tokenizer.init(allocator, null);
defer tokenizer.deinit();

// Load transformer
var transformer = try Transformer.init(allocator, null);
defer transformer.deinit();

// Encode input string to token IDs (chat format)
const input_ids = try tokenizer.encodeChat(allocator, "You are a helpful assistant.", user_input);
defer allocator.free(input_ids);

// Generate new tokens
const output_ids = try transformer.generate(input_ids, num_tokens_to_generate);
defer allocator.free(output_ids);

How it Works

MLX.zig integrates Zig with Apple's ML framework through three key components:

Zig Build System: Compiles MLX (C++), MLX-C, and PCRE2 from source with zero external dependencies
Transformer: Implements a Llama-style language model with attention mechanisms and key-value caching
Tokenizer: Uses PCRE2 for efficient regex-based text processing, handling complex patterns and special tokens

The system works by encoding text to tokens, processing them through MLX tensor operations optimized for Apple Silicon, and decoding the generated output back to text—all managed through a clean Zig interface.

Acknowledgements

This project's build system is based on Erik Kaunismäki's zig-build-mlx, which pioneered the approach of building MLX directly with Zig rather than using CMake. This project uses a condensed version of Erik's build configuration.

License

This project is licensed under the Apache License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
src		src
LICENSE		LICENSE
README.md		README.md
build.zig		build.zig
build.zig.zon		build.zig.zon
tokenizer.model		tokenizer.model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MLX.zig

Overview

Features

Prerequisites

Getting Started

Examples

How it Works

Acknowledgements

License

About

Uh oh!

Releases

Packages

Languages

License

dominix/MLX.zig

Folders and files

Latest commit

History

Repository files navigation

MLX.zig

Overview

Features

Prerequisites

Getting Started

Examples

How it Works

Acknowledgements

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages