DEV Community

# mlx

Posts

Qwen 3.6 enable_thinking - The MoE Pitfall That Broke My Agent JSON Parsing

5 min read
Running Qwen3.6-27B on a 16GB M1 MacBook Pro: A Practical Engineer’s Guide

7 min read
What 19 GB of Memory Compression Taught Me About MLX on M1 Max

7 min read
Fine-Tuning LLMs on Apple Silicon: New Tools Enable Local Prototyping, Reducing Cloud GPU Dependency

19 min read
Why Apple Silicon Quietly Won the Local-AI Race (April 2026)

6 min read
SleepyQuant – a 12-agent crypto quant running on one Mac

3 min read
The Inverted Control: What 24 Hours of Running Our Own Bot Backwards Revealed

7 min read
2.78 TFLOPS on a Fanless MacBook Air? Benchmarking Apple's M4 with MLX

4 min read
Gemma 4 on Apple Silicon: 85 tok/s with a pip install

4 min read
Ollama Just Got 93% Faster on Mac. Here's How to Enable It.

5 min read