Today, we're previewing the fastest way to run Ollama on Apple silicon, powered by MLX, Apple's machine learning framework.
Tag: Optimization
7 links

Disaggregating Prefill and Decode: Faster First Tokens, Faster Streams

Unpacking Cloudflare Workers CPU Performance Benchmarks
blog.cloudflare.com
Cloudflare investigated CPU performance benchmark results for Workers, uncovering and fixing issues in infrastructure, V8 garbage collection, and OpenNext optimizations. These improvements have made Cloudflare Workers faster for all customers.

Optimizing PHP applications for performance — Mateus Guimarães
mateusguimaraes.com
A quick guide on how to set up OPCache and fine-tune PHP-FPM.

"Why is the Rust compiler so slow?"
sharnoff.io
I spent a month repeatedly building my website in Docker, and now have horrors to share.

Optimizations with Zig | alloc.dev
alloc.dev
The power of Zig's comptime code execution

Speeding Up My ZSH Shell ⚡ - Scott Spence
scottspence.com
Super quick one I want to document here! I got myself on a side quest, again! No biggie, my ZSH shell was taking ages to load. When I say ag...