I built a Rust LLM inference engine with custom WGSL GPU kernels, here's what I learned!
I've been working on a side project called aether , a Rust LLM inference engine that can load GGUF models and run them with WGPU GPU acceleration. It
Curated development tutorials from top sources. Filter by language.
I've been working on a side project called aether , a Rust LLM inference engine that can load GGUF models and run them with WGPU GPU acceleration. It
The Problem We Were Actually Solving At Veltrix our treasure hunt engine was running on the JVM stack—OpenJDK 17, GraalVM Native Image, and a Kotlin
I built HiyokoMTP because the existing tools kept failing me. Android File Transfer has been effectively abandoned. OpenMTP is Electron-based — which
The Problem We Were Actually Solving I still remember the day our Veltrix-based event handling system started to show signs of distress, wi
Every payment gateway I've ever worked on had the same hidden bug. A provider API times out. The code says "failure". So you retry. But the original
The Problem We Were Actually Solving I was tasked with optimizing the scaling of our server, which was stalling at the first growth inflect
A complete ML pipeline: engine, backprop, binary format, and a live browser demo. Zero dependencies. Under 200 KB total. If you have built machine
I Shipped a Rust + WebAssembly Battle Map Editor to the Microsoft Store I recently shipped RPG Map Editor to the Microsoft Store. It is a
The Problem We Were Actually Solving As I reflect on our team's journey with Hytale and Veltrix configuration, I realize that our initial g
Paneflow is a terminal multiplexer and AI agents IDE written in pure Rust on top of Zed's GPUI. The pitch is one line: a Rust native host for your CLI
Writing self-referential structs in Rust is a complete nightmare; the borrow checker won't let you do it natively. Testing three common workarounds fo
I write C++ full-time. Before this, I had never touched Rust. Three weeks ago I read an article about offline background removal and started building
You're six hours into debugging a production issue. The trace points to line 847 in order_processor.rs, but you need to see how the state flowed from
Over the past two years, the AI Agent ecosystem has changed dramatically. In 2023, most projects were still fundamentally: LLM API + Prompt + Tool
Tokenization is one of those silent bottlenecks in the Large Language Model (LLM) world. While GPUs do the heavy lifting of running the model, the CPU
If you've ever had a GPL dependency sneak into a commercial project, you know the drill. License violations don't fail your tests. They don't break yo
The Problem We Were Actually Solving I was tasked with optimizing the performance of our Hytale server, which was experiencing high latency
The Problem We Were Actually Solving It started with a scream from the observability dashboard. At 02:47 on a Sunday, our event ingestion pipeline hi
Solana's runtime was designed for speed — 400ms block times, parallel transaction execution, sub-cent fees. But when you pair that with AI agents that
The Problem We Were Actually Solving The treasure-hunt engine had to keep state for each actor: current tile, inventory, active effects, an