Building Real-Time A/B Model Comparison with Parallel Async Streams in TypeScript

typescript dev.to April 03, 2026

Ever wanted to pit two LLMs against each other with the exact same prompt and see who wins, in real time? That's what I built into Locally Uncensored v2.1, and the implementation turned out to be more interesting than I expected.

The Problem

I was constantly switching between Ollama models, trying to figure out which one actually gives better answers for my use cases. Copy-pasting prompts between tabs is tedious. So I built a split-view A/B comparison: same prompt, two models, stre
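To make the "same prompt, two models" idea concrete, here is a minimal sketch of consuming two async streams in parallel. It is not the code from Locally Uncensored: `fakeModel`, `drain`, and `compareModels` are hypothetical names invented for illustration, and `fakeModel` is a timer-driven stand-in for a real streaming client (for example, one backed by Ollama), so the example runs without a server.

```typescript
// All names here are hypothetical; this is an illustration, not the article's implementation.
type Side = "A" | "B";

interface ComparisonResult {
  A: string;
  B: string;
}

// A model is anything that turns a prompt into an async stream of text chunks.
type ModelStream = (prompt: string) => AsyncIterable<string>;

// Stand-in for a real streaming client: yields fixed chunks, one every delayMs.
function fakeModel(chunks: string[], delayMs: number): ModelStream {
  return async function* (_prompt: string) {
    for (const chunk of chunks) {
      await new Promise<void>((resolve) => setTimeout(resolve, delayMs));
      yield chunk;
    }
  };
}

// Drain one stream, reporting each chunk as it arrives and returning the full text.
async function drain(
  side: Side,
  stream: AsyncIterable<string>,
  onChunk: (side: Side, chunk: string) => void,
): Promise<string> {
  let text = "";
  for await (const chunk of stream) {
    text += chunk;
    onChunk(side, chunk);
  }
  return text;
}

// Send the same prompt to both models and consume the two streams concurrently.
async function compareModels(
  prompt: string,
  modelA: ModelStream,
  modelB: ModelStream,
  onChunk: (side: Side, chunk: string) => void,
): Promise<ComparisonResult> {
  // Both drains start before either is awaited, so chunks from A and B interleave.
  const [A, B] = await Promise.all([
    drain("A", modelA(prompt), onChunk),
    drain("B", modelB(prompt), onChunk),
  ]);
  return { A, B };
}
```

In a split-view UI, `onChunk` would append the chunk to the left or right pane depending on `side`. Note that `Promise.all` rejects as soon as either stream throws; if one side should keep streaming when the other fails, `Promise.allSettled` is a reasonable alternative.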
