Building Real-Time A/B Model Comparison with Parallel Async Streams in TypeScript
typescript
dev.to
Ever wanted to pit two LLMs against each other with the exact same prompt and see who wins — in real time? That's what I built into Locally Uncensored v2.1, and the implementation turned out to be more interesting than I expected. The Problem I was constantly switching between Ollama models, trying to figure out which one actually gives better answers for my use cases. Copy-pasting prompts between tabs is tedious. So I built a split-view A/B comparison — same prompt, two models, stre