Why I built a neutral LLM eval framework after Promptfoo joined OpenAI
python
dev.to
A few weeks ago, Promptfoo — one of the most popular open-source LLM evaluation frameworks — joined OpenAI. I don't think that's inherently bad. But it created a real problem for the ecosystem: the tools we use to evaluate AI systems are increasingly owned by the same companies that build those AI systems. That's a conflict of interest that matters. So I built Rubric — an independent, MIT-licensed LLM and AI agent evaluation framework. No corporate parent. Open source forever. Here's what I l