VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO

Hacker News

Score: 258 | Comments: 111
