VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO Von Hackernews 23. Juni 2026 security ORIGINAL QUELLE:arxiv.org Quelle: Hackernews Comments Tags: arkham, comments, ki-modell, Security, vibethinker