1 Comment
User's avatar
⭠ Return to thread
Neural Foundry's avatar

Nice to see MiniMax continuing to push open-source boundaries. The 74.0 SWE-bench Verified score is impressive given how quickly they're iterating. I've been tracking how VIBE-Web differs from traditional text benchmarks and it's a much better signal for actual web app generation tasks bc it tests end-to-end functional outcomes rather than just isolated completions. Curious how M2.1 handles state managment in more complex React apps compared to the previous version.

Expand full comment