Post
136
SynLogic π§ logical reasoning model & dataset by MiniMax.
MiniMaxAI/synlogic-6836c3246fca0277657ff032
β¨ 3 models: 7B/32B/ Mix-3-32B (MIT license)
β¨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.)
β¨ RL training with auto-verifiable rewards
β¨ Generalizes to math without explicit math training
β¨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines
MiniMaxAI/synlogic-6836c3246fca0277657ff032
β¨ 3 models: 7B/32B/ Mix-3-32B (MIT license)
β¨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.)
β¨ RL training with auto-verifiable rewards
β¨ Generalizes to math without explicit math training
β¨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines