Especially on MATH-500, it achieved an excellent score of 96.2, closely following DeepSeek R1, demonstrating T1’s ...