how does deepseek r1's performance in math-heavy benchmarks compare to gpt-4o

whatsapp中文叫什么名字