Math Reasoning Examples

Tencent’s new AI technique teaches language models ‘parallel thinking’

The Parallel-R1 framework uses reinforcement learning to teach models how to explore multiple reasoning paths at once, ...

Live Science on MSN

Scientists asked ChatGPT to solve a math problem from more than 2,000 years ago — how it answered it surprised them

We've wondered for centuries whether knowledge is latent and innate or learned and grasped through experience, and a new ...

AWS scientist: Your AI strategy needs mathematical logic

Hallucination is fundamental to how transformer-based language models work. In fact, it's their greatest asset.

11d on MSN

China's DeepSeek applying trial-and-error learning to its AI 'reasoning'

Chinese AI company DeepSeek has shown it can improve the reasoning of its LLM DeepSeek-R1 through trial-and-error based ...

Cambridge study reveals ChatGPT-4 struggles with ancient Greek mathematical puzzle

Cambridge researchers test ChatGPT-4 on an ancient Greek maths puzzle, revealing strengths, weaknesses and the importance of ...

12d

Pulan Network Applies for Patent on Advanced Mathematics Problem-Solving Model, Thinking Chain Reasoning Assists AI in Accurate Problem Solving

Reinforcement Method for Problem-Solving Model Based on Hierarchical Thinking Chains ...

Unite.AI

Jagged Intelligence: Why AIs Ace Olympiad Problems but Flounder on School Math

The artificial intelligence community celebrated a remarkable milestone in 2025 when both Google DeepMind and OpenAI systems ...

12d

Secrets of Chinese AI Model DeepSeek Revealed in Landmark Paper

The success of DeepSeek’s powerful artificial intelligence (AI) model R1 — that made the US stock market plummet when it was ...

11d

Meta's new small reasoning model shows industry shift toward tiny AI for enterprise applications

As Meta unveils its powerful on-device reasoner, a wider industry trend emerges where small, specialized models are solving enterprise challenges around cost, privacy, and control.

Scientific American

Secrets of DeepSeek AI Model Revealed in Landmark Paper

The first peer-reviewed study of the DeepSeek AI model shows how a Chinese start-up firm made the market-shaking LLM for $300 ...

Anthropic says its new AI model “maintained focus” for 30 hours on multistep tasks

Claude 4.5 is available everywhere today. Through the API, the model maintains the same pricing as Claude Sonnet 4, at $3 per ...

11d

DeepSeek on the Cover of Nature: AI Learns Reasoning Independently of Human Instruction

While effective, this approach has notable limitations: it heavily relies on human annotations, making it costly and difficult to scale; models only mimic humans, struggling to surpass human reasoning ...
