How the DeepSeek-R1 AI model was taught to teach itsel

APPs
- Android

Trending News
- Bigg Boss 19
- Asia Cup 2025
- Stock Market
- Cloudburst
- PM Modi
- Heavy Rains
- Bads of Bollywood
- Hydrogen Bomb
- Blackbuck
- Delhi BMW Accident
- India vs Pakistan
- Navaratri 2025
- Movies
- BJP
- Trump Tariffs
- GST
- India Alliance
- Arvind Kejriwal
- Earthquake
- Shiv Sena
- Boycott
- Rahul Gandhi
- AAP
- Congress
Select News Language
APPs
- Android

Updated: 12:00 pm Sep 18, 2025

SENSEX

NIFTY

GOLD

USD/INR

Weather

32C

Science/Tech News

Elections 2025

Science/Tech / The Hindu

details

How the DeepSeek-R1 AI model was taught to teach itself to reason | Explained

Reinforcement learning alone, with the right design, could produce reasoning behaviour that was previously thought to require human examples

17 Sep 2025 8:30 pm

Opinion Polls

With Lok Sabha Voting starting today, what would be Voters focus:

Stable government and continuity

Vote for Development and growth

Change for new government

No opinion

View All