Scholars Andrew G. Barto and Richard S. Sutton pioneered reinforcement learning long before it became a key tool in AI.
Andrew Barto and Richard Sutton win the Turing Award for their pioneering work in reinforcement learning, shaping AI advancements from ChatGPT to robotics.
After demonstrating the benefits of reinforcement learning with a faster version of Boston Dynamics’ Spot, the Robotics and AI Institute has shared a video of an impressive robotic bike that can balance and jump without a dedicated stabilization system.
ACM has named Andrew G. Barto and Richard S. Sutton as the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning. In a series of papers beginning in the 1980s, Barto and Sutton introduced the field's main ideas.
Boston Dynamics founder Marc Raibert says reinforcement learning is helping his creations gain more independence.
Teaching machines in the way that animal trainers mold the behavior of dogs or horses has been an important method for developing artificial intelligence, and one that was recognized Wednesday with the top computer science award.
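The trainer-and-reward idea described above is the core of reinforcement learning: an agent tries actions, receives rewards, and gradually learns which behaviors pay off. As a minimal sketch, here is tabular Q-learning, one of the foundational algorithms associated with Barto and Sutton's line of work. The toy environment (a five-state corridor with a reward at the far end) and all parameter values are illustrative assumptions, not taken from any of the articles.

```python
import random

# Toy environment: states 0..4 in a corridor; reaching state 4 ends the
# episode and yields the only reward. This setup is a made-up example.
N_STATES = 5
ACTIONS = [-1, +1]            # move left or right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.1  # learning rate, discount, exploration

def step(state, action):
    """Apply an action; reward 1.0 only when the goal state is reached."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done

def greedy(q, state):
    """Pick the highest-valued action, breaking ties at random."""
    best = max(q[(state, a)] for a in ACTIONS)
    return random.choice([a for a in ACTIONS if q[(state, a)] == best])

def train(episodes=500, seed=0):
    random.seed(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: mostly exploit current estimates, occasionally explore.
            if random.random() < EPSILON:
                action = random.choice(ACTIONS)
            else:
                action = greedy(q, state)
            next_state, reward, done = step(state, action)
            best_next = max(q[(next_state, a)] for a in ACTIONS)
            # Temporal-difference update toward reward + discounted future value.
            q[(state, action)] += ALPHA * (reward + GAMMA * best_next - q[(state, action)])
            state = next_state
    return q

q = train()
policy = ["right" if q[(s, +1)] > q[(s, -1)] else "left" for s in range(N_STATES - 1)]
print(policy)
```

After training, the learned greedy policy moves right in every non-terminal state, i.e. the agent has discovered the rewarded behavior purely from trial and error, without being shown the answer.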
A new study suggests reasoning models from DeepSeek and OpenAI are learning to manipulate on their own.
These newer models appear more likely to indulge in rule-bending behaviors than previous generations—and there’s no way to stop them.
The field of cancer treatment has long struggled with the immense costs and time-consuming nature of drug development. Traditional methods often take over a decade and billions of dollars to bring a single drug to market.
Current research and industry practice demonstrate that AI safety requires a multi-layered approach, combining explainability methods with secure training procedures, adversarial validation, and continuous performance monitoring.
Alibaba Cloud on Thursday launched QwQ-32B, a compact reasoning model built on its latest large language model (LLM), Qwen2.5-32B, which it says delivers performance comparable to much larger cutting-edge models, including those from Chinese rival DeepSeek and OpenAI's o1, with only 32 billion parameters.