Reinforcement Learning Models

19h

Leadership Amid Uncertainty: CEOs Can Learn Effective Decision Making From Reinforcement Learning

Let’s look at how RL agents are trained to deal with ambiguity, and it may provide a blueprint of leadership lessons to ...

Gradient Launches Echo-2 to Break the Cost Barrier Holding Back the Next Wave of AI Progress

Echo-2 is designed to change that dynamic. Rather than forcing all training to run inside tightly controlled clusters, the system allows reinforcement learning workloads to be spr ...

Geeky Gadgets

Reinforcement Learning for LLMs in 2025

Imagine trying to teach a child how to solve a tricky math problem. You might start by showing them examples, guiding them step by step, and encouraging them to think critically about their approach.

MIT's new fine-tuning method lets LLMs learn new skills without losing old ones

MIT researchers unveil a new fine-tuning method that lets enterprises consolidate their "model zoos" into a single, continuously learning agent.

EurekAlert!

Reinforcement learning world models for catalyst surface reconstruction: state-of-the-art review

This work presents an AI-based world model framework that simulates atomic-level reconstructions in catalyst surfaces under dynamic conditions. Focusing on AgPd nanoalloys, it leverages Dreamer-style ...

TTT-Discover optimizes GPU kernels 2x faster than human experts — by training during inference

A new technique from Stanford, Nvidia, and Together AI lets models learn during inference rather than relying on static ...

10don MSN

Computational models predict neural activity for re-establishing connectivity after stroke or injury

Researchers at The Hong Kong University of Science and Technology (HKUST) School of Engineering have developed a novel reinforcement learning–based generative model to predict neural signals, creating ...

Hosted on MSN

New look at dopamine signaling suggests neuroscientists' model of reinforcement learning may need to be revised

Dopamine is a powerful signal in the brain, influencing our moods, motivations, movements, and more. The neurotransmitter is crucial for reward-based learning, a function that may be disrupted in a ...

Researchers propose a self-distillation fix for ‘catastrophic forgetting’ in LLMs

LLMs tend to lose prior skills when fine-tuned for new tasks. A new self-distillation approach aims to reduce regression and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results