Reinforcement Learning
12 researchers · 6 papers · 4 projects · 4 builders
Researchers (12)
Oriol Vinyals
Google DeepMind
415,165 citations · h-index 95
Jürgen Schmidhuber
KAUST / IDSIA
302,473 citations · h-index 105
David Silver
Ineffable Intelligence / UCL
294,572 citations · h-index 102
Demis Hassabis
Google DeepMind
180,509 citations · h-index 89
Richard Sutton
University of Alberta / DeepMind
142,000 citations · h-index 78
Chelsea Finn
Stanford University
117,781 citations · h-index 82
Stuart Russell
UC Berkeley
115,000 citations · h-index 86
Jason Weston
Meta AI
112,000 citations · h-index 88
Pieter Abbeel
UC Berkeley / Covariant
76,697 citations · h-index 124
Sergey Levine
UC Berkeley
63,215 citations · h-index 110
Nando de Freitas
Google DeepMind
48,000 citations · h-index 72
Karol Hausman
Google DeepMind / Stanford
16,000 citations · h-index 40
Papers (6)
Reinforcement Learning: An Introduction
Mastering the game of Go with deep neural networks and tree search
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Neural Architecture Search with Reinforcement Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
A Path Towards Autonomous Machine Intelligence
Projects (4)
TRL (Transformer Reinforcement Learning)
Hugging Face library for training language models with RLHF, DPO, and PPO. The standard open-source toolkit for alignment fine-tuning.
by Lewis Tunstall
Covariant Brain
Universal AI for robotic manipulation. The Covariant Brain enables robots to pick, place, and handle novel objects in real-world warehouse and logistics settings.
by Peter Chen
DeepMind Robotics Research
Pioneering continual learning and sim-to-real transfer for embodied agents. Research on navigation, manipulation, and lifelong learning.
by Raia Hadsell
Lil'Log
Comprehensive technical blog covering LLMs, diffusion, RL, and more. The most widely cited personal ML blog, serving as a reference for researchers worldwide.
by Lilian Weng