Reinforcement Learning

12 researchers · 6 papers · 4 projects · 4 builders

Researchers (12)

Oriol Vinyals

Google DeepMind

415,165 citations · h-index 95

Jürgen Schmidhuber

KAUST / IDSIA

302,473 citations · h-index 105

David Silver

Ineffable Intelligence / UCL

294,572 citations · h-index 102

Demis Hassabis

Google DeepMind

180,509 citations · h-index 89

Richard Sutton

University of Alberta / DeepMind

142,000 citations · h-index 78

Chelsea Finn

Stanford University

117,781 citations · h-index 82

Stuart Russell

UC Berkeley

115,000 citations · h-index 86

Jason Weston

Meta AI

112,000 citations · h-index 88

Pieter Abbeel

UC Berkeley / Covariant

76,697 citations · h-index 124

Sergey Levine

UC Berkeley

63,215 citations · h-index 110

Nando de Freitas

Google DeepMind

48,000 citations · h-index 72

Karol Hausman

Google DeepMind / Stanford

16,000 citations · h-index 40

Papers (6)

Reinforcement Learning: An Introduction

MIT Press · 2018 · 65,000 citations

Mastering the game of Go with deep neural networks and tree search

Nature · 2016 · 20,000 citations

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

ICML 2017 · 2017 · 16,000 citations

Neural Architecture Search with Reinforcement Learning

ICLR 2017 · 2017 · 5,100 citations

A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

AISTATS 2011 · 2011 · 3,500 citations

A Path Towards Autonomous Machine Intelligence

OpenReview · 2022 · 6 citations

Projects (4)

TRL (Transformer Reinforcement Learning)

Hugging Face library for training language models with RLHF-style methods, including PPO and DPO. A widely used open-source toolkit for alignment fine-tuning.

by Lewis Tunstall

Covariant Brain

Universal AI for robotic manipulation. The Covariant Brain enables robots to pick, place, and handle novel objects in real-world warehouse and logistics settings.

by Peter Chen

DeepMind Robotics Research

Pioneering continual learning and sim-to-real transfer for embodied agents. Research on navigation, manipulation, and lifelong learning.

by Raia Hadsell

Lil'Log

Comprehensive technical blog covering LLMs, diffusion, RL, and more. One of the most widely cited personal ML blogs, used as a reference by researchers worldwide.

by Lilian Weng