Lewis Tunstall

Builder

Zurich

@lewtun

ML Engineer at Hugging Face. Co-author of 'Natural Language Processing with Transformers' and maintainer of the TRL library for RLHF.

Skills

RLHFNLPtransformerspythonopen source

Looking For

collaborator

Research Foundations (2)

Attention Is All You Need

NeurIPS 201720176,510 citations

Used by: TRL (Transformer Reinforcement Learning)

Large-scale Video Classification with Convolutional Neural Networks

CVPR 201420145,800 citations

Used by: TRL (Transformer Reinforcement Learning)

Projects (1)

TRL (Transformer Reinforcement Learning)

GitHub Live

Hugging Face library for training language models with RLHF, DPO, and PPO. The standard open-source toolkit for alignment fine-tuning.

NLP Generative AI Reinforcement Learning

Built on research:

paperAttention Is All You Need(2017)

paperLarge-scale Video Classification with Convolutional Neural Networks(2014)

AI Suggested Researchers

Researchers whose work may be relevant to your projects (auto-detected)

Jürgen Schmidhuber

KAUST / IDSIA

Shared domains: NLP, Generative AI, Reinforcement Learning

61% matchAI suggested

Oriol Vinyals

Google DeepMind

Shared domains: NLP, Generative AI, Reinforcement Learning

60% matchAI suggested

Jason Weston

Meta AI

Shared domains: NLP, Reinforcement Learning

51% matchAI suggested

Hannaneh Hajishirzi

University of Washington / Allen AI

Shared domains: NLP, Generative AI

50% matchAI suggested

AI Suggested Papers

Papers that may have inspired your projects (auto-detected by domain & keyword analysis)

For TRL (Transformer Reinforcement Learning):

Language Models are Few-Shot Learners

NeurIPS 2020 · 2020 · 18,500 citations

Shared domains: NLP, Generative AIRelated keywords: language, models, fine-tuning, nlpHighly cited (18,500 citations)

48% match

Llama 2: Open Foundation and Fine-Tuned Chat Models

arXiv preprint · 2023 · 7,200 citations

Shared domains: NLP, Generative AIRelated keywords: language, models, open-source, open

46% match

Language Models are Few-Shot Learners

NeurIPS 2020 · 2020 · 3,027 citations

Shared domains: NLP, Generative AIRelated keywords: language, models

45% match

Website Twitter GitHub