Lewis Tunstall
Builder · Zurich
@lewtun
ML Engineer at Hugging Face. Co-author of 'Natural Language Processing with Transformers' and maintainer of the TRL library for RLHF.
Skills
Looking For
Research Foundations (2)
Projects (1)
Hugging Face library for training language models with PPO-based RLHF and DPO. A widely used open-source toolkit for alignment fine-tuning.
Built on research:
AI Suggested Researchers
Researchers whose work may be relevant to your projects (auto-detected)
Jürgen Schmidhuber
KAUST / IDSIA
Oriol Vinyals
Google DeepMind
Jason Weston
Meta AI
Hannaneh Hajishirzi
University of Washington / Allen AI
AI Suggested Papers
Papers that may have inspired your projects (auto-detected by domain & keyword analysis)
For TRL (Transformer Reinforcement Learning):
Language Models are Few-Shot Learners
NeurIPS 2020 · 2020 · 18,500 citations
Llama 2: Open Foundation and Fine-Tuned Chat Models
arXiv preprint · 2023 · 7,200 citations