Andrej Karpathy
Builder · San Francisco
@karpathy
Building Eureka Labs, focused on AI-native education. Created nanoGPT and teaches how to build neural networks from scratch on YouTube. Previously Director of AI at Tesla and a founding member of OpenAI.
Research Foundations (2)
Projects (1)
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs. Reproduces GPT-2 (124M) in ~$20 of compute.
AI Suggested Researchers
Researchers whose work may be relevant to your projects (auto-detected)
Kyunghyun Cho
New York University / Genentech
Aaron Courville
Mila / Université de Montréal
Luke Zettlemoyer
University of Washington / Meta AI
Hannaneh Hajishirzi
University of Washington / Allen AI
AI Suggested Papers
Papers that may have inspired your projects (auto-detected by domain & keyword analysis)
For nanoGPT:
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
NAACL 2019 · 2019 · 108,124 citations
Long Short-Term Memory
Neural Computation · 1997 · 95,000 citations
COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
ACL 2019 · 2019 · 1,500 citations