Bryan Catanzaro
Builder · Santa Clara
@bcatanzaro
VP of Applied Deep Learning at NVIDIA. Led teams behind CUDA deep learning libraries, Megatron-LM, and GPU-accelerated AI infrastructure.
Skills
Looking For
Research Foundations (2)
Projects (1)
Megatron-LM
NVIDIA's framework for training large transformer language models efficiently across thousands of GPUs using model and data parallelism.
Built on research:
AI Suggested Researchers
Researchers whose work may be relevant to your projects (auto-detected)
Luke Zettlemoyer
University of Washington / Meta AI
Hannaneh Hajishirzi
University of Washington / Allen AI
Kyunghyun Cho
New York University / Genentech
Aaron Courville
Mila / Université de Montréal
AI Suggested Papers
Papers that may have inspired your projects (auto-detected by domain & keyword analysis)
For Megatron-LM:
OLMo: Accelerating the Science of Language Models
ACL 2024 · 2024 · 800 citations
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
NAACL 2019 · 2019 · 108,124 citations
Language Models are Few-Shot Learners
NeurIPS 2020 · 2020 · 3,027 citations