Tri Dao
BuilderPrinceton
@tridao
Assistant Professor at Princeton and co-founder of Together AI. Creator of FlashAttention — the IO-aware exact attention algorithm used everywhere.
Skills
Looking For
Research Foundations (2)
Projects (1)
FlashAttention
IO-aware exact attention algorithm that is 2-4x faster and uses 5-20x less memory. Now integrated into PyTorch and used by virtually all LLM training runs.
Built on research:
AI Suggested Researchers
Researchers whose work may be relevant to your projects (auto-detected)
Luke Zettlemoyer
University of Washington / Meta AI
Hannaneh Hajishirzi
University of Washington / Allen AI
Tomas Mikolov
Czech Institute of Informatics (CIIRC)
Jacob Devlin
AI Suggested Papers
Papers that may have inspired your projects (auto-detected by domain & keyword analysis)
For FlashAttention:
Long Short-Term Memory
Neural Computation · 1997 · 95,000 citations
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
NAACL 2019 · 2019 · 108,124 citations
COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
ACL 2019 · 2019 · 1,500 citations