← Back to Directory
Tri Dao

Tri Dao

Builder

Princeton

@tridao

Assistant Professor at Princeton and co-founder of Together AI. Creator of FlashAttention — the IO-aware exact attention algorithm used everywhere.

Skills

FlashAttentionsystems MLCUDApythonefficient transformers

Looking For

collaboratorhiring

Projects (1)

FlashAttention

IO-aware exact attention algorithm that is 2-4x faster and uses 5-20x less memory. Now integrated into PyTorch and used by virtually all LLM training runs.