Albert Gu
BuilderPittsburgh
@albertgu
Assistant Professor at CMU and co-founder of Cartesia. Creator of Mamba and structured state space models (S4) for efficient sequence modeling.
Skills
Looking For
Research Foundations (2)
Projects (1)
Mamba
Selective state space model architecture offering linear-time sequence modeling. Achieves transformer-quality performance with 5x faster inference throughput.
Built on research:
AI Suggested Researchers
Researchers whose work may be relevant to your projects (auto-detected)
Kyunghyun Cho
New York University / Genentech
Aaron Courville
Mila / Université de Montréal
Luke Zettlemoyer
University of Washington / Meta AI
Hannaneh Hajishirzi
University of Washington / Allen AI
AI Suggested Papers
Papers that may have inspired your projects (auto-detected by domain & keyword analysis)
For Mamba:
Language Models are Few-Shot Learners
NeurIPS 2020 · 2020 · 3,027 citations
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
NAACL 2019 · 2019 · 108,124 citations
Long Short-Term Memory
Neural Computation · 1997 · 95,000 citations