Generative AI
51 researchers · 27 papers · 77 projects · 77 builders
Researchers (51)
Kaiming He
MIT / Google DeepMind
750,000 citations · h-index 67
Yoshua Bengio
Mila / Université de Montréal
447,206 citations · h-index 183
Geoffrey Hinton
University of Toronto
444,501 citations · h-index 137
Oriol Vinyals
Google DeepMind
415,165 citations · h-index 95
Jeff Dean
Google DeepMind
400,701 citations · h-index 124
Alec Radford
OpenAI
346,122 citations · h-index 24
Michael I. Jordan
UC Berkeley
310,000 citations · h-index 155
Jürgen Schmidhuber
KAUST / IDSIA
302,473 citations · h-index 105
David Silver
Ineffable Intelligence / UCL
294,572 citations · h-index 102
Andrew Ng
Stanford University / Coursera
283,000 citations · h-index 130
Alex Krizhevsky
Independent
264,618 citations · h-index 14
Aidan Gomez
Cohere
260,248 citations · h-index 15
Yann LeCun
Meta AI / NYU
245,920 citations · h-index 118
Ilya Sutskever
Safe Superintelligence Inc.
220,945 citations · h-index 64
Diederik P. Kingma
Google DeepMind
185,000 citations · h-index 30
Demis Hassabis
Google DeepMind
180,509 citations · h-index 89
Karen Simonyan
DeepMind
175,000 citations · h-index 36
Tomas Mikolov
Czech Institute of Informatics (CIIRC)
165,000 citations · h-index 42
Kyunghyun Cho
New York University / Genentech
140,000 citations · h-index 82
Aaron Courville
Mila / Université de Montréal
135,000 citations · h-index 72
Jacob Devlin
127,030 citations · h-index 23
Chelsea Finn
Stanford University
117,781 citations · h-index 82
Samy Bengio
Apple AI/ML
112,069 citations · h-index 72
Ruslan Salakhutdinov
Carnegie Mellon University
97,000 citations · h-index 85
Phillip Isola
MIT CSAIL
95,000 citations · h-index 52
Rob Fergus
Meta AI / NYU
91,000 citations · h-index 80
Max Welling
University of Amsterdam / Microsoft Research
86,000 citations · h-index 88
Ian Goodfellow
Google DeepMind
82,619 citations · h-index 64
Hugo Larochelle
Google DeepMind / Mila
72,000 citations · h-index 68
Dario Amodei
Anthropic
68,000 citations · h-index 42
Daniela Rus
MIT CSAIL
65,000 citations · h-index 108
Saining Xie
New York University
62,000 citations · h-index 38
Luke Zettlemoyer
University of Washington / Meta AI
62,000 citations · h-index 78
Andrej Karpathy
Independent
58,419 citations · h-index 20
Tommi Jaakkola
MIT CSAIL
58,000 citations · h-index 92
Daphne Koller
insitro / Stanford
57,022 citations · h-index 118
Tom Brown
OpenAI
52,000 citations · h-index 25
Nando de Freitas
Google DeepMind
48,000 citations · h-index 72
Anima Anandkumar
Caltech / NVIDIA
42,000 citations · h-index 68
Stefano Ermon
Stanford University
42,000 citations · h-index 58
Shakir Mohamed
Google DeepMind
34,000 citations · h-index 52
Thomas Kipf
Google DeepMind
32,000 citations · h-index 22
Hannaneh Hajishirzi
University of Washington / Allen AI
32,000 citations · h-index 55
Noam Shazeer
28,748 citations · h-index 34
Petar Velickovic
Google DeepMind
28,000 citations · h-index 30
David Sontag
MIT CSAIL
28,000 citations · h-index 55
Sanja Fidler
University of Toronto / NVIDIA
28,000 citations · h-index 58
Ryan Adams
Princeton University
28,000 citations · h-index 52
Alexander Rush
Cornell University
22,000 citations · h-index 45
Hao Li
Pinscreen / UC Berkeley
12,000 citations · h-index 38
Ashish Vaswani
Essential AI
9,969 citations · h-index 28
Papers (27)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Long Short-Term Memory
Highly accurate protein structure prediction with AlphaFold
Auto-Encoding Variational Bayes
Generative Adversarial Nets
Language Models are Few-Shot Learners
Image-to-Image Translation with Conditional Adversarial Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
High-Resolution Image Synthesis with Latent Diffusion Models
Denoising Diffusion Probabilistic Models
GPT-4 Technical Report
Llama 2: Open Foundation and Fine-Tuned Chat Models
Attention Is All You Need
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Hierarchical Text-Conditional Image Generation with CLIP Latents
Variational Graph Auto-Encoders
Language Models are Few-Shot Learners
COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
Data-Centric Artificial Intelligence: A Survey
CPM: A Large-scale Generative Chinese Pre-trained Language Model
Projects (77)
Ideogram
AI image generation platform known for reliable text rendering in images. Ideogram 2.0 sets the standard for typography accuracy in generated visuals.
by Mohammad Norouzi
Build a Large Language Model From Scratch
Hands-on book and code repository teaching LLM internals from the ground up. Covers pretraining, fine-tuning, and RLHF with clear PyTorch implementations.
by Sebastian Raschka
Pinecone AI Education & Canopy
Developer education content and open-source RAG framework for Pinecone. Tutorials on vector search, embeddings, and building production RAG systems.
by James Briggs
sentence-transformers
The most popular library for computing dense vector representations of text. Powers semantic search, clustering, and similarity across millions of applications.
by Nils Reimers
TRL (Transformer Reinforcement Learning)
Hugging Face library for training language models with RLHF, DPO, and PPO. The standard open-source toolkit for alignment fine-tuning.
by Lewis Tunstall
Hugging Face Inference Endpoints
Production deployment platform for transformer models. One-click deployment of any Hugging Face model with auto-scaling and optimized inference.
by Philipp Schmid
Diffusers
Hugging Face library for state-of-the-art diffusion models. Provides pretrained pipelines for image, audio, and 3D generation with simple APIs.
by Sayak Paul
GPT-NeoX / Pythia
EleutherAI's open-source large language model suite. Pythia provides 16 model checkpoints (70M-12B) for scientific study of LLM training dynamics.
by Stella Biderman
Together AI Platform
Cloud platform for training, fine-tuning, and running open-source AI models. Offers the fastest inference for Llama, Mixtral, and other open models.
by Vinicius Moens
Mistral AI Models
Open-weight frontier models from Europe. Mistral 7B, Mixtral 8x7B, and Mistral Large compete with models 10x their size through architecture innovations.
by Arthur Mensch
Sakana AI — Evolutionary Model Merging
Nature-inspired AI research lab using evolutionary algorithms to automatically merge and create new foundation models without training from scratch.
by David Ha
Imbue AI Agents
AI systems that reason about code and build software autonomously. Imbue trains models optimized for practical reasoning and agent capabilities.
by Kanjun Qiu
Adept ACT-1
AI model trained to use software tools like a human. ACT-1 can navigate web browsers, use spreadsheets, and operate enterprise software via natural language.
by David Luan
Runway Gen-3
AI-powered creative suite for video generation and editing. Gen-3 Alpha produces photorealistic video from text and image prompts with fine temporal control.
by Cristóbal Valenzuela
Midjourney
The world's most popular AI image generation platform. Known for stunning artistic quality and used by millions of creators, designers, and artists.
by David Holz
Jasper AI
AI content generation platform for enterprise marketing teams. Used by 100K+ businesses to create on-brand copy, blog posts, and campaigns at scale.
by Dave Rogenmoser
Character.AI
Platform for creating and chatting with AI characters. Over 20M monthly users interact with millions of user-created AI personalities and assistants.
by Daniel De Freitas
Inflection Pi / Microsoft AI
Personal AI assistant designed for emotional intelligence and natural conversation. Pi pioneered empathetic AI before Mustafa moved to lead Microsoft AI.
by Mustafa Suleyman
Perplexity AI
AI-native answer engine with real-time web search and citations. Redefining search by providing direct, sourced answers instead of link lists.
by Aravind Srinivas
Pika
AI video generation platform that creates and edits cinematic videos from text, images, and existing clips. Known for intuitive creative controls.
by Demi Guo
ElevenLabs Voice AI
Most realistic AI voice synthesis platform. Offers voice cloning, text-to-speech in 29 languages, and AI dubbing used by creators and enterprises worldwide.
by Mati Staniszewski
Synthesia EXPRESSIVE-1
AI video generation platform creating realistic presenter-led videos from text. Used by 50K+ companies for training, marketing, and internal communications.
by Victor Riparbelli
Photoroom
AI photo editing app that instantly removes backgrounds, generates product images, and creates studio-quality visuals. Used by 150M+ people globally.
by Matthieu Rouif
Snorkel AI
Data-centric AI platform. Write labeling functions instead of hand-labeling data. Used by Google, Apple, Intel, and government agencies.
by Alex Ratner
Weights & Biases
ML experiment tracking, model versioning, and dataset management. The MLOps platform used by OpenAI, DeepMind, and 500K+ ML practitioners.
by Lukas Biewald
DataRobot
Enterprise AutoML platform. Automates model building, deployment, and monitoring. Serves Fortune 500 companies from Boston headquarters.
by Jeremy Fiance
Cohere
Enterprise LLM platform built by a Transformer paper co-author. RAG, embeddings, and language understanding for enterprise search and generation.
by Aidan Gomez
Allen AI (AI2)
Nonprofit AI research lab founded by Paul Allen. Builds open-source AI tools: Semantic Scholar, OLMo, AI2 THOR. Mission: AI for the common good.
by Ali Farhadi
Semantic Scholar
AI-powered academic search engine by Allen AI. Indexes 200M+ papers with AI-generated TLDRs, citation contexts, and research recommendations.
by Michael Schmitz
Lexion AI
AI contract management for enterprises. Uses NLP to extract key terms, track obligations, and automate legal document workflows.
by Ammad Ahmad
OLMo
Fully open language model from Allen AI. Open training data (Dolma), open code, open weights, open evals. Making LLM science reproducible.
by Dirk Groeneveld
Megatron-LM
NVIDIA's framework for training large transformer language models efficiently across thousands of GPUs using model and data parallelism.
by Bryan Catanzaro
PyTorch
The most widely used open-source deep learning framework. Provides dynamic computation graphs, GPU acceleration, and a rich ecosystem for research and production.
by Soumith Chintala
Hugging Face Transformers
The definitive open-source library for state-of-the-art NLP, vision, and multimodal models. Provides 200K+ pretrained models and unified APIs.
by Thomas Wolf
Stable Diffusion / FLUX
Latent diffusion models for high-resolution image synthesis. Stable Diffusion democratized AI art; FLUX is the next-generation architecture from Black Forest Labs.
by Robin Rombach
RAG (Retrieval-Augmented Generation)
The original RAG framework combining retrieval and generation for knowledge-intensive NLP. Now a foundational pattern used across the LLM ecosystem.
by Patrick Lewis
LoRA
Low-Rank Adaptation of Large Language Models. The most widely adopted parameter-efficient fine-tuning method, reducing trainable parameters by 10,000x.
by Edward Hu
bitsandbytes / QLoRA
Efficient 4-bit quantization library enabling fine-tuning of 65B parameter models on a single 48GB GPU. QLoRA made LLM fine-tuning accessible to everyone.
by Tim Dettmers
FlashAttention
IO-aware exact attention algorithm that is 2-4x faster and uses 5-20x less memory. Now integrated into PyTorch and used by virtually all LLM training runs.
by Tri Dao
Mamba
Selective state space model architecture offering linear-time sequence modeling. Achieves transformer-quality performance with 5x faster inference throughput.
by Albert Gu
Craiyon (DALL-E Mini)
Open-source text-to-image model that went viral in 2022. Demonstrated that AI image generation could be accessible and fun for millions of users.
by Boris Dayma
Hugging Face Spaces & Demos
Interactive ML art demos and creative AI tools on Hugging Face Spaces. Bridging the gap between generative AI research and creative communities.
by Apolinário Passos
Qwen
Alibaba's family of large language and multimodal models. Qwen2 series achieves top performance among open-weight models across benchmarks.
by Junyang Lin
Reka AI Models
Multimodal AI models from Reka AI. Reka Core, Flash, and Edge deliver frontier performance across text, image, video, and audio understanding.
by Yi Tay
PaLM Training Infrastructure
Scaling infrastructure behind Google's PaLM model. Achieved efficient training of 540B parameter models across 6144 TPU v4 chips.
by Reiner Pope
Open Assistant
Community-driven open-source project to create a free, high-quality chat assistant dataset and model. One of the earliest open RLHF efforts.
by Christoph Schuhmann
Circuits / Mechanistic Interpretability
Pioneering research on understanding neural networks by reverse-engineering their internal mechanisms. Published the influential Circuits thread on Distill.
by Chris Olah
Lil'Log
Comprehensive technical blog covering LLMs, diffusion, RL, and more. The most widely cited personal ML blog, serving as a reference for researchers worldwide.
by Lilian Weng
Contextual AI RAG 2.0
Enterprise RAG platform that goes beyond naive retrieval. Contextual AI builds RAG-native language models trained end-to-end for grounded generation.
by Douwe Kiela
DSPy
Programming framework for optimizing LLM pipelines. Replaces hand-written prompts with composable, learnable modules that auto-optimize via compilation.
by Omar Khattab
ALiBi Positional Encoding
Attention with Linear Biases enables transformers to generalize to longer sequences than seen during training without learned position embeddings.
by Ofir Press
Machine Learning Mastery
The largest practitioner-focused ML education platform with 1000+ tutorials and 20+ books covering deep learning, NLP, computer vision, and time series.
by Jason Brownlee
OpenAI API Platform
Built and scaled OpenAI's API platform serving GPT-4, DALL-E, and Whisper to millions of developers. Architected the product powering ChatGPT.
by Peter Welinder
OpenAI Engineering
Co-founded OpenAI and led engineering from GPT-1 through ChatGPT and GPT-4. Built the infrastructure and team behind the most impactful AI products.
by Greg Brockman
Thinking Machines Lab
AI research startup founded after departing OpenAI as CTO. Led the technical development of ChatGPT, GPT-4, and DALL-E at OpenAI.
by Mira Murati
Scale AI Data Platform
Enterprise data labeling and AI infrastructure platform. Powers training data for OpenAI, Meta, and US DoD with human-in-the-loop annotation at scale.
by Alexandr Wang
Hugging Face Transformers
The most popular open-source library for state-of-the-art NLP, vision, and audio models. Hosts 500K+ models and 100K+ datasets. The GitHub of machine learning.
by Clem Delangue
Pioneer Fund
AI-first venture fund and talent accelerator co-founded with Nat Friedman. Finds and funds the most ambitious AI builders globally using AI-powered scouting.
by Daniel Gross
AI Grants & Investments
Portfolio of AI investments and open-source contributions. Co-leads Pioneer Fund. Previously acquired GitHub for Microsoft and shipped Copilot.
by Nat Friedman
OpenAI Cookbook & DevRel
Built OpenAI's developer ecosystem from scratch. Created the OpenAI Cookbook with 200+ examples, grew the API community to millions of developers.
by Logan Kilpatrick
Latent Space & smol.ai
Latent Space is the leading AI engineering podcast and newsletter. smol.ai builds small, useful AI developer tools. Defined the AI Engineer role.
by Swyx (Shawn Wang)
fast.ai
Free deep learning courses and the fastai library. Made cutting-edge deep learning accessible to anyone who can code. Used by hundreds of thousands of students.
by Jeremy Howard
ML Paper Explanations (YouTube)
YouTube channel with 250K+ subscribers explaining cutting-edge ML papers. Co-created OpenAssistant, an open-source ChatGPT alternative.
by Yannic Kilcher
Replit
AI-powered coding platform used by 30M+ developers. Features Replit Agent for building full-stack apps from natural language prompts in the browser.
by Amjad Masad
Stable Diffusion
Open-source text-to-image diffusion model that democratized AI image generation. Runs locally on consumer GPUs and powers thousands of apps.
by Emad Mostaque
Chain-of-Thought Research
Pioneering research on chain-of-thought prompting, instruction tuning, and emergent abilities of large language models at OpenAI.
by Jason Wei
tinygrad
A tiny neural network framework. Under 1000 lines of code, supports GPU acceleration. The antithesis of PyTorch complexity.
by George Hotz
v0 by Vercel
AI-powered UI generation. Describe a component in natural language, get production-ready React/Tailwind code. Uses LLMs for code generation.
by Guillermo Rauch
llm
CLI tool for interacting with Large Language Models. Supports OpenAI, Claude, local models via plugins. Log and search prompts.
by Simon Willison
DocETL
AI-powered data processing pipelines for unstructured documents. LLM-driven extraction, transformation, and analysis at scale.
by Shreya Shankar
Suno AI
AI music generation from text. Type a description, get a full song with vocals, instruments, and production. Used by millions.
by Martin Camacho
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs. Reproduces GPT-2 (124M) in ~$20 of compute.
by Andrej Karpathy
MosaicML / DBRX
Open-source efficient LLM training platform. DBRX is a 132B MoE model rivaling GPT-3.5. Acquired by Databricks for $1.3B.
by Jonathan Frankle
LangChain
Framework for developing applications powered by language models. Connects LLMs to external data, tools, and agents.
by Harrison Chase
MIT 6.S191 — Introduction to Deep Learning
MIT's official introductory course on deep learning. Open courseware with labs covering CNNs, RNNs, GANs, reinforcement learning, and more.
by Alexander Amini
Liquid Neural Networks
Open-source implementation of Liquid Time-Constant networks — continuous-time neural networks inspired by biological neurons.
by Mathias Lechner
Liquid Foundation Models
Compact, efficient foundation models built on Liquid Neural Networks. State-space models that outperform transformers at smaller sizes.
by Ramin Hasani