Generative AI

51 researchers · 27 papers · 77 projects · 77 builders

Researchers (51)

Kaiming He

MIT / Google DeepMind

750,000 citations · h-index 67

Yoshua Bengio

Mila / Université de Montréal

447,206 citations · h-index 183

Geoffrey Hinton

University of Toronto

444,501 citations · h-index 137

Oriol Vinyals

Google DeepMind

415,165 citations · h-index 95

Jeff Dean

Google DeepMind

400,701 citations · h-index 124

Alec Radford

OpenAI

346,122 citations · h-index 24

Michael I. Jordan

UC Berkeley

310,000 citations · h-index 155

Jürgen Schmidhuber

KAUST / IDSIA

302,473 citations · h-index 105

David Silver

Ineffable Intelligence / UCL

294,572 citations · h-index 102

Andrew Ng

Stanford University / Coursera

283,000 citations · h-index 130

Alex Krizhevsky

Independent

264,618 citations · h-index 14

Aidan Gomez

Cohere

260,248 citations · h-index 15

Yann LeCun

Meta AI / NYU

245,920 citations · h-index 118

Ilya Sutskever

Safe Superintelligence Inc.

220,945 citations · h-index 64

Diederik P. Kingma

Google DeepMind

185,000 citations · h-index 30

Demis Hassabis

Google DeepMind

180,509 citations · h-index 89

Karen Simonyan

DeepMind

175,000 citations · h-index 36

Tomas Mikolov

Czech Institute of Informatics (CIIRC)

165,000 citations · h-index 42

Kyunghyun Cho

New York University / Genentech

140,000 citations · h-index 82

Aaron Courville

Mila / Université de Montréal

135,000 citations · h-index 72

Jacob Devlin

Google

127,030 citations · h-index 23

Chelsea Finn

Stanford University

117,781 citations · h-index 82

Samy Bengio

Apple AI/ML

112,069 citations · h-index 72

Ruslan Salakhutdinov

Carnegie Mellon University

97,000 citations · h-index 85

Phillip Isola

MIT CSAIL

95,000 citations · h-index 52

Rob Fergus

Meta AI / NYU

91,000 citations · h-index 80

Max Welling

University of Amsterdam / Microsoft Research

86,000 citations · h-index 88

Ian Goodfellow

Google DeepMind

82,619 citations · h-index 64

Hugo Larochelle

Google DeepMind / Mila

72,000 citations · h-index 68

Dario Amodei

Anthropic

68,000 citations · h-index 42

Daniela Rus

MIT CSAIL

65,000 citations · h-index 108

Saining Xie

New York University

62,000 citations · h-index 38

Luke Zettlemoyer

University of Washington / Meta AI

62,000 citations · h-index 78

Andrej Karpathy

Independent

58,419 citations · h-index 20

Tommi Jaakkola

MIT CSAIL

58,000 citations · h-index 92

Daphne Koller

insitro / Stanford

57,022 citations · h-index 118

Tom Brown

OpenAI

52,000 citations · h-index 25

Nando de Freitas

Google DeepMind

48,000 citations · h-index 72

Anima Anandkumar

Caltech / NVIDIA

42,000 citations · h-index 68

Stefano Ermon

Stanford University

42,000 citations · h-index 58

Shakir Mohamed

Google DeepMind

34,000 citations · h-index 52

Thomas Kipf

Google DeepMind

32,000 citations · h-index 22

Hannaneh Hajishirzi

University of Washington / Allen AI

32,000 citations · h-index 55

Noam Shazeer

Google

28,748 citations · h-index 34

Petar Velickovic

Google DeepMind

28,000 citations · h-index 30

David Sontag

MIT CSAIL

28,000 citations · h-index 55

Sanja Fidler

University of Toronto / NVIDIA

28,000 citations · h-index 58

Ryan Adams

Princeton University

28,000 citations · h-index 52

Alexander Rush

Cornell University

22,000 citations · h-index 45

Hao Li

Pinscreen / UC Berkeley

12,000 citations · h-index 38

Ashish Vaswani

Essential AI

9,969 citations · h-index 28

Papers (27)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

NAACL 20192019108,124 citations

Long Short-Term Memory

Neural Computation199795,000 citations

Highly accurate protein structure prediction with AlphaFold

Nature202142,952 citations

Auto-Encoding Variational Bayes

ICLR 2014201422,000 citations

Generative Adversarial Nets

NeurIPS 2014201421,735 citations

Language Models are Few-Shot Learners

NeurIPS 2020202018,500 citations

Image-to-Image Translation with Conditional Adversarial Networks

CVPR 2017201718,000 citations

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

ICML 2017201716,000 citations

High-Resolution Image Synthesis with Latent Diffusion Models

CVPR 2022202214,000 citations

Denoising Diffusion Probabilistic Models

NeurIPS 2020202012,000 citations

GPT-4 Technical Report

arXiv preprint20238,500 citations

Llama 2: Open Foundation and Fine-Tuned Chat Models

arXiv preprint20237,200 citations

Attention Is All You Need

NeurIPS 201720176,510 citations

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

ICLR 201920196,500 citations

Hierarchical Text-Conditional Image Generation with CLIP Latents

arXiv preprint20225,200 citations

Variational Graph Auto-Encoders

NeurIPS Workshop 201620163,100 citations

Language Models are Few-Shot Learners

NeurIPS 202020203,027 citations

COMET: Commonsense Transformers for Automatic Knowledge Graph Construction

ACL 201920191,500 citations

Data-Centric Artificial Intelligence: A Survey

Communications of the ACM20231,200 citations

CPM: A Large-scale Generative Chinese Pre-trained Language Model

AI Open20211,200 citations

Projects (77)

Ideogram

AI image generation platform known for reliable text rendering in images. Ideogram 2.0 sets the standard for typography accuracy in generated visuals.

by Mohammad Norouzi

Build a Large Language Model From Scratch

Hands-on book and code repository teaching LLM internals from the ground up. Covers pretraining, fine-tuning, and RLHF with clear PyTorch implementations.

by Sebastian Raschka

Pinecone AI Education & Canopy

Developer education content and open-source RAG framework for Pinecone. Tutorials on vector search, embeddings, and building production RAG systems.

by James Briggs

sentence-transformers

The most popular library for computing dense vector representations of text. Powers semantic search, clustering, and similarity across millions of applications.

by Nils Reimers

TRL (Transformer Reinforcement Learning)

Hugging Face library for training language models with RLHF, DPO, and PPO. The standard open-source toolkit for alignment fine-tuning.

by Lewis Tunstall

Hugging Face Inference Endpoints

Production deployment platform for transformer models. One-click deployment of any Hugging Face model with auto-scaling and optimized inference.

by Philipp Schmid

Diffusers

Hugging Face library for state-of-the-art diffusion models. Provides pretrained pipelines for image, audio, and 3D generation with simple APIs.

by Sayak Paul

GPT-NeoX / Pythia

EleutherAI's open-source large language model suite. Pythia provides 16 model checkpoints (70M-12B) for scientific study of LLM training dynamics.

by Stella Biderman

Together AI Platform

Cloud platform for training, fine-tuning, and running open-source AI models. Offers the fastest inference for Llama, Mixtral, and other open models.

by Vinicius Moens

Mistral AI Models

Open-weight frontier models from Europe. Mistral 7B, Mixtral 8x7B, and Mistral Large compete with models 10x their size through architecture innovations.

by Arthur Mensch

Sakana AI — Evolutionary Model Merging

Nature-inspired AI research lab using evolutionary algorithms to automatically merge and create new foundation models without training from scratch.

by David Ha

Imbue AI Agents

AI systems that reason about code and build software autonomously. Imbue trains models optimized for practical reasoning and agent capabilities.

by Kanjun Qiu

Adept ACT-1

AI model trained to use software tools like a human. ACT-1 can navigate web browsers, use spreadsheets, and operate enterprise software via natural language.

by David Luan

Runway Gen-3

AI-powered creative suite for video generation and editing. Gen-3 Alpha produces photorealistic video from text and image prompts with fine temporal control.

by Cristóbal Valenzuela

Midjourney

The world's most popular AI image generation platform. Known for stunning artistic quality and used by millions of creators, designers, and artists.

by David Holz

Jasper AI

AI content generation platform for enterprise marketing teams. Used by 100K+ businesses to create on-brand copy, blog posts, and campaigns at scale.

by Dave Rogenmoser

Character.AI

Platform for creating and chatting with AI characters. Over 20M monthly users interact with millions of user-created AI personalities and assistants.

by Daniel De Freitas

Inflection Pi / Microsoft AI

Personal AI assistant designed for emotional intelligence and natural conversation. Pi pioneered empathetic AI before Mustafa moved to lead Microsoft AI.

by Mustafa Suleyman

Perplexity AI

AI-native answer engine with real-time web search and citations. Redefining search by providing direct, sourced answers instead of link lists.

by Aravind Srinivas

Pika

AI video generation platform that creates and edits cinematic videos from text, images, and existing clips. Known for intuitive creative controls.

by Demi Guo

ElevenLabs Voice AI

Most realistic AI voice synthesis platform. Offers voice cloning, text-to-speech in 29 languages, and AI dubbing used by creators and enterprises worldwide.

by Mati Staniszewski

Synthesia EXPRESSIVE-1

AI video generation platform creating realistic presenter-led videos from text. Used by 50K+ companies for training, marketing, and internal communications.

by Victor Riparbelli

Photoroom

AI photo editing app that instantly removes backgrounds, generates product images, and creates studio-quality visuals. Used by 150M+ people globally.

by Matthieu Rouif

Snorkel AI

Data-centric AI platform. Write labeling functions instead of hand-labeling data. Used by Google, Apple, Intel, and government agencies.

by Alex Ratner

Weights & Biases

ML experiment tracking, model versioning, and dataset management. The MLOps platform used by OpenAI, DeepMind, and 500K+ ML practitioners.

by Lukas Biewald

DataRobot

Enterprise AutoML platform. Automates model building, deployment, and monitoring. Serves Fortune 500 companies from Boston headquarters.

by Jeremy Fiance

Cohere

Enterprise LLM platform built by a Transformer paper co-author. RAG, embeddings, and language understanding for enterprise search and generation.

by Aidan Gomez

Allen AI (AI2)

Nonprofit AI research lab founded by Paul Allen. Builds open-source AI tools: Semantic Scholar, OLMo, AI2 THOR. Mission: AI for the common good.

by Ali Farhadi

Semantic Scholar

AI-powered academic search engine by Allen AI. Indexes 200M+ papers with AI-generated TLDRs, citation contexts, and research recommendations.

by Michael Schmitz

Lexion AI

AI contract management for enterprises. Uses NLP to extract key terms, track obligations, and automate legal document workflows.

by Ammad Ahmad

OLMo

Fully open language model from Allen AI. Open training data (Dolma), open code, open weights, open evals. Making LLM science reproducible.

by Dirk Groeneveld

Megatron-LM

NVIDIA's framework for training large transformer language models efficiently across thousands of GPUs using model and data parallelism.

by Bryan Catanzaro

PyTorch

The most widely used open-source deep learning framework. Provides dynamic computation graphs, GPU acceleration, and a rich ecosystem for research and production.

by Soumith Chintala

Hugging Face Transformers

The definitive open-source library for state-of-the-art NLP, vision, and multimodal models. Provides 200K+ pretrained models and unified APIs.

by Thomas Wolf

Stable Diffusion / FLUX

Latent diffusion models for high-resolution image synthesis. Stable Diffusion democratized AI art; FLUX is the next-generation architecture from Black Forest Labs.

by Robin Rombach

RAG (Retrieval-Augmented Generation)

The original RAG framework combining retrieval and generation for knowledge-intensive NLP. Now a foundational pattern used across the LLM ecosystem.

by Patrick Lewis

LoRA

Low-Rank Adaptation of Large Language Models. The most widely adopted parameter-efficient fine-tuning method, reducing trainable parameters by 10,000x.

by Edward Hu

bitsandbytes / QLoRA

Efficient 4-bit quantization library enabling fine-tuning of 65B parameter models on a single 48GB GPU. QLoRA made LLM fine-tuning accessible to everyone.

by Tim Dettmers

FlashAttention

IO-aware exact attention algorithm that is 2-4x faster and uses 5-20x less memory. Now integrated into PyTorch and used by virtually all LLM training runs.

by Tri Dao

Mamba

Selective state space model architecture offering linear-time sequence modeling. Achieves transformer-quality performance with 5x faster inference throughput.

by Albert Gu

Craiyon (DALL-E Mini)

Open-source text-to-image model that went viral in 2022. Demonstrated that AI image generation could be accessible and fun for millions of users.

by Boris Dayma

Hugging Face Spaces & Demos

Interactive ML art demos and creative AI tools on Hugging Face Spaces. Bridging the gap between generative AI research and creative communities.

by Apolinário Passos

Qwen

Alibaba's family of large language and multimodal models. Qwen2 series achieves top performance among open-weight models across benchmarks.

by Junyang Lin

Reka AI Models

Multimodal AI models from Reka AI. Reka Core, Flash, and Edge deliver frontier performance across text, image, video, and audio understanding.

by Yi Tay

PaLM Training Infrastructure

Scaling infrastructure behind Google's PaLM model. Achieved efficient training of 540B parameter models across 6144 TPU v4 chips.

by Reiner Pope

Open Assistant

Community-driven open-source project to create a free, high-quality chat assistant dataset and model. One of the earliest open RLHF efforts.

by Christoph Schuhmann

Circuits / Mechanistic Interpretability

Pioneering research on understanding neural networks by reverse-engineering their internal mechanisms. Published the influential Circuits thread on Distill.

by Chris Olah

Lil'Log

Comprehensive technical blog covering LLMs, diffusion, RL, and more. The most widely cited personal ML blog, serving as a reference for researchers worldwide.

by Lilian Weng

Contextual AI RAG 2.0

Enterprise RAG platform that goes beyond naive retrieval. Contextual AI builds RAG-native language models trained end-to-end for grounded generation.

by Douwe Kiela

DSPy

Programming framework for optimizing LLM pipelines. Replaces hand-written prompts with composable, learnable modules that auto-optimize via compilation.

by Omar Khattab

ALiBi Positional Encoding

Attention with Linear Biases enables transformers to generalize to longer sequences than seen during training without learned position embeddings.

by Ofir Press

Machine Learning Mastery

The largest practitioner-focused ML education platform with 1000+ tutorials and 20+ books covering deep learning, NLP, computer vision, and time series.

by Jason Brownlee

OpenAI API Platform

Built and scaled OpenAI's API platform serving GPT-4, DALL-E, and Whisper to millions of developers. Architected the product powering ChatGPT.

by Peter Welinder

OpenAI Engineering

Co-founded OpenAI and led engineering from GPT-1 through ChatGPT and GPT-4. Built the infrastructure and team behind the most impactful AI products.

by Greg Brockman

Thinking Machines Lab

AI research startup founded after departing OpenAI as CTO. Led the technical development of ChatGPT, GPT-4, and DALL-E at OpenAI.

by Mira Murati

Scale AI Data Platform

Enterprise data labeling and AI infrastructure platform. Powers training data for OpenAI, Meta, and US DoD with human-in-the-loop annotation at scale.

by Alexandr Wang

Hugging Face Transformers

The most popular open-source library for state-of-the-art NLP, vision, and audio models. Hosts 500K+ models and 100K+ datasets. The GitHub of machine learning.

by Clem Delangue

Pioneer Fund

AI-first venture fund and talent accelerator co-founded with Nat Friedman. Finds and funds the most ambitious AI builders globally using AI-powered scouting.

by Daniel Gross

AI Grants & Investments

Portfolio of AI investments and open-source contributions. Co-leads Pioneer Fund. Previously acquired GitHub for Microsoft and shipped Copilot.

by Nat Friedman

OpenAI Cookbook & DevRel

Built OpenAI's developer ecosystem from scratch. Created the OpenAI Cookbook with 200+ examples, grew the API community to millions of developers.

by Logan Kilpatrick

Latent Space & smol.ai

Latent Space is the leading AI engineering podcast and newsletter. smol.ai builds small, useful AI developer tools. Defined the AI Engineer role.

by Swyx (Shawn Wang)

fast.ai

Free deep learning courses and the fastai library. Made cutting-edge deep learning accessible to anyone who can code. Used by hundreds of thousands of students.

by Jeremy Howard

ML Paper Explanations (YouTube)

YouTube channel with 250K+ subscribers explaining cutting-edge ML papers. Co-created OpenAssistant, an open-source ChatGPT alternative.

by Yannic Kilcher

Replit

AI-powered coding platform used by 30M+ developers. Features Replit Agent for building full-stack apps from natural language prompts in the browser.

by Amjad Masad

Stable Diffusion

Open-source text-to-image diffusion model that democratized AI image generation. Runs locally on consumer GPUs and powers thousands of apps.

by Emad Mostaque

Chain-of-Thought Research

Pioneering research on chain-of-thought prompting, instruction tuning, and emergent abilities of large language models at OpenAI.

by Jason Wei

tinygrad

A tiny neural network framework. Under 1000 lines of code, supports GPU acceleration. The antithesis of PyTorch complexity.

by George Hotz

v0 by Vercel

AI-powered UI generation. Describe a component in natural language, get production-ready React/Tailwind code. Uses LLMs for code generation.

by Guillermo Rauch

llm

CLI tool for interacting with Large Language Models. Supports OpenAI, Claude, local models via plugins. Log and search prompts.

by Simon Willison

DocETL

AI-powered data processing pipelines for unstructured documents. LLM-driven extraction, transformation, and analysis at scale.

by Shreya Shankar

Suno AI

AI music generation from text. Type a description, get a full song with vocals, instruments, and production. Used by millions.

by Martin Camacho

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs. Reproduces GPT-2 (124M) in ~$20 of compute.

by Andrej Karpathy

MosaicML / DBRX

Open-source efficient LLM training platform. DBRX is a 132B MoE model rivaling GPT-3.5. Acquired by Databricks for $1.3B.

by Jonathan Frankle

LangChain

Framework for developing applications powered by language models. Connects LLMs to external data, tools, and agents.

by Harrison Chase

MIT 6.S191 — Introduction to Deep Learning

MIT's official introductory course on deep learning. Open courseware with labs covering CNNs, RNNs, GANs, reinforcement learning, and more.

by Alexander Amini

Liquid Neural Networks

Open-source implementation of Liquid Time-Constant networks — continuous-time neural networks inspired by biological neurons.

by Mathias Lechner

Liquid Foundation Models

Compact, efficient foundation models built on Liquid Neural Networks. State-space models that outperform transformers at smaller sizes.

by Ramin Hasani