Computer Vision
46 researchers · 27 papers · 27 projects · 27 builders
Researchers (46)
Kaiming He
MIT / Google DeepMind
750,000 citations · h-index 67
Ross Girshick
Anthropic (via Vercept)
673,758 citations · h-index 62
Geoffrey Hinton
University of Toronto
444,501 citations · h-index 137
Alec Radford
OpenAI
346,122 citations · h-index 24
Andrew Zisserman
University of Oxford
310,000 citations · h-index 142
Alex Krizhevsky
Independent
264,618 citations · h-index 14
Yann LeCun
Meta AI / NYU
245,920 citations · h-index 118
Fei-Fei Li
Stanford University
216,976 citations · h-index 137
Jitendra Malik
UC Berkeley
196,000 citations · h-index 134
Trevor Darrell
UC Berkeley
175,000 citations · h-index 120
Karen Simonyan
DeepMind
175,000 citations · h-index 36
Jian Sun
Megvii / Face++
148,000 citations · h-index 75
Samy Bengio
Apple AI/ML
112,069 citations · h-index 72
Cordelia Schmid
INRIA / Google Research
98,000 citations · h-index 105
Phillip Isola
MIT CSAIL
95,000 citations · h-index 52
Xiaogang Wang
Chinese University of Hong Kong
95,000 citations · h-index 82
Rob Fergus
Meta AI / NYU
91,000 citations · h-index 80
Antonio Torralba
MIT CSAIL
85,000 citations · h-index 90
Liang-Chieh Chen
Google DeepMind
82,000 citations · h-index 45
Ming-Hsuan Yang
UC Merced / Google Research
78,000 citations · h-index 85
Hugo Larochelle
Google DeepMind / Mila
72,000 citations · h-index 68
Philip Torr
University of Oxford
68,000 citations · h-index 88
Bolei Zhou
UCLA
62,000 citations · h-index 55
Saining Xie
New York University
62,000 citations · h-index 38
Marc'Aurelio Ranzato
Meta AI
62,000 citations · h-index 60
Andrej Karpathy
Independent
58,419 citations · h-index 20
Song Han
MIT
58,000 citations · h-index 62
Ali Farhadi
University of Washington / Allen AI
58,000 citations · h-index 72
Wei Liu
Tencent AI Lab
55,000 citations · h-index 60
Aleksander Madry
MIT CSAIL
55,000 citations · h-index 65
Vladlen Koltun
Apple
55,000 citations · h-index 72
Raquel Urtasun
Waabi / University of Toronto
52,000 citations · h-index 78
Abhinav Gupta
Carnegie Mellon University
45,000 citations · h-index 72
Zhouchen Lin
Peking University
42,000 citations · h-index 68
Silvio Savarese
Salesforce AI Research
42,000 citations · h-index 72
Pushmeet Kohli
Google DeepMind
41,000 citations · h-index 72
Lior Wolf
Tel Aviv University
38,000 citations · h-index 65
Li Jia Li
Google Cloud AI
36,000 citations · h-index 55
Hengshuang Zhao
University of Hong Kong
35,000 citations · h-index 38
Tinne Tuytelaars
KU Leuven
32,000 citations · h-index 62
Ivan Laptev
INRIA Paris / MBZUAI
30,000 citations · h-index 55
Priya Goyal
Meta AI
28,000 citations · h-index 22
Sanja Fidler
University of Toronto / NVIDIA
28,000 citations · h-index 58
Jiajun Wu
Stanford University
16,000 citations · h-index 42
Timnit Gebru
DAIR Institute
15,864 citations · h-index 28
Hao Li
Pinscreen / UC Berkeley
12,000 citations · h-index 38
Papers (27)
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
ImageNet Classification with Deep Convolutional Neural Networks
Very Deep Convolutional Networks for Large-Scale Image Recognition
ImageNet: A Large-Scale Hierarchical Image Database
Gradient-based learning applied to document recognition
You Only Look Once: Unified, Real-Time Object Detection
SSD: Single Shot MultiBox Detector
Rich feature hierarchies for accurate object detection and semantic segmentation
Image-to-Image Translation with Conditional Adversarial Networks
A Simple Framework for Contrastive Learning of Visual Representations
High-Resolution Image Synthesis with Latent Diffusion Models
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Towards Deep Learning Models Resistant to Adversarial Attacks
Denoising Diffusion Probabilistic Models
Aggregated Residual Transformations for Deep Neural Networks
Pyramid Scene Parsing Network
Places: A 10 Million Image Database for Scene Recognition
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space
Projects (27)
Ideogram
AI image generation platform known for reliable text rendering in images. Ideogram 2.0 sets the standard for typography accuracy in generated visuals.
by Mohammad Norouzi
Diffusers
Hugging Face library for state-of-the-art diffusion models. Provides pretrained pipelines for image, audio, and 3D generation with simple APIs.
by Sayak Paul
timm (PyTorch Image Models)
The definitive collection of state-of-the-art computer vision model implementations. 800+ pretrained models including EfficientNet, Vision Transformers, and ConvNeXt.
by Ross Wightman
Runway Gen-3
AI-powered creative suite for video generation and editing. Gen-3 Alpha produces photorealistic video from text and image prompts with fine temporal control.
by Cristóbal Valenzuela
Midjourney
The world's most popular AI image generation platform. Known for stunning artistic quality and used by millions of creators, designers, and artists.
by David Holz
Covariant Brain
Universal AI for robotic manipulation. The Covariant Brain enables robots to pick, place, and handle novel objects in real-world warehouse and logistics settings.
by Peter Chen
Pika
AI video generation platform that creates and edits cinematic videos from text, images, and existing clips. Known for intuitive creative controls.
by Demi Guo
Synthesia EXPRESSIVE-1
AI video generation platform creating realistic presenter-led videos from text. Used by 50K+ companies for training, marketing, and internal communications.
by Victor Riparbelli
Photoroom
AI photo editing app that instantly removes backgrounds, generates product images, and creates studio-quality visuals. Used by 150M+ people globally.
by Matthieu Rouif
Neurala Brain Builder
Edge AI platform for visual inspection and anomaly detection. Lifelong-learning neural networks that improve in production without retraining from scratch.
by Marc Pare
Lunit INSIGHT
AI-powered cancer detection in chest X-rays and mammography. FDA-cleared, CE-marked, deployed in 4000+ hospitals across 50+ countries.
by Hyun Kim
Allen AI (AI2)
Nonprofit AI research lab founded by Paul Allen. Builds open-source AI tools: Semantic Scholar, OLMo, AI2 THOR. Mission: AI for the common good.
by Ali Farhadi
Xnor.ai (acquired by Apple)
Ultra-efficient edge AI. Ran deep learning on $5 solar-powered chips using binary neural networks. Acquired by Apple for $200M in 2020.
by Rory Finnegan
Mighty AI (acquired by Uber)
Training data platform for autonomous vehicles. Computer vision annotation at scale for self-driving cars. Acquired by Uber ATG in 2019.
by Max Friedman
PyTorch
The most widely used open-source deep learning framework. Provides dynamic computation graphs, GPU acceleration, and a rich ecosystem for research and production.
by Soumith Chintala
Hugging Face Transformers
The definitive open-source library for state-of-the-art NLP, vision, and multimodal models. Provides 200K+ pretrained models and unified APIs.
by Thomas Wolf
Stable Diffusion / FLUX
Latent diffusion models for high-resolution image synthesis. Stable Diffusion democratized AI art; FLUX is the next-generation architecture from Black Forest Labs.
by Robin Rombach
Craiyon (DALL-E Mini)
Open-source text-to-image model that went viral in 2022. Demonstrated that AI image generation could be accessible and fun for millions of users.
by Boris Dayma
Hugging Face Spaces & Demos
Interactive ML art demos and creative AI tools on Hugging Face Spaces. Bridging the gap between generative AI research and creative communities.
by Apolinário Passos
LAION-5B Dataset
The largest openly available image-text dataset with 5.85 billion CLIP-filtered pairs. Enabled training of Stable Diffusion and open CLIP models.
by Ludwig Schmidt
Machine Learning Mastery
The largest practitioner-focused ML education platform with 1000+ tutorials and 20+ books covering deep learning, NLP, computer vision, and time series.
by Jason Brownlee
Scale AI Data Platform
Enterprise data labeling and AI infrastructure platform. Powers training data for OpenAI, Meta, and US DoD with human-in-the-loop annotation at scale.
by Alexandr Wang
Hugging Face Transformers
The most popular open-source library for state-of-the-art NLP, vision, and audio models. Hosts 500K+ models and 100K+ datasets. The GitHub of machine learning.
by Clem Delangue
fast.ai
Free deep learning courses and the fastai library. Made cutting-edge deep learning accessible to anyone who can code. Used by hundreds of thousands of students.
by Jeremy Howard
Stable Diffusion
Open-source text-to-image diffusion model that democratized AI image generation. Runs locally on consumer GPUs and powers thousands of apps.
by Emad Mostaque
openpilot
Open-source self-driving system by comma.ai. Runs on supported cars with a $200 device. 10,000+ users driving daily.
by George Hotz
MIT 6.S191 — Introduction to Deep Learning
MIT's official introductory course on deep learning. Open courseware with labs covering CNNs, RNNs, GANs, reinforcement learning, and more.
by Alexander Amini