Computer Vision

46 researchers · 27 papers · 27 projects · 27 builders

Researchers (46)

Kaiming He

MIT / Google DeepMind

750,000 citations · h-index 67

Ross Girshick

Anthropic (via Vercept)

673,758 citations · h-index 62

Geoffrey Hinton

University of Toronto

444,501 citations · h-index 137

Alec Radford

OpenAI

346,122 citations · h-index 24

Andrew Zisserman

University of Oxford

310,000 citations · h-index 142

Alex Krizhevsky

Independent

264,618 citations · h-index 14

Yann LeCun

Meta AI / NYU

245,920 citations · h-index 118

Fei-Fei Li

Stanford University

216,976 citations · h-index 137

Jitendra Malik

UC Berkeley

196,000 citations · h-index 134

Trevor Darrell

UC Berkeley

175,000 citations · h-index 120

Karen Simonyan

DeepMind

175,000 citations · h-index 36

Jian Sun

Megvii / Face++

148,000 citations · h-index 75

Samy Bengio

Apple AI/ML

112,069 citations · h-index 72

Cordelia Schmid

INRIA / Google Research

98,000 citations · h-index 105

Phillip Isola

MIT CSAIL

95,000 citations · h-index 52

Xiaogang Wang

Chinese University of Hong Kong

95,000 citations · h-index 82

Rob Fergus

Meta AI / NYU

91,000 citations · h-index 80

Antonio Torralba

MIT CSAIL

85,000 citations · h-index 90

Liang-Chieh Chen

Google DeepMind

82,000 citations · h-index 45

Ming-Hsuan Yang

UC Merced / Google Research

78,000 citations · h-index 85

Hugo Larochelle

Google DeepMind / Mila

72,000 citations · h-index 68

Philip Torr

University of Oxford

68,000 citations · h-index 88

Bolei Zhou

UCLA

62,000 citations · h-index 55

Saining Xie

New York University

62,000 citations · h-index 38

Marc'Aurelio Ranzato

Meta AI

62,000 citations · h-index 60

Andrej Karpathy

Independent

58,419 citations · h-index 20

Song Han

MIT

58,000 citations · h-index 62

Ali Farhadi

University of Washington / Allen AI

58,000 citations · h-index 72

Wei Liu

Tencent AI Lab

55,000 citations · h-index 60

Aleksander Madry

MIT CSAIL

55,000 citations · h-index 65

Vladlen Koltun

Apple

55,000 citations · h-index 72

Raquel Urtasun

Waabi / University of Toronto

52,000 citations · h-index 78

Abhinav Gupta

Carnegie Mellon University

45,000 citations · h-index 72

Zhouchen Lin

Peking University

42,000 citations · h-index 68

Silvio Savarese

Salesforce AI Research

42,000 citations · h-index 72

Pushmeet Kohli

Google DeepMind

41,000 citations · h-index 72

Lior Wolf

Tel Aviv University

38,000 citations · h-index 65

Li Jia Li

Google Cloud AI

36,000 citations · h-index 55

Hengshuang Zhao

University of Hong Kong

35,000 citations · h-index 38

Tinne Tuytelaars

KU Leuven

32,000 citations · h-index 62

Ivan Laptev

INRIA Paris / MBZUAI

30,000 citations · h-index 55

Priya Goyal

Meta AI

28,000 citations · h-index 22

Sanja Fidler

University of Toronto / NVIDIA

28,000 citations · h-index 58

Jiajun Wu

Stanford University

16,000 citations · h-index 42

Timnit Gebru

DAIR Institute

15,864 citations · h-index 28

Hao Li

Pinscreen / UC Berkeley

12,000 citations · h-index 38

Papers (27)

Deep Residual Learning for Image Recognition

CVPR 20162015220,000 citations

Deep Residual Learning for Image Recognition

CVPR 20162016185,000 citations

ImageNet Classification with Deep Convolutional Neural Networks

NeurIPS 20122012130,000 citations

Very Deep Convolutional Networks for Large-Scale Image Recognition

ICLR 2015201592,000 citations

ImageNet: A Large-Scale Hierarchical Image Database

CVPR 2009200960,625 citations

Gradient-based learning applied to document recognition

Proceedings of the IEEE199857,014 citations

You Only Look Once: Unified, Real-Time Object Detection

CVPR 2016201635,000 citations

SSD: Single Shot MultiBox Detector

ECCV 2016201628,000 citations

Rich feature hierarchies for accurate object detection and semantic segmentation

CVPR 2014201425,000 citations

Image-to-Image Translation with Conditional Adversarial Networks

CVPR 2017201718,000 citations

A Simple Framework for Contrastive Learning of Visual Representations

ICML 2020202018,000 citations

High-Resolution Image Synthesis with Latent Diffusion Models

CVPR 2022202214,000 citations

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

ICLR 2015201514,000 citations

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

IEEE TPAMI201714,000 citations

Towards Deep Learning Models Resistant to Adversarial Attacks

ICLR 2018201812,000 citations

Denoising Diffusion Probabilistic Models

NeurIPS 2020202012,000 citations

Aggregated Residual Transformations for Deep Neural Networks

CVPR 2017201710,500 citations

Pyramid Scene Parsing Network

CVPR 201720179,500 citations

Places: A 10 Million Image Database for Scene Recognition

IEEE TPAMI20178,500 citations

PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space

NeurIPS 201720178,500 citations

Projects (27)

Ideogram

AI image generation platform known for reliable text rendering in images. Ideogram 2.0 sets the standard for typography accuracy in generated visuals.

by Mohammad Norouzi

Diffusers

Hugging Face library for state-of-the-art diffusion models. Provides pretrained pipelines for image, audio, and 3D generation with simple APIs.

by Sayak Paul

timm (PyTorch Image Models)

The definitive collection of state-of-the-art computer vision model implementations. 800+ pretrained models including EfficientNet, Vision Transformers, and ConvNeXt.

by Ross Wightman

Runway Gen-3

AI-powered creative suite for video generation and editing. Gen-3 Alpha produces photorealistic video from text and image prompts with fine temporal control.

by Cristóbal Valenzuela

Midjourney

The world's most popular AI image generation platform. Known for stunning artistic quality and used by millions of creators, designers, and artists.

by David Holz

Covariant Brain

Universal AI for robotic manipulation. The Covariant Brain enables robots to pick, place, and handle novel objects in real-world warehouse and logistics settings.

by Peter Chen

Pika

AI video generation platform that creates and edits cinematic videos from text, images, and existing clips. Known for intuitive creative controls.

by Demi Guo

Synthesia EXPRESSIVE-1

AI video generation platform creating realistic presenter-led videos from text. Used by 50K+ companies for training, marketing, and internal communications.

by Victor Riparbelli

Photoroom

AI photo editing app that instantly removes backgrounds, generates product images, and creates studio-quality visuals. Used by 150M+ people globally.

by Matthieu Rouif

Neurala Brain Builder

Edge AI platform for visual inspection and anomaly detection. Lifelong-learning neural networks that improve in production without retraining from scratch.

by Marc Pare

Lunit INSIGHT

AI-powered cancer detection in chest X-rays and mammography. FDA-cleared, CE-marked, deployed in 4000+ hospitals across 50+ countries.

by Hyun Kim

Allen AI (AI2)

Nonprofit AI research lab founded by Paul Allen. Builds open-source AI tools: Semantic Scholar, OLMo, AI2 THOR. Mission: AI for the common good.

by Ali Farhadi

Xnor.ai (acquired by Apple)

Ultra-efficient edge AI. Ran deep learning on $5 solar-powered chips using binary neural networks. Acquired by Apple for $200M in 2020.

by Rory Finnegan

Mighty AI (acquired by Uber)

Training data platform for autonomous vehicles. Computer vision annotation at scale for self-driving cars. Acquired by Uber ATG in 2019.

by Max Friedman

PyTorch

The most widely used open-source deep learning framework. Provides dynamic computation graphs, GPU acceleration, and a rich ecosystem for research and production.

by Soumith Chintala

Hugging Face Transformers

The definitive open-source library for state-of-the-art NLP, vision, and multimodal models. Provides 200K+ pretrained models and unified APIs.

by Thomas Wolf

Stable Diffusion / FLUX

Latent diffusion models for high-resolution image synthesis. Stable Diffusion democratized AI art; FLUX is the next-generation architecture from Black Forest Labs.

by Robin Rombach

Craiyon (DALL-E Mini)

Open-source text-to-image model that went viral in 2022. Demonstrated that AI image generation could be accessible and fun for millions of users.

by Boris Dayma

Hugging Face Spaces & Demos

Interactive ML art demos and creative AI tools on Hugging Face Spaces. Bridging the gap between generative AI research and creative communities.

by Apolinário Passos

LAION-5B Dataset

The largest openly available image-text dataset with 5.85 billion CLIP-filtered pairs. Enabled training of Stable Diffusion and open CLIP models.

by Ludwig Schmidt

Machine Learning Mastery

The largest practitioner-focused ML education platform with 1000+ tutorials and 20+ books covering deep learning, NLP, computer vision, and time series.

by Jason Brownlee

Scale AI Data Platform

Enterprise data labeling and AI infrastructure platform. Powers training data for OpenAI, Meta, and US DoD with human-in-the-loop annotation at scale.

by Alexandr Wang

Hugging Face Transformers

The most popular open-source library for state-of-the-art NLP, vision, and audio models. Hosts 500K+ models and 100K+ datasets. The GitHub of machine learning.

by Clem Delangue

fast.ai

Free deep learning courses and the fastai library. Made cutting-edge deep learning accessible to anyone who can code. Used by hundreds of thousands of students.

by Jeremy Howard

Stable Diffusion

Open-source text-to-image diffusion model that democratized AI image generation. Runs locally on consumer GPUs and powers thousands of apps.

by Emad Mostaque

openpilot

Open-source self-driving system by comma.ai. Runs on supported cars with a $200 device. 10,000+ users driving daily.

by George Hotz

MIT 6.S191 — Introduction to Deep Learning

MIT's official introductory course on deep learning. Open courseware with labs covering CNNs, RNNs, GANs, reinforcement learning, and more.

by Alexander Amini