Multimodal

8 researchers · 3 papers · 5 projects · 5 builders

Researchers (8)

Alec Radford

OpenAI

346,122 citations · h-index 24

Trevor Darrell

UC Berkeley

175,000 citations · h-index 120

Cordelia Schmid

INRIA / Google Research

98,000 citations · h-index 105

Antonio Torralba

MIT CSAIL

85,000 citations · h-index 90

Ali Farhadi

University of Washington / Allen AI

58,000 citations · h-index 72

Xiaodong He

JD AI Research

35,000 citations · h-index 58

Douwe Kiela

Contextual AI

24,000 citations · h-index 45

William Yang Wang

UC Santa Barbara

21,000 citations · h-index 52

Papers (3)

GPT-4 Technical Report

arXiv preprint20238,500 citations

Learning Transferable Visual Models From Natural Language Supervision

ICML 202120215,296 citations

Hierarchical Text-Conditional Image Generation with CLIP Latents

arXiv preprint20225,200 citations

Projects (5)

Qwen

Alibaba's family of large language and multimodal models. Qwen2 series achieves top performance among open-weight models across benchmarks.

by Junyang Lin

Reka AI Models

Multimodal AI models from Reka AI. Reka Core, Flash, and Edge deliver frontier performance across text, image, video, and audio understanding.

by Yi Tay

LAION-5B Dataset

The largest openly available image-text dataset with 5.85 billion CLIP-filtered pairs. Enabled training of Stable Diffusion and open CLIP models.

by Ludwig Schmidt

OpenAI API Platform

Built and scaled OpenAI's API platform serving GPT-4, DALL-E, and Whisper to millions of developers. Architected the product powering ChatGPT.

by Peter Welinder

Thinking Machines Lab

AI research startup founded after departing OpenAI as CTO. Led the technical development of ChatGPT, GPT-4, and DALL-E at OpenAI.

by Mira Murati