Multimodal
8 researchers · 3 papers · 5 projects · 5 builders
Researchers (8)
Alec Radford
OpenAI
346,122 citations · h-index 24
Trevor Darrell
UC Berkeley
175,000 citations · h-index 120
Cordelia Schmid
INRIA / Google Research
98,000 citations · h-index 105
Antonio Torralba
MIT CSAIL
85,000 citations · h-index 90
Ali Farhadi
University of Washington / Allen AI
58,000 citations · h-index 72
Xiaodong He
JD AI Research
35,000 citations · h-index 58
Douwe Kiela
Contextual AI
24,000 citations · h-index 45
William Yang Wang
UC Santa Barbara
21,000 citations · h-index 52
Papers (3)
Projects (5)
Qwen
Alibaba's family of large language and multimodal models. Qwen2 series achieves top performance among open-weight models across benchmarks.
by Junyang Lin
Reka AI Models
Multimodal AI models from Reka AI. Reka Core, Flash, and Edge deliver frontier performance across text, image, video, and audio understanding.
by Yi Tay
LAION-5B Dataset
The largest openly available image-text dataset with 5.85 billion CLIP-filtered pairs. Enabled training of Stable Diffusion and open CLIP models.
by Ludwig Schmidt
OpenAI API Platform
Built and scaled OpenAI's API platform serving GPT-4, DALL-E, and Whisper to millions of developers. Architected the product powering ChatGPT.
by Peter Welinder
Thinking Machines Lab
AI research startup founded after departing OpenAI as CTO. Led the technical development of ChatGPT, GPT-4, and DALL-E at OpenAI.
by Mira Murati