Satwik Kottur

Satwik is a Research Scientist with the Conversational AI Research (CAIR). His research interests lie at the intersection of computer vision and natural language, specifically multimodal AI agents that can interact with humans in natural language. Prior to joining Facebook, Satwik received his Ph.D. degree from Carnegie Mellon University, and an undergraduate degree from Indian Institute of Technology Bombay.

Satwik's Publications

December 23, 2020

RESEARCH

COMPUTER VISION

CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog

Visual Dialog is a multimodal task of answering a sequence of questions grounded in an image, using the conversation history as context. It entails challenges in vision, language, reasoning, and grounding. However, studying these subtasks in…

Satwik Kottur, José M.F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach,

December 23, 2020

December 23, 2020

RESEARCH

COMPUTER VISION

Visual Coreference Resolution in Visual Dialog using Neural Module Networks

Visual dialog entails answering a series of questions grounded in an image, using dialog history as context. In addition to the challenges found in visual question answering (VQA), which can be seen as one-round dialog, visual dialog…

Satwik Kottur, José M.F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach,

December 23, 2020

December 23, 2020

RESEARCH

NLP

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning

We introduce the first goal-driven training for visual question answering and dialog agents. Specifically, we pose a cooperative ‘image guessing’ game between two agents – Q-BOT and A-BOT– who communicate in natural language dialog so that…

Abhishek Das, Satwik Kottur, José M.F. Moura, Stefan Lee, Dhruv Batra,

December 23, 2020