Satwik Kottur

Satwik is a Research Scientist with the Conversational AI Research (CAIR). His research interests lie at the intersection of computer vision and natural language, specifically multimodal AI agents that can interact with humans in natural language. Prior to joining Facebook, Satwik received his Ph.D. degree from Carnegie Mellon University, and an undergraduate degree from Indian Institute of Technology Bombay.

Satwik's Publications

June 05, 2019

RESEARCH

COMPUTER VISION

CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog

Visual Dialog is a multimodal task of answering a sequence of questions grounded in an image, using the conversation history as context. It entails challenges in vision, language, reasoning, and grounding. However, studying these subtasks in…

Satwik Kottur, José M.F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach,

June 05, 2019

September 14, 2018

RESEARCH

COMPUTER VISION

Visual Coreference Resolution in Visual Dialog using Neural Module Networks

Visual dialog entails answering a series of questions grounded in an image, using dialog history as context. In addition to the challenges found in visual question answering (VQA), which can be seen as one-round dialog, visual dialog…

Satwik Kottur, José M.F. Moura, Devi Parikh, Dhruv Batra, Marcus Rohrbach,

September 14, 2018

October 22, 2017

RESEARCH

NLP

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning

We introduce the first goal-driven training for visual question answering and dialog agents. Specifically, we pose a cooperative ‘image guessing’ game between two agents – Q-BOT and A-BOT– who communicate in natural language dialog so that…

Abhishek Das, Satwik Kottur, José M.F. Moura, Stefan Lee, Dhruv Batra,

October 22, 2017