Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline (Poster)
By Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das
Watch a video recap.
Shonan Rotation Averaging: Global Optimality by Surfing SO(p)^n (SPOTLIGHT)
By Frank Dellaert, David Rosen, Jing Wu, Robert Mahony, Luca Carlone
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web (Spotlight)
By Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra
Watch a video recap.
Spatially Aware Multimodal Transformers for TextVQA
By Yash Kant, Dhruv Batra, Peter Anderson, Alex Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal
Watch a video recap.
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
By Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh
FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning (Poster)
By Chia-Wen Kuo, Chih-Yao Ma, Jia-Bin Huang, Zsolt Kira
Watch a video recap.
Learning to Generate Grounded Visual Captions without Localization Supervision (Poster)
Chih-Yao Ma, Yannis Kalantidis, Ghassan AlRegib, Peter Vajda, Marcus Rohrbach, Zsolt Kira
Watch a video recap.
Forecasting Human Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Vision (Oral)
By Miao Liu, Siyu Tang, Yin Li and James M. Rehg
Watch a video recap.
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments (Poster)
By Jacob Krantz, Erik Wijmans, Arjun Majumdar, Dhruv Batra, Stefan Lee
ContactPose: A Dataset of Grasps with Object Contact and Hand Pose (poster)
By Samarth Brahmbhatt, Chengcheng Tang, Christopher D. Twigg, Charles C. Kemp, and James Hays
Watch a video recap.
Learning to Balance Specificity and Invariance for In and Out of Domain Generalization (Poster)
By Prithvijit Chattopadyay, Yogesh Balaji, Judy Hoffman
Watch a video recap.
A General Toolbox for Understanding Errors in Object Detection (Spotlight)
By Daniel Bolya, Sean Foley, James Hays, Judy Hoffman
Watch a video recap.
Graph Inference for Knowledge Transfer in Weakly Supervised Object Localization (poster)
By Amir Rahimi, Amirreza Shaban, Thalaiyasingam Ajanthan, Richard Hartle, and Byron Boots
Watch a video recap.
Neural Design Network: Graphic Layout Generation with Constraints (Spotlight)
By Hsin-Ying Lee, Lu Jiang, Irfan Essa, Phuong B Le, Haifeng Gong, Ming-Hsuan Yang, Weilong Yan
Backpropagated Gradient Representations for Anomaly Detection (Poster)
By Gukyeong Kwon, Mohit Prabhushankar, Dogancan Temel, and Ghassan AlRegib
Watch a video recap.