Minds vs. Machines: How far are we from the common sense of a toddler?

CVPR 2020 Workshop, June 15, Seattle, WA

The following papers were accepted for publication in the workshop proceedings.

Learning to Learn Words from Visual Scenes

Dídac Surís, Dave Epstein, Heng Ji, Shih-Fu Chang, and Carl Vondrick

Oops! Predicting Unintentional Action in Video

Dave Epstein, Boyuan Chen, and Carl Vondrick

Learning Intuitive Physics by Explaining Surprise

Hung Nguyen, Jay S Patravali, Fuxin Li, and Alan Fern

Story Completion with Explicit Modeling of Commonsense Knowledge

Mingda Zhang, Keren Ye, Rebecca Hwa, and Adriana Kovashka

Visual Commonsense Representation Learning via Causal Inference

Tan Wang, Jianqiang Huang, Hanwang Zhang, and Qianru Sun

SomethingFinder: Localizing undefined regions using referring expressions

Sungmin Eum, David Han, and Gordon Briggs

Response Time Analysis for Explainability of Visual Processing in CNNs

Eric Taylor, Shashank Shekhar, and Graham Taylor

Hierarchical Color Learning in Convolutional Neural Networks

Chris Hickey and Byoung-Tak Zhang

Understanding Knowledge Gaps in Visual Question Answering: Implications for Gap Identification and Testing

Goonmeet Bajaj, Bortik Bandyopadhyay, Daniel Schmidt, Pranav Maneriker, Christopher Myers, and Srinivasan Parthasarathy

3DQ-Nets: Visual Concepts Emerge in Pose Equivariant 3D Quantized Neural Scene Representations

Mihir Prabhudesai, Shamit Lal, Hsiao-Yu Tung, Adam Harley, Shubankar Potdar, and Katerina Fragkiadaki