Director of Research (if dissertation) or Advisor (if thesis)
Doctoral Committee Chair(s)
Department of Study
Degree Granting Institution
University of Illinois at Urbana-Champaign
Visual question answering (VQA)
Fine-grained image recognition
Knowing where to look in an image can significantly improve performance in computer vision tasks by eliminating irrelevant information from the rest of the input image, and by breaking down complex scenes into simpler and more familiar sub-components. We show that a framework for identifying multiple task-relevant regions can be learned in current state-of-the-art deep network architectures, resulting in significant gains in several visual prediction tasks. We will demonstrate both directly and indirectly supervised models for selecting image regions and show how they can improve performance over baselines by means of focusing on the right areas.