ALOHA: from Attention to Likes -- a unified mOdel for understanding HumAn responses to diverse visual content

Peizhao Li, Junfeng He, Gang Li, Rachit Bhargava, Shaolei Shen, Nachiappan Valliappan, Youwei Liang, Hongxiang Gu, Venky Ramachandran, Golnaz Farhadi, Yang Li, Kai J Kohlhoff, Vidhya Navalpakkam

Progress in human behavior modeling involves understanding both implicit, early-stage perceptual behavior, such as human attention, and explicit, later-stage behavior, such as subjective preferences or likes.

From Thumbnails to Summaries - A single Deep Neural Network to Rule Them All

Hongxiang Gu, Viswanathan Swaminathan

The encoder selects a subset from the input video while the decoder seeks to reconstruct the video from the selection.
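
The select-then-reconstruct idea in the sentence above can be illustrated with a toy sketch. It is not the authors' network. It assumes each frame is a small feature vector, replaces the learned encoder with a greedy search, and replaces the learned decoder with a copy of the temporally nearest selected frame. All names here (`encode`, `decode`, `reconstruction_error`) are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence

Frame = Sequence[float]  # assumption: one feature vector per frame


def squared_distance(a: Frame, b: Frame) -> float:
    """Squared Euclidean distance between two frame feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def decode(selection: Dict[int, Frame], num_frames: int) -> List[Frame]:
    """Toy decoder: rebuild every time step from the selected frames only,
    by copying the selected frame that is nearest in time."""
    indices = sorted(selection)
    return [
        selection[min(indices, key=lambda i: abs(i - t))]
        for t in range(num_frames)
    ]


def reconstruction_error(video: List[Frame], selection: Dict[int, Frame]) -> float:
    """Total squared error between the video and its reconstruction."""
    rebuilt = decode(selection, len(video))
    return sum(squared_distance(f, r) for f, r in zip(video, rebuilt))


def encode(video: List[Frame], k: int) -> Dict[int, Frame]:
    """Toy encoder: greedily pick up to k frames, each time adding the frame
    that gives the lowest reconstruction error (a stand-in for a learned selector)."""
    chosen: Dict[int, Frame] = {}
    for _ in range(min(k, len(video))):
        candidates = [t for t in range(len(video)) if t not in chosen]
        best = min(
            candidates,
            key=lambda t: reconstruction_error(video, {**chosen, t: video[t]}),
        )
        chosen[best] = video[best]
    return chosen


# Usage: a six-frame "video" with two visually distinct segments.
video = [[0.0], [0.0], [0.0], [10.0], [10.0], [10.0]]
selection = encode(video, k=2)
print(sorted(selection))                       # indices of the selected frames
print(reconstruction_error(video, selection))  # error of the reconstruction
```

In this toy setting, a selection that covers both segments reconstructs the video exactly, which is the intuition behind using reconstruction quality as the signal for what to select.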

