1 code implementation • 11 Sep 2017 • Jiuxiang Gu, Jianfei Cai, Gang Wang, Tsuhan Chen
On the other hand, multi-stage image caption model is hard to train due to the vanishing gradient problem.
2 code implementations • ICCV 2017 • Jiuxiang Gu, Gang Wang, Jianfei Cai, Tsuhan Chen
Language Models based on recurrent neural networks have dominated recent image caption generation tasks.
no code implementations • CVPR 2016 • Yuka Kihara, Matvey Soloviev, Tsuhan Chen
We present a new algorithm for multi-region segmentation of 2D images with objects that may partially occlude each other.
no code implementations • 20 Jan 2016 • Amandianeze O. Nwana, Tsuhan Chen
Previous work has correctly identified that many of the tags that users provide on images are not visually relevant (i. e. representative of the salient content in the image) and they go on to treat such tags as noise, ignoring that the users chose to provide those tags over others that could have been more visually relevant.
no code implementations • 22 Dec 2015 • Jiuxiang Gu, Zhenhua Wang, Jason Kuen, Lianyang Ma, Amir Shahroudy, Bing Shuai, Ting Liu, Xingxing Wang, Li Wang, Gang Wang, Jianfei Cai, Tsuhan Chen
In the last few years, deep learning has led to very good performance on a variety of problems, such as visual recognition, speech recognition and natural language processing.
no code implementations • ICCV 2015 • Hang Chu, Dong Ki Kim, Tsuhan Chen
A human can easily find his or her way in an unfamiliar building, by walking around and reading the floor-plan.
no code implementations • 15 Nov 2015 • Dong Ki Kim, Tsuhan Chen
Autonomous indoor navigation of Micro Aerial Vehicles (MAVs) possesses many challenges.
no code implementations • CVPR 2015 • Kuan-Chuan Peng, Tsuhan Chen, Amir Sadovnik, Andrew C. Gallagher
First, we show through psychovisual studies that different people have different emotional reactions to the same image, which is a strong and novel departure from previous work that only records and predicts a single dominant emotion for each image.
no code implementations • CVPR 2013 • Zhaoyin Jia, Andrew Gallagher, Ashutosh Saxena, Tsuhan Chen
Our algorithm incorporates the intuition that a good 3D representation of the scene is the one that fits the data well, and is a stable, self-supporting (i. e., one that does not topple) arrangement of objects.
no code implementations • CVPR 2013 • Amir Sadovnik, Andrew Gallagher, Tsuhan Chen
However, this is not a trivial task.
no code implementations • CVPR 2013 • Adarsh Kowdle, Andrew Gallagher, Tsuhan Chen
We cast the problem of depth-layer segmentation as a discrete labeling problem on a spatiotemporal Markov Random Field (MRF) that uses the motion occlusion cues along with monocular cues and a smooth motion prior for the moving object.
no code implementations • NeurIPS 2011 • Cong-Cong Li, Ashutosh Saxena, Tsuhan Chen
For most scene understanding tasks (such as object detection or depth estimation), the classifiers need to consider contextual information in addition to the local features.
no code implementations • NeurIPS 2010 • Cong-Cong Li, Adarsh Kowdle, Ashutosh Saxena, Tsuhan Chen
In many machine learning domains (such as scene understanding), several related sub-tasks (such as scene categorization, depth estimation, object detection) operate on the same raw data and provide correlated outputs.