no code implementations • EMNLP (Louhi) 2020 • Tarek Sakakini, Jong Yoon Lee, Aditya Duri, Renato F.L. Azevedo, Victor Sadauskas, Kuangxiao Gu, Suma Bhat, Dan Morrow, James Graumlich, Saqib Walayat, Mark Hasegawa-Johnson, Thomas Huang, Ann Willemsen-Dunlap, Donald Halpin
We also show the enhanced accuracy of our system over directly-supervised neural methods in this low-resource setting.
1 code implementation • ICLR 2021 • Wonkwang Lee, Whie Jung, Han Zhang, Ting Chen, Jing Yu Koh, Thomas Huang, Hyungsuk Yoon, Honglak Lee, Seunghoon Hong
Despite the recent advances in the literature, existing approaches are limited to moderately short-term prediction (less than a few seconds), while extrapolating it to a longer future quickly leads to destruction in structure and content.
no code implementations • 19 Aug 2020 • Md Yousuf Harun, M. Arifur Rahman, Joshua Mellinger, Willy Chang, Thomas Huang, Brienne Walker, Kristen Hori, Aaron T. Ohta
Automating human preimplantation embryo grading offers the potential for higher success rates with in vitro fertilization (IVF) by providing new quantitative and objective measures of embryo quality.
no code implementations • 19 Aug 2020 • Md Yousuf Harun, Thomas Huang, Aaron T. Ohta
Embryo quality assessment based on morphological attributes is important for achieving higher pregnancy rates from in vitro fertilization (IVF).
1 code implementation • 8 May 2020 • Abdelrahman Abdelhamed, Mahmoud Afifi, Radu Timofte, Michael S. Brown, Yue Cao, Zhilu Zhang, WangMeng Zuo, Xiaoling Zhang, Jiye Liu, Wendong Chen, Changyuan Wen, Meng Liu, Shuailin Lv, Yunchao Zhang, Zhihong Pan, Baopu Li, Teng Xi, Yanwen Fan, Xiyu Yu, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Songhyun Yu, Bumjun Park, Jechang Jeong, Shuai Liu, Ziyao Zong, Nan Nan, Chenghua Li, Zengli Yang, Long Bao, Shuangquan Wang, Dongwoon Bai, Jungwon Lee, Youngjung Kim, Kyeongha Rho, Changyeop Shin, Sungho Kim, Pengliang Tang, Yiyun Zhao, Yuqian Zhou, Yuchen Fan, Thomas Huang, Zhihao LI, Nisarg A. Shah, Wei Liu, Qiong Yan, Yuzhi Zhao, Marcin Możejko, Tomasz Latkowski, Lukasz Treszczotko, Michał Szafraniuk, Krzysztof Trojanowski, Yanhong Wu, Pablo Navarrete Michelini, Fengshuo Hu, Yunhua Lu, Sujin Kim, Wonjin Kim, Jaayeon Lee, Jang-Hwan Choi, Magauiya Zhussip, Azamat Khassenov, Jong Hyun Kim, Hwechul Cho, Priya Kansal, Sabari Nathan, Zhangyu Ye, Xiwen Lu, Yaqi Wu, Jiangxin Yang, Yanlong Cao, Siliang Tang, Yanpeng Cao, Matteo Maggioni, Ioannis Marras, Thomas Tanay, Gregory Slabaugh, Youliang Yan, Myungjoo Kang, Han-Soo Choi, Kyungmin Song, Shusong Xu, Xiaomu Lu, Tingniao Wang, Chunxia Lei, Bin Liu, Rajat Gupta, Vineet Kumar
This challenge is based on a newly collected validation and testing image datasets, and hence, named SIDD+.
1 code implementation • ECCV 2020 • Jiahui Yu, Pengchong Jin, Hanxiao Liu, Gabriel Bender, Pieter-Jan Kindermans, Mingxing Tan, Thomas Huang, Xiaodan Song, Ruoming Pang, Quoc Le
Without extra retraining or post-processing steps, we are able to train a single set of shared weights on ImageNet and use these weights to obtain child models whose sizes range from 200 to 1000 MFLOPs.
Ranked #30 on
Neural Architecture Search
on ImageNet
no code implementations • CVPR 2020 • Hanchao Yu, Shanhui Sun, Haichao Yu, Xiao Chen, Honghui Shi, Thomas Huang, Terrence Chen
In clinical deployment, however, they suffer dramatic performance drops due to mismatched distributions between training and testing datasets, commonly encountered in the clinical environment.
no code implementations • 21 Nov 2019 • Kuangxiao Gu, Yuqian Zhou, Thomas Huang
In this paper, We present a landmark driven two-stream network to generate faithful talking facial animation, in which more facial details are created, preserved and transferred from multiple source images instead of a single one.
no code implementations • 25 Sep 2019 • Jiahui Yu, Pengchong Jin, Hanxiao Liu, Gabriel Bender, Pieter-Jan Kindermans, Mingxing Tan, Thomas Huang, Xiaodan Song, Quoc Le
In this work, we propose BigNAS, an approach that simplifies this workflow and scales up neural architecture search to target a wide range of model sizes simultaneously.
2 code implementations • 20 Sep 2019 • Xiaofan Zhang, Haoming Lu, Cong Hao, Jiachen Li, Bowen Cheng, Yuhong Li, Kyle Rupnow, JinJun Xiong, Thomas Huang, Honghui Shi, Wen-mei Hwu, Deming Chen
Object detection and tracking are challenging tasks for resource-constrained embedded systems.
no code implementations • ICCV 2019 • Bowen Cheng, Liang-Chieh Chen, Yunchao Wei, Yukun Zhu, Zilong Huang, JinJun Xiong, Thomas Huang, Wen-mei Hwu, Honghui Shi
The multi-scale context module refers to the operations to aggregate feature responses from a large spatial extent, while the single-stage encoder-decoder structure encodes the high-level semantic information in the encoder path and recovers the boundary information in the decoder path.
1 code implementation • 25 Jun 2019 • Xiaofan Zhang, Cong Hao, Haoming Lu, Jiachen Li, Yuhong Li, Yuchen Fan, Kyle Rupnow, JinJun Xiong, Thomas Huang, Honghui Shi, Wen-mei Hwu, Deming Chen
Developing artificial intelligence (AI) at the edge is always challenging, since edge devices have limited computation capability and memory resources but need to meet demanding requirements, such as real-time processing, high throughput performance, and high inference accuracy.
no code implementations • 7 May 2019 • Bowen Cheng, Rong Xiao, Jian-Feng Wang, Thomas Huang, Lei Zhang
We present a novel high frequency residual learning framework, which leads to a highly efficient multi-scale network (MSNet) architecture for mobile and embedded vision problems.
no code implementations • CVPR 2019 • Zhiqiang Shen, Mingyang Huang, Jianping Shi, xiangyang xue, Thomas Huang
The proposed INIT exhibits three import advantages: (1) the instance-level objective loss can help learn a more accurate reconstruction and incorporate diverse attributes of objects; (2) the styles used for target domain of local/global areas are from corresponding spatial regions in source domain, which intuitively is a more reasonable mapping; (3) the joint training process can benefit both fine and coarse granularity and incorporates instance information to improve the quality of global translation.
2 code implementations • 6 Apr 2019 • Yuqian Zhou, Jianbo Jiao, Haibin Huang, Yang Wang, Jue Wang, Honghui Shi, Thomas Huang
In this paper, we propose a novel approach to boost the performance of a real image denoiser which is trained only with synthetic pixel-independent noise data dominated by AWGN.
Ranked #1 on
Denoising
on Darmstadt Noise Dataset
6 code implementations • ICLR 2020 • Jiahui Yu, Thomas Huang
Notably, by setting optimized channel numbers, our AutoSlim-MobileNet-v2 at 305M FLOPs achieves 74. 2% top-1 accuracy, 2. 4% better than default MobileNet-v2 (301M FLOPs), and even 0. 2% better than RL-searched MNasNet (317M FLOPs).
1 code implementation • ICCV 2019 • Jiahui Yu, Thomas Huang
We also evaluate the proposed US-Nets and improved training techniques on tasks of image super-resolution and deep reinforcement learning.
3 code implementations • ICLR 2019 • Jiahui Yu, Linjie Yang, Ning Xu, Jianchao Yang, Thomas Huang
Instead of training individual networks with different width configurations, we train a shared network with switchable batch normalization.
1 code implementation • ICCV 2019 • Yang Fu, Yunchao Wei, Guanshuo Wang, Yuqian Zhou, Honghui Shi, Thomas Huang
Upon our SSG, we further introduce a clustering-guided semisupervised approach named SSG ++ to conduct the one-shot domain adaption in an open set setting (i. e. the number of independent identities from the target domain is unknown).
no code implementations • 9 Nov 2018 • Yang Fu, Xiaoyang Wang, Yunchao Wei, Thomas Huang
Thus, a more robust clip-level feature representation can be generated according to a weighted sum operation guided by the mined 2-D attention score matrix.
Large-Scale Person Re-Identification
Video-Based Person Re-Identification
no code implementations • 6 Nov 2018 • Rui Qian, Yunchao Wei, Honghui Shi, Jiachen Li, Jiaying Liu, Thomas Huang
Semantic scene parsing is suffering from the fact that pixel-level annotations are hard to be collected.
1 code implementation • 22 Oct 2018 • Xiaolin Zhang, Yunchao Wei, Yi Yang, Thomas Huang
In this way, the possibilities embedded in the produced similarity maps can be adapted to guide the process of segmenting objects.
Ranked #77 on
Few-Shot Semantic Segmentation
on PASCAL-5i (5-Shot)
3 code implementations • 5 Oct 2018 • Bowen Cheng, Yunchao Wei, Rogerio Feris, JinJun Xiong, Wen-mei Hwu, Thomas Huang, Humphrey Shi
In particular, DCR places a separate classification network in parallel with the localization network (base detector).
2 code implementations • 17 Sep 2018 • Tao Ruan, Ting Liu, Zilong Huang, Yunchao Wei, Shikui Wei, Yao Zhao, Thomas Huang
Human parsing has received considerable interest due to its wide application potentials.
Ranked #2 on
Person Re-Identification
on Market-1501-C
4 code implementations • ECCV 2018 • Ning Xu, Linjie Yang, Yuchen Fan, Jianchao Yang, Dingcheng Yue, Yuchen Liang, Brian Price, Scott Cohen, Thomas Huang
End-to-end sequential learning to explore spatial-temporal features for video segmentation is largely limited by the scale of available video segmentation datasets, i. e., even the largest video segmentation dataset only contains 90 short video clips.
Ranked #16 on
Video Object Segmentation
on YouTube-VOS 2018
(F-Measure (Unseen) metric)
10 code implementations • 27 Aug 2018 • Jiahui Yu, Yuchen Fan, Jianchao Yang, Ning Xu, Zhaowen Wang, Xinchao Wang, Thomas Huang
Keras-based implementation of WDSR, EDSR and SRGAN for single image super-resolution
Ranked #4 on
Multi-Frame Super-Resolution
on PROBA-V
1 code implementation • NeurIPS 2018 • Seunghoon Hong, Xinchen Yan, Thomas Huang, Honglak Lee
In this work, we present a novel hierarchical framework for semantic image manipulation.
1 code implementation • ECCV 2018 • Xiaolin Zhang, Yunchao Wei, Guoliang Kang, Yi Yang, Thomas Huang
A stagewise approach is proposed to incorporate high confident object regions to learn the SPG masks.
Ranked #1 on
Weakly-Supervised Object Localization
on ILSVRC 2016
no code implementations • ECCV 2018 • Yunchao Wei, Zhiqiang Shen, Bowen Cheng, Honghui Shi, JinJun Xiong, Jiashi Feng, Thomas Huang
This work provides a simple approach to discover tight object bounding boxes with only image-level supervision, called Tight box mining with Surrounding Segmentation Context (TS2C).
30 code implementations • ICCV 2019 • Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, Thomas Huang
We present a generative image inpainting system to complete images with free-form mask and guidance.
Ranked #3 on
Image Inpainting
on Places2 val
no code implementations • 19 Apr 2018 • Yuqian Zhou, Ding Liu, Thomas Huang
However, previous proposed models are mostly trained and tested on good-quality images which are not always the case for practical applications like surveillance systems.
1 code implementation • 19 Apr 2018 • Yuqian Zhou, Kuangxiao Gu, Thomas Huang
The newly proposed RepGAN is tested on MNIST, fashionMNIST, CelebA, and SVHN datasets to perform unsupervised classification, generation and reconstruction tasks.
2 code implementations • CVPR 2018 • Xiaolin Zhang, Yunchao Wei, Jiashi Feng, Yi Yang, Thomas Huang
With such an adversarial learning, the two parallel-classifiers are forced to leverage complementary object regions for classification and can finally generate integral object localization together.
Ranked #2 on
Weakly-Supervised Object Localization
on ILSVRC 2016
General Classification
Weakly-Supervised Object Localization
1 code implementation • 14 Apr 2018 • Yang Fu, Yunchao Wei, Yuqian Zhou, Honghui Shi, Gao Huang, Xinchao Wang, Zhiqiang Yao, Thomas Huang
Despite the remarkable recent progress, person re-identification (Re-ID) approaches are still suffering from the failure cases where the discriminative body parts are missing.
Ranked #53 on
Person Re-Identification
on DukeMTMC-reID
3 code implementations • ECCV 2018 • Bowen Cheng, Yunchao Wei, Honghui Shi, Rogerio Feris, JinJun Xiong, Thomas Huang
Recent region-based object detectors are usually built with separate classification and localization branches on top of shared feature extraction networks.
2 code implementations • 4 Dec 2017 • Zhiqiang Shen, Honghui Shi, Jiahui Yu, Hai Phan, Rogerio Feris, Liangliang Cao, Ding Liu, Xinchao Wang, Thomas Huang, Marios Savvides
In this paper, we present a simple and parameter-efficient drop-in module for one-stage object detectors like SSD when learning from scratch (i. e., without pre-trained models).
no code implementations • ICCV 2017 • Ding Liu, Zhaowen Wang, Yuchen Fan, Xian-Ming Liu, Zhangyang Wang, Shiyu Chang, Thomas Huang
Second, we reduce the complexity of motion between neighboring frames using a spatial alignment network that is much more robust and efficient than competing alignment methods and can be jointly trained with the temporal adaptive network in an end-to-end manner.
no code implementations • 2 Jul 2017 • Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas Huang
In this paper, we propose a novel segmentation approach that uses a rectangle as a soft constraint by transforming it into an Euclidean distance map.
no code implementations • 22 Mar 2017 • Guo-Jun Qi, Wei Liu, Charu Aggarwal, Thomas Huang
One of our goals in this paper is to develop a model for revealing the functional relationships between text and image features as to directly transfer intermodal and intramodal labels to annotate the images.
7 code implementations • CVPR 2017 • Ning Xu, Brian Price, Scott Cohen, Thomas Huang
We evaluate our algorithm on the image matting benchmark, our testing set, and a wide variety of real images.
no code implementations • 3 Jan 2017 • Ding Liu, Zhaowen Wang, Nasser Nasrabadi, Thomas Huang
This paper proposes the method of learning a mixture of SR inference modules in a unified framework to tackle this problem.
no code implementations • 4 Aug 2016 • Jiahui Yu, Yuning Jiang, Zhangyang Wang, Zhimin Cao, Thomas Huang
In present object detection systems, the deep convolutional neural networks (CNNs) are utilized to predict bounding boxes of object candidates, and have gained performance advantages over the traditional region proposal methods.
3 code implementations • CVPR 2016 • Ning Xu, Brian Price, Scott Cohen, Jimei Yang, Thomas Huang
Interactive object selection is a very important research problem and has many applications.
Ranked #11 on
Interactive Segmentation
on SBD
no code implementations • ICCV 2015 • Jiangping Wang, Kai Ma, Vivek Kumar Singh, Thomas Huang, Terrence Chen
3D human body shape matching has large potential on many real world applications, especially with the recent advances in the 3D range sensing technology.
no code implementations • ICCV 2015 • Zhaowen Wang, Ding Liu, Jianchao Yang, Wei Han, Thomas Huang
We show that a sparse coding model particularly designed for super-resolution can be incarnated as a neural network, and trained in a cascaded structure from end to end.
no code implementations • 24 Apr 2014 • Zhaowen Wang, Jianchao Yang, Zhe Lin, Jonathan Brandt, Shiyu Chang, Thomas Huang
In this paper, we present an image similarity learning method that can scale well in both the number of images and the dimensionality of image descriptors.
no code implementations • 21 Dec 2013 • Thomas Paine, Hailin Jin, Jianchao Yang, Zhe Lin, Thomas Huang
The ability to train large-scale neural networks has resulted in state-of-the-art performance in many areas of computer vision.