Search Results for author: Wei Liu

Found 470 papers, 185 papers with code

Generalized Canonical Correlation Analysis and Its Application to Blind Source Separation Based on a Dual-Linear Predictor Structure

no code implementations9 Mar 2014 Wei Liu

Blind source separation (BSS) is one of the most important and established research topics in signal processing and many algorithms have been proposed based on different statistical properties of the source signals.

blind source separation

Image Fusion with Local Spectral Consistency and Dynamic Gradient Sparsity

no code implementations CVPR 2014 Chen Chen, Yeqing Li, Wei Liu, Junzhou Huang

In this paper, we propose a novel method for image fusion from a high resolution panchromatic image and a low resolution multispectral image at the same geographical location.

Going Deeper with Convolutions

79 code implementations CVPR 2015 Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich

We propose a deep convolutional neural network architecture codenamed "Inception", which was responsible for setting the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC 2014).

General Classification Image Classification +2

Multiple Object Tracking: A Literature Review

no code implementations26 Sep 2014 Wenhan Luo, Junliang Xing, Anton Milan, Xiaoqin Zhang, Wei Liu, Tae-Kyun Kim

We inspect the recent advances in various aspects and propose some interesting directions for future research.

Multiple Object Tracking Object

Learning to Rank Binary Codes

no code implementations21 Oct 2014 Jie Feng, Wei Liu, Yan Wang

Binary codes have been widely used in vision problems as a compact feature representation to achieve both space and time advantages.

Binarization Image Retrieval +2

SIRF: Simultaneous Image Registration and Fusion in A Unified Framework

no code implementations18 Nov 2014 Chen Chen, Yeqing Li, Wei Liu, Junzhou Huang

In this paper, we propose a novel method for image fusion with a high-resolution panchromatic image and a low-resolution multispectral image at the same geographical location.

Image Registration

Discrete Graph Hashing

no code implementations NeurIPS 2014 Wei Liu, Cun Mu, Sanjiv Kumar, Shih-Fu Chang

Hashing has emerged as a popular technique for fast nearest neighbor search in gigantic databases.

Zeta Hull Pursuits: Learning Nonconvex Data Hulls

no code implementations NeurIPS 2014 Yuanjun Xiong, Wei Liu, Deli Zhao, Xiaoou Tang

Selecting a small informative subset from a given dataset, also called column sampling, has drawn much attention in machine learning.

Image Classification

A new hybrid metric for verifying parallel corpora of Arabic-English

no code implementations12 Feb 2015 Saad Alkahtani, Wei Liu, William J. Teahan

This paper discusses a new metric that has been applied to verify the quality in translation between sentence pairs in parallel corpora of Arabic-English.

Sentence Translation

Saliency Propagation From Simple to Difficult

no code implementations CVPR 2015 Chen Gong, DaCheng Tao, Wei Liu, Stephen J. Maybank, Meng Fang, Keren Fu, Jie Yang

In the teaching-to-learn step, a teacher is designed to arrange the regions from simple to difficult and then assign the simplest regions to the learner.

Saliency Detection

Understanding Image Structure via Hierarchical Shape Parsing

no code implementations CVPR 2015 Xian-Ming Liu, Rongrong Ji, Changhu Wang, Wei Liu, Bineng Zhong, Thomas S. Huang

A hierarchical shape parsing strategy is proposed to partition and organize image components into a hierarchical structure in the scale space.

Towards 3D Object Detection With Bimodal Deep Boltzmann Machines Over RGBD Imagery

no code implementations CVPR 2015 Wei Liu, Rongrong Ji, Shaozi Li

In particular, we slide a 3D detection window in the 3D point cloud to match the exemplar shape, which the lack of training data in 3D domain is conquered via (1) We collect 3D CAD models and 2D positive samples from Internet.

3D Object Detection object-detection

Discrete Hyper-Graph Matching

no code implementations CVPR 2015 Junchi Yan, Chao Zhang, Hongyuan Zha, Wei Liu, Xiaokang Yang, Stephen M. Chu

Evaluations on both synthetic and real-world data corroborate the efficiency of our method.

Graph Matching

ParseNet: Looking Wider to See Better

4 code implementations15 Jun 2015 Wei Liu, Andrew Rabinovich, Alexander C. Berg

When we add our proposed global feature, and a technique for learning normalization parameters, accuracy increases consistently even over our improved versions of the baselines.

Segmentation Semantic Segmentation

Robust High Quality Image Guided Depth Upsampling

no code implementations17 Jun 2015 Wei Liu, Yijun Li, Xiaogang Chen, Jie Yang, Qiang Wu, Jingyi Yu

A popular solution is upsampling the obtained noisy low resolution depth map with the guidance of the companion high resolution color image.

Vocal Bursts Intensity Prediction

Stochastic Gradient Made Stable: A Manifold Propagation Approach for Large-Scale Optimization

no code implementations28 Jun 2015 Yadong Mu, Wei Liu, Wei Fan

Stochastic gradient descent (SGD) holds as a classical method to build large scale machine learning models over big data.

Theoretic Analysis and Extremely Easy Algorithms for Domain Adaptive Feature Learning

no code implementations5 Sep 2015 Wenhao Jiang, Cheng Deng, Wei Liu, Feiping Nie, Fu-Lai Chung, Heng Huang

Domain adaptation problems arise in a variety of applications, where a training dataset from the \textit{source} domain and a test dataset from the \textit{target} domain typically follow different distributions.

Domain Adaptation

Learning to Hash for Indexing Big Data - A Survey

no code implementations17 Sep 2015 Jun Wang, Wei Liu, Sanjiv Kumar, Shih-Fu Chang

Such learning to hash methods exploit information such as data distributions or class labels when optimizing the hash codes or functions.

Top Rank Supervised Binary Coding for Visual Search

no code implementations ICCV 2015 Dongjin Song, Wei Liu, Rongrong Ji, David A. Meyer, John R. Smith

In this paper, we propose a novel supervised binary coding approach, namely Top Rank Supervised Binary Coding (Top-RSBC), which explicitly focuses on optimizing the precision of top positions in a Hamming-distance ranking list towards preserving the supervision information.

Image Retrieval

Learning Binary Codes for Maximum Inner Product Search

no code implementations ICCV 2015 Fumin Shen, Wei Liu, Shaoting Zhang, Yang Yang, Heng Tao Shen

Inspired by the latest advance in asymmetric hashing schemes, we propose an asymmetric binary code learning framework based on inner product fitting.

SSD: Single Shot MultiBox Detector

223 code implementations8 Dec 2015 Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg

Experimental results on the PASCAL VOC, MS COCO, and ILSVRC datasets confirm that SSD has comparable accuracy to methods that utilize an additional object proposal step and is much faster, while providing a unified framework for both training and inference.

LIDAR Semantic Segmentation Low-Light Image Enhancement +4

Data Driven Robust Image Guided Depth Map Restoration

no code implementations26 Dec 2015 Wei Liu, Yun Gu, Chunhua Shen, Xiaogang Chen, Qiang Wu, Jie Yang

Depth maps captured by modern depth cameras such as Kinect and Time-of-Flight (ToF) are usually contaminated by missing data, noises and suffer from being of low resolution.

Deep Learning Driven Visual Path Prediction from a Single Image

no code implementations27 Jan 2016 Siyu Huang, Xi Li, Zhongfei Zhang, Zhouzhou He, Fei Wu, Wei Liu, Jinhui Tang, Yueting Zhuang

The highly effective visual representation and deep context models ensure that our framework makes a deep semantic understanding of the scene and motion pattern, consequently improving the performance of the visual path prediction task.

Visual Tracking via Reliable Memories

no code implementations4 Feb 2016 Shu Wang, Shaoting Zhang, Wei Liu, Dimitris N. Metaxas

In this paper, we propose a novel visual tracking framework that intelligently discovers reliable patterns from a wide range of video to resist drift error for long-term tracking tasks.

Clustering Visual Tracking

Scalable Sequential Spectral Clustering

1 code implementation AAAI 2016 Yeqing Li, Junzhou Huang, Wei Liu

In the past decades, Spectral Clustering (SC) has become one of the most effective clustering approaches.

Clustering graph construction +1

Feature-Area Optimization: A Novel SAR Image Registration Method

no code implementations18 Feb 2016 Fuqiang Liu, Fukun Bi, Liang Chen, Hao Shi, Wei Liu

This letter proposes a synthetic aperture radar (SAR) image registration method named Feature-Area Optimization (FAO).

Image Registration

Multimodal Emotion Recognition Using Multimodal Deep Learning

no code implementations26 Feb 2016 Wei Liu, Wei-Long Zheng, Bao-liang Lu

To enhance the performance of affective models and reduce the cost of acquiring physiological signals for real-world applications, we adopt multimodal deep learning approach to construct affective models from multiple physiological signals.

EEG Multimodal Deep Learning +1

Stochastic Quasi-Newton Methods for Nonconvex Stochastic Optimization

no code implementations5 Jul 2016 Xiao Wang, Shiqian Ma, Donald Goldfarb, Wei Liu

In this paper we study stochastic quasi-Newton methods for nonconvex stochastic optimization, where we assume that noisy information about the gradients of the objective function is available via a stochastic first-order oracle (SFO).

Binary Classification General Classification +1

Scaling Up Sparse Support Vector Machines by Simultaneous Feature and Sample Reduction

1 code implementation ICML 2017 Weizhong Zhang, Bin Hong, Wei Liu, Jieping Ye, Deng Cai, Xiaofei He, Jie Wang

By noting that sparse SVMs induce sparsities in both feature and sample spaces, we propose a novel approach, which is based on accurate estimations of the primal and dual optima of sparse SVMs, to simultaneously identify the inactive features and samples that are guaranteed to be irrelevant to the outputs.

Fast Single Shot Detection and Pose Estimation

no code implementations19 Sep 2016 Patrick Poirson, Phil Ammirato, Cheng-Yang Fu, Wei Liu, Jana Kosecka, Alexander C. Berg

For applications in navigation and robotics, estimating the 3D pose of objects is as important as detection.

Object Tracking Pose Estimation

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

2 code implementations CVPR 2017 Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua

Existing visual attention models are generally spatial, i. e., the attention is modeled as spatial probabilities that re-weight the last conv-layer feature map of a CNN encoding an input image.

Image Captioning Sentence

Geometric descent method for convex composite minimization

no code implementations NeurIPS 2017 Shixiang Chen, Shiqian Ma, Wei Liu

In this paper, we extend the geometric descent method recently proposed by Bubeck, Lee and Singh to tackle nonsmooth and strongly convex composite problems.

regression

DSSD : Deconvolutional Single Shot Detector

2 code implementations23 Jan 2017 Cheng-Yang Fu, Wei Liu, Ananth Ranga, Ambrish Tyagi, Alexander C. Berg

The main contribution of this paper is an approach for introducing additional context into state-of-the-art general object detection.

object-detection Object Detection

Joint Intermodal and Intramodal Label Transfers for Extremely Rare or Unseen Classes

no code implementations22 Mar 2017 Guo-Jun Qi, Wei Liu, Charu Aggarwal, Thomas Huang

One of our goals in this paper is to develop a model for revealing the functional relationships between text and image features as to directly transfer intermodal and intramodal labels to annotate the images.

General Classification Image Classification +3

Robust Guided Image Filtering

no code implementations28 Mar 2017 Wei Liu, Xiaogang Chen, Chunhua Shen, Jingyi Yu, Qiang Wu, Jie Yang

In this paper, we propose a general framework for Robust Guided Image Filtering (RGIF), which contains a data term and a smoothness term, to solve the two issues mentioned above.

Deep Self-Taught Learning for Weakly Supervised Object Localization

no code implementations CVPR 2017 Zequn Jie, Yunchao Wei, Xiaojie Jin, Jiashi Feng, Wei Liu

To overcome this issue, we propose a deep self-taught learning approach, which makes the detector learn the object-level features reliable for acquiring tight positive samples and afterwards re-train itself based on them.

Object Weakly Supervised Object Detection +1

End-to-end Active Object Tracking via Reinforcement Learning

no code implementations ICML 2018 Wenhan Luo, Peng Sun, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang

We study active object tracking, where a tracker takes as input the visual observation (i. e., frame sequence) and produces the camera control signal (e. g., move forward, turn left, etc.).

Object Object Tracking +2

Real-Time Neural Style Transfer for Videos

no code implementations CVPR 2017 Hao-Zhi Huang, Hao Wang, Wenhan Luo, Lin Ma, Wenhao Jiang, Xiaolong Zhu, Zhifeng Li, Wei Liu

More specifically, a hybrid loss is proposed to capitalize on the content information of input frames, the style information of a given style image, and the temporal information of consecutive frames.

Style Transfer Video Style Transfer

Diverse Image Annotation

no code implementations CVPR 2017 Baoyuan Wu, Fan Jia, Wei Liu, Bernard Ghanem

To this end, we treat the image annotation as a subset selection problem based on the conditional determinantal point process (DPP) model, which formulates the representation and diversity jointly.

TAG

Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization

no code implementations CVPR 2016 Canyi Lu, Jiashi Feng, Yudong Chen, Wei Liu, Zhouchen Lin, Shuicheng Yan

In this work, we prove that under certain suitable assumptions, we can recover both the low-rank and the sparse components exactly by simply solving a convex program whose objective is a weighted combination of the tensor nuclear norm and the $\ell_1$-norm, i. e., $\min_{{\mathcal{L}},\ {\mathcal{E}}} \ \|{{\mathcal{L}}}\|_*+\lambda\|{{\mathcal{E}}}\|_1, \ \text{s. t.}

Image Denoising

Detecting Faces Using Inside Cascaded Contextual CNN

no code implementations ICCV 2017 Kaipeng Zhang, Zhanpeng Zhang, Hao Wang, Zhifeng Li, Yu Qiao, Wei Liu

Deep Convolutional Neural Networks (CNNs) achieve substantial improvements in face detection in the wild.

Face Detection

Performance optimizations for scalable CFD applications on hybrid CPU+MIC heterogeneous computing system with millions of cores

no code implementations27 Oct 2017 Yong-Xian Wang, Li-Lun Zhang, Wei Liu, Xing-Hua Cheng, Yu Zhuang, Anthony T. Chronopoulos

For computational fluid dynamics (CFD) applications with a large number of grid points/cells, parallel computing is a common efficient strategy to reduce the computational time.

Performance

Dual Skipping Networks

no code implementations CVPR 2018 Changmao Cheng, Yanwei Fu, Yu-Gang Jiang, Wei Liu, Wenlian Lu, Jianfeng Feng, xiangyang xue

Inspired by the recent neuroscience studies on the left-right asymmetry of the human brain in processing low and high spatial frequency information, this paper introduces a dual skipping network which carries out coarse-to-fine object categorization.

General Classification Object +1

Stochastic Non-convex Ordinal Embedding with Stabilized Barzilai-Borwein Step Size

1 code implementation17 Nov 2017 Ke Ma, Jinshan Zeng, Jiechao Xiong, Qianqian Xu, Xiaochun Cao, Wei Liu, Yuan YAO

Learning representation from relative similarity comparisons, often called ordinal embedding, gains rising attention in recent years.

Mixture-Rank Matrix Approximation for Collaborative Filtering

1 code implementation NeurIPS 2017 Dongsheng Li, Chao Chen, Wei Liu, Tun Lu, Ning Gu, Stephen Chu

However, our studies show that submatrices with different ranks could coexist in the same user-item rating matrix, so that approximations with fixed ranks cannot perfectly describe the internal structures of the rating matrix, therefore leading to inferior recommendation accuracy.

Collaborative Filtering

Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Networks

1 code implementation CVPR 2018 Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, Shih-Fu Chang

We propose a novel framework called Semantics-Preserving Adversarial Embedding Network (SP-AEN) for zero-shot visual recognition (ZSL), where test images and their classes are both unseen during training.

General Classification Zero-Shot Learning

A Bidirectional Adaptive Bandwidth Mean Shift Strategy for Clustering

no code implementations22 Dec 2017 Fanyang Meng, Hong Liu, Yongsheng Liang, Wei Liu, Jihong Pei

The bandwidth of a kernel function is a crucial parameter in the mean shift algorithm.

Clustering

NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction

1 code implementation CVPR 2019 Yuan Gao, Jiayi Ma, Mingbo Zhao, Wei Liu, Alan L. Yuille

In this paper, we propose a novel Convolutional Neural Network (CNN) structure for general-purpose multi-task learning (MTL), which enables automatic feature fusing at every layer from different tasks.

Multi-Task Learning Semantic Segmentation

Salient Object Detection by Lossless Feature Reflection

no code implementations19 Feb 2018 Pingping Zhang, Wei Liu, Huchuan Lu, Chunhua Shen

Inspired by the intrinsic reflection of natural images, in this paper we propose a novel feature learning framework for large-scale salient object detection.

Object object-detection +3

Non-rigid Object Tracking via Deep Multi-scale Spatial-temporal Discriminative Saliency Maps

no code implementations22 Feb 2018 Pingping Zhang, Wei Liu, Dong Wang, Yinjie Lei, Hongyu Wang, Chunhua Shen, Huchuan Lu

Extensive experiments demonstrate that the proposed algorithm achieves competitive performance in both saliency detection and visual tracking, especially outperforming other related trackers on the non-rigid object tracking datasets.

Object Object Tracking +2

Neural Stereoscopic Image Style Transfer

no code implementations ECCV 2018 Xinyu Gong, HaoZhi Huang, Lin Ma, Fumin Shen, Wei Liu, Tong Zhang

While each view of the stereoscopic pair is processed in an individual path, a novel feature aggregation strategy is proposed to effectively share information between the two paths.

Style Transfer

CNN in MRF: Video Object Segmentation via Inference in A CNN-Based Higher-Order Spatio-Temporal MRF

no code implementations CVPR 2018 Linchao Bao, Baoyuan Wu, Wei Liu

With temporal dependencies established by optical flow, the resulting MRF model combines both spatial and temporal cues for tackling video object segmentation.

Object One-Shot Segmentation +4

Adversarial Spatio-Temporal Learning for Video Deblurring

1 code implementation28 Mar 2018 Kaihao Zhang, Wenhan Luo, Yiran Zhong, Lin Ma, Wei Liu, Hongdong Li

To tackle the second challenge, we leverage the developed DBLRNet as a generator in the GAN (generative adversarial network) architecture, and employ a content loss in addition to an adversarial loss for efficient adversarial training.

Deblurring Generative Adversarial Network

Reconstruction Network for Video Captioning

3 code implementations CVPR 2018 Bairui Wang, Lin Ma, Wei zhang, Wei Liu

Unlike previous video captioning work mainly exploiting the cues of video contents to make a language description, we propose a reconstruction network (RecNet) with a novel encoder-decoder-reconstructor architecture, which leverages both the forward (video to sentence) and backward (sentence to video) flows for video captioning.

Sentence Video Captioning

Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present

1 code implementation CVPR 2018 Xinpeng Chen, Lin Ma, Wenhao Jiang, Jian Yao, Wei Liu

Recently, caption generation with an encoder-decoder framework has been extensively studied and applied in different domains, such as image captioning, code captioning, and so on.

Caption Generation Image Captioning

Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning

1 code implementation CVPR 2018 Jingwen Wang, Wenhao Jiang, Lin Ma, Wei Liu, Yong Xu

We propose a bidirectional proposal method that effectively exploits both past and future contexts to make proposal predictions.

Dense Video Captioning

Multi-label Learning with Missing Labels using Mixed Dependency Graphs

no code implementations31 Mar 2018 Baoyuan Wu, Fan Jia, Wei Liu, Bernard Ghanem, Siwei Lyu

This work focuses on the problem of multi-label learning with missing labels (MLML), which aims to label each test instance with multiple class labels given training instances that have an incomplete/partial set of these labels.

Image Retrieval Missing Labels +2

Tagging like Humans: Diverse and Distinct Image Annotation

no code implementations CVPR 2018 Baoyuan Wu, Weidong Chen, Peng Sun, Wei Liu, Bernard Ghanem, Siwei Lyu

In D2IA, we generate a relevant and distinct tag subset, in which the tags are relevant to the image contents and semantically distinct to each other, using sequential sampling from a determinantal point process (DPP) model.

Generative Adversarial Network TAG

Left-Right Comparative Recurrent Model for Stereo Matching

no code implementations CVPR 2018 Zequn Jie, Pengfei Wang, Yonggen Ling, Bo Zhao, Yunchao Wei, Jiashi Feng, Wei Liu

Left-right consistency check is an effective way to enhance the disparity estimation by referring to the information from the opposite view.

Disparity Estimation Stereo Disparity Estimation +2

Learning to Guide Decoding for Image Captioning

no code implementations3 Apr 2018 Wenhao Jiang, Lin Ma, Xinpeng Chen, Hanwang Zhang, Wei Liu

Recently, much advance has been made in image captioning, and an encoder-decoder framework has achieved outstanding performance for this task.

Attribute Image Captioning

Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval

1 code implementation CVPR 2018 Chao Li, Cheng Deng, Ning li, Wei Liu, Xinbo Gao, DaCheng Tao

In addition, we harness a self-supervised semantic network to discover high-level semantic information in the form of multi-label annotations.

Cross-Modal Retrieval Retrieval

Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images

6 code implementations ECCV 2018 Nanyang Wang, yinda zhang, Zhuwen Li, Yanwei Fu, Wei Liu, Yu-Gang Jiang

We propose an end-to-end deep learning architecture that produces a 3D shape in triangular mesh from a single color image.

3D Object Reconstruction

Tensor Robust Principal Component Analysis with A New Tensor Nuclear Norm

1 code implementation10 Apr 2018 Canyi Lu, Jiashi Feng, Yudong Chen, Wei Liu, Zhouchen Lin, Shuicheng Yan

Equipped with the new tensor nuclear norm, we then solve the TRPCA problem by solving a convex program and provide the theoretical guarantee for the exact recovery.

Neural Compatibility Modeling with Attentive Knowledge Distillation

no code implementations17 Apr 2018 Xuemeng Song, Fuli Feng, Xianjing Han, Xin Yang, Wei Liu, Liqiang Nie

Nevertheless, existing studies overlook the rich valuable knowledge (rules) accumulated in fashion domain, especially the rules regarding clothing matching.

Image Classification Knowledge Distillation +2

Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition

no code implementations AAAI 2018 Wei Liu, Chaofeng Chen, Kwan-Yee K. Wong

Unlike previous work which employed a global spatial transformer network to rectify the entire distorted text image, we take an approach of detecting and rectifying individual characters.

Scene Text Recognition

Semantic Structure-based Unsupervised Deep Hashing

1 code implementation IJCAI2018 2018 Erkun Yang, Cheng Deng, Tongliang Liu, Wei Liu, DaCheng Tao

Hashing is becoming increasingly popular for approximate nearest neighbor searching in massive databases due to its storage and search efficiency.

Deep Hashing Semantic Similarity +1

Long-Term Human Motion Prediction by Modeling Motion Context and Enhancing Motion Dynamic

no code implementations7 May 2018 Yongyi Tang, Lin Ma, Wei Liu, Wei-Shi Zheng

Human motion prediction aims at generating future frames of human motion based on an observed sequence of skeletons.

Human motion prediction motion prediction

A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization

no code implementations9 May 2018 Li Wang, Junlin Yao, Yunzhe Tao, Li Zhong, Wei Liu, Qiang Du

In this paper, we propose a deep learning approach to tackle the automatic summarization tasks by incorporating topic information into the convolutional sequence-to-sequence (ConvS2S) model and using self-critical sequence training (SCST) for optimization.

Abstractive Text Summarization Informativeness

An Algorithmic Framework of Variable Metric Over-Relaxed Hybrid Proximal Extra-Gradient Method

no code implementations ICML 2018 Li Shen, Peng Sun, Yitong Wang, Wei Liu, Tong Zhang

Specifically, we find that a large class of primal and primal-dual operator splitting algorithms are all special cases of VMOR-HPE.

Safe Element Screening for Submodular Function Minimization

no code implementations ICML 2018 Weizhong Zhang, Bin Hong, Lin Ma, Wei Liu, Tong Zhang

Relying on this study, we subsequently propose a novel safe screening method to quickly identify the elements guaranteed to be included (we refer to them as active) or excluded (inactive) in the final optimal solution of SFM during the optimization process.

Combinatorial Optimization Sparse Learning

Video Description: A Survey of Methods, Datasets and Evaluation Metrics

no code implementations1 Jun 2018 Nayyer Aafaq, Ajmal Mian, Wei Liu, Syed Zulqarnain Gilani, Mubarak Shah

Video description is the automatic generation of natural language sentences that describe the contents of a given video.

Language Modelling Video Description

Nonlocal Neural Networks, Nonlocal Diffusion and Nonlocal Modeling

no code implementations NeurIPS 2018 Yunzhe Tao, Qi Sun, Qiang Du, Wei Liu

Nonlocal neural networks have been proposed and shown to be effective in several computer vision tasks, where the nonlocal operations can directly capture long-range dependencies in the feature space.

NovelPerspective: Identifying Point of View Characters

1 code implementation ACL 2018 Lyndon White, Roberto Togneri, Wei Liu, Mohammed Bennamoun

Our tool detects the main character that each section is from the POV of, and allows the user to generate a new ebook with only those sections.

Named Entity Recognition (NER)

Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks

no code implementations ECCV 2018 Minjun Li, Hao-Zhi Huang, Lin Ma, Wei Liu, Tong Zhang, Yu-Gang Jiang

Recent studies on unsupervised image-to-image translation have made a remarkable progress by training a pair of generative adversarial networks with a cycle-consistent loss.

Translation Unsupervised Image-To-Image Translation

Recurrent Fusion Network for Image Captioning

no code implementations ECCV 2018 Wenhao Jiang, Lin Ma, Yu-Gang Jiang, Wei Liu, Tong Zhang

In this paper, in order to exploit the complementary information from multiple encoders, we propose a novel Recurrent Fusion Network (RFNet) for tackling image captioning.

Image Captioning

DataDeps.jl: Repeatable Data Setup for Replicable Data Science

2 code implementations3 Aug 2018 Lyndon White, Roberto Togneri, Wei Liu, Mohammed Bennamoun

We present DataDeps. jl: a julia package for the reproducible handling of static datasets to enhance the repeatability of scripts used in the data and computational sciences.

Software Engineering

Video Re-localization

1 code implementation ECCV 2018 Yang Feng, Lin Ma, Wei Liu, Tong Zhang, Jiebo Luo

We first exploit and reorganize the videos in ActivityNet to form a new dataset for video re-localization research, which consists of about 10, 000 videos of diverse visual appearances associated with localized boundary information.

Copy Detection

STTM: A Tool for Short Text Topic Modeling

1 code implementation7 Aug 2018 Jipeng Qiang, Yun Li, Yunhao Yuan, Wei Liu, Xindong Wu

Along with the emergence and popularity of social communications on the Internet, topic discovery from short texts becomes fundamental to many applications that require semantic understanding of textual content.

Information Retrieval

End-to-end Active Object Tracking and Its Real-world Deployment via Reinforcement Learning

no code implementations10 Aug 2018 Wenhan Luo, Peng Sun, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang

We further propose an environment augmentation technique and a customized reward function, which are crucial for successful training.

Object Object Tracking +1

A Unified Analysis of AdaGrad with Weighted Aggregation and Momentum Acceleration

no code implementations10 Aug 2018 Li Shen, Congliang Chen, Fangyu Zou, Zequn Jie, Ju Sun, Wei Liu

Integrating adaptive learning rate and momentum techniques into SGD leads to a large class of efficiently accelerated adaptive stochastic algorithms, such as AdaGrad, RMSProp, Adam, AccAdaGrad, \textit{etc}.

Stochastic Optimization

Reconstruction of a Photonic Qubit State with Reinforcement Learning

no code implementations28 Aug 2018 Shang Yu, F. Albarran-Arriagada, J. C. Retamal, Yi-Tao Wang, Wei Liu, Zhi-Jin Ke, Yu Meng, Zhi-Peng Li, Jian-Shun Tang, E. Solano, L. Lamata, Chuan-Feng Li, Guang-Can Guo

An experiment is performed to reconstruct an unknown photonic quantum state with a limited amount of copies.

Quantum Physics

Learning Efficient Single-stage Pedestrian Detectors by Asymptotic Localization Fitting

1 code implementation ECCV 2018 Wei Liu, Shengcai Liao, Weidong Hu, Xuezhi Liang, Xiao Chen

However, current single-stage detectors (e. g. SSD) have not presented competitive accuracy on common pedestrian detection benchmarks.

Ranked #11 on Pedestrian Detection on Caltech (using extra training data)

Pedestrian Detection

Contour Knowledge Transfer for Salient Object Detection

1 code implementation ECCV 2018 Xin Li, Fan Yang, Hong Cheng, Wei Liu, Dinggang Shen

Our goal is to overcome this limitation by automatically converting an existing deep contour detection model into a salient object detection model without using any manual salient object masks.

Contour Detection Object +4

Incremental Multi-graph Matching via Diversity and Randomness based Graph Clustering

no code implementations ECCV 2018 Tianshu Yu, Junchi Yan, Wei Liu, Baoxin Li

In this paper, we present an incremental multi-graph matching approach, which deals with the arriving graph utilizing the previous matching results under the global consistency constraint.

Clustering Graph Clustering +1

Temporally Coherent Video Harmonization Using Adversarial Networks

1 code implementation5 Sep 2018 Hao-Zhi Huang, Senzhe Xu, Junxiong Cai, Wei Liu, Shi-Min Hu

Since existing video datasets which have ground-truth foreground masks and optical flows are not sufficiently large, we propose a simple yet efficient method to build up a synthetic dataset supporting supervised training of the proposed adversarial network.

Video Harmonization

Distilled Wasserstein Learning for Word Embedding and Topic Modeling

no code implementations NeurIPS 2018 Hongteng Xu, Wenlin Wang, Wei Liu, Lawrence Carin

When learning the topic model, we leverage a distilled underlying distance matrix to update the topic distributions and smoothly calculate the corresponding optimal transports.

Mortality Prediction Word Embeddings

Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition

no code implementations ECCV 2018 Yitong Wang, Dihong Gong, Zheng Zhou, Xing Ji, Hao Wang, Zhifeng Li, Wei Liu, Tong Zhang

Extensive experiments conducted on the three public domain face aging datasets (MORPH Album 2, CACD-VS and FG-NET) have shown the effectiveness of the proposed approach and the value of the constructed CAF dataset on AIFR.

Age-Invariant Face Recognition Benchmarking +1

PocketFlow: An Automated Framework for Compressing and Accelerating Deep Neural Networks

1 code implementation NIPS Workshop CDNNRIA 2018 Jiaxiang Wu, Yao Zhang, Haoli Bai, Huasong Zhong, Jinlong Hou, Wei Liu, Wenbing Huang, Junzhou Huang

Deep neural networks are widely used in various domains, but the prohibitive computational complexity prevents their deployment on mobile devices.

Model Compression

Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition

no code implementations1 Nov 2018 Xiangbo Shu, Jinhui Tang, Guo-Jun Qi, Wei Liu, Jian Yang

In a Co-LSTM unit, each sub-memory unit stores individual motion information, while this Co-LSTM unit selectively integrates and stores inter-related motion information between multiple interacting persons from multiple sub-memory units via the cell gate and co-memory cell, respectively.

Action Recognition Human Interaction Recognition +1

Bi-Real Net: Binarizing Deep Network Towards Real-Network Performance

1 code implementation4 Nov 2018 Zechun Liu, Wenhan Luo, Baoyuan Wu, Xin Yang, Wei Liu, Kwang-Ting Cheng

To address the training difficulty, we propose a training algorithm using a tighter approximation to the derivative of the sign function, a magnitude-aware gradient for weight updating, a better initialization method, and a two-step scheme for training a deep network.

Depth Estimation

Super-Identity Convolutional Neural Network for Face Hallucination

no code implementations ECCV 2018 Kaipeng Zhang, Zhanpeng Zhang, Chia-Wen Cheng, Winston H. Hsu, Yu Qiao, Wei Liu, Tong Zhang

Face hallucination is a generative task to super-resolve the facial image with low resolution while human perception of face heavily relies on identity information.

Face Generation Face Hallucination +1

A Sufficient Condition for Convergences of Adam and RMSProp

no code implementations CVPR 2019 Fangyu Zou, Li Shen, Zequn Jie, Weizhong Zhang, Wei Liu

Adam and RMSProp are two of the most influential adaptive stochastic algorithms for training deep neural networks, which have been pointed out to be divergent even in the convex setting via a few simple counterexamples.

Stochastic Optimization

Unsupervised Image Captioning

1 code implementation CVPR 2019 Yang Feng, Lin Ma, Wei Liu, Jiebo Luo

Instead of relying on manually labeled image-sentence pairs, our proposed model merely requires an image set, a sentence corpus, and an existing visual concept detector.

Image Captioning Sentence

Multi-granularity Generator for Temporal Action Proposal

no code implementations CVPR 2019 Yuan Liu, Lin Ma, Yifeng Zhang, Wei Liu, Shih-Fu Chang

In this paper, we propose a multi-granularity generator (MGG) to perform the temporal action proposal from different granularity perspectives, relying on the video visual features equipped with the position embedding information.

Action Recognition Temporal Action Proposal Generation

Generalizing Graph Matching beyond Quadratic Assignment Model

no code implementations NeurIPS 2018 Tianshu Yu, Junchi Yan, Yilin Wang, Wei Liu, Baoxin Li

Graph matching has received persistent attention over decades, which can be formulated as a quadratic assignment problem (QAP).

Graph Matching

Deep Non-Blind Deconvolution via Generalized Low-Rank Approximation

no code implementations NeurIPS 2018 Wenqi Ren, Jiawei Zhang, Lin Ma, Jinshan Pan, Xiaochun Cao, WangMeng Zuo, Wei Liu, Ming-Hsuan Yang

In this paper, we present a deep convolutional neural network to capture the inherent properties of image degradation, which can handle different kernels and saturated pixels in a unified framework.

Deblurring

Learning to Compose Dynamic Tree Structures for Visual Contexts

6 code implementations CVPR 2019 Kaihua Tang, Hanwang Zhang, Baoyuan Wu, Wenhan Luo, Wei Liu

We propose to compose dynamic tree structures that place the objects in an image into a visual context, helping visual reasoning tasks such as scene graph generation and visual Q&A.

Graph Generation Panoptic Scene Graph Generation +2

Real-Time Referring Expression Comprehension by Single-Stage Grounding Network

no code implementations9 Dec 2018 Xinpeng Chen, Lin Ma, Jingyuan Chen, Zequn Jie, Wei Liu, Jiebo Luo

Experiments on RefCOCO, RefCOCO+, and RefCOCOg datasets demonstrate that our proposed SSG without relying on any region proposals can achieve comparable performance with other advanced models.

Attribute Referring Expression +1

Semi-Supervised Learning for Face Sketch Synthesis in the Wild

1 code implementation12 Dec 2018 Chaofeng Chen, Wei Liu, Xiao Tan, Kwan-Yee K. Wong

Instead of supervising the network with ground truth sketches, we first perform patch matching in feature space between the input photo and photos in a small reference set of photo-sketch pairs.

Face Sketch Synthesis Patch Matching

Hierarchical Macro Strategy Model for MOBA Game AI

no code implementations19 Dec 2018 Bin Wu, Qiang Fu, Jing Liang, Peng Qu, Xiaoqian Li, Liang Wang, Wei Liu, Wei Yang, Yongsheng Liu

In this paper, we propose a novel learning-based Hierarchical Macro Strategy model for mastering MOBA games, a sub-genre of RTS games.

SAFE: Scale Aware Feature Encoder for Scene Text Recognition

no code implementations17 Jan 2019 Wei Liu, Chaofeng Chen, Kwan-Yee K. Wong

We propose a novel scale aware feature encoder (SAFE) that is designed specifically for encoding characters with different scales.

Scene Text Recognition

Salient Object Detection with Lossless Feature Reflection and Weighted Structural Loss

no code implementations21 Jan 2019 Pingping Zhang, Wei Liu, Huchuan Lu, Chunhua Shen

Inspired by the intrinsic reflection of natural images, in this paper we propose a novel feature learning framework for large-scale salient object detection.

Object object-detection +3

End-to-End Single Image Fog Removal using Enhanced Cycle Consistent Adversarial Networks

no code implementations4 Feb 2019 Wei Liu, Xianxu Hou, Jiang Duan, Guoping Qiu

In addition, we also contribute the first real world nature fog-fogfree image dataset for defogging research.

Fully-Featured Attribute Transfer

no code implementations17 Feb 2019 De Xie, Muli Yang, Cheng Deng, Wei Liu, DaCheng Tao

Image attribute transfer aims to change an input image to a target one with expected attributes, which has received significant attention in recent years.

Attribute Image Generation

PFLD: A Practical Facial Landmark Detector

18 code implementations28 Feb 2019 Xiaojie Guo, Siyuan Li, Jinke Yu, Jiawan Zhang, Jiayi Ma, Lin Ma, Wei Liu, Haibin Ling

Being accurate, efficient, and compact is essential to a facial landmark detector for practical use.

Face Alignment Facial Landmark Detection

Stacked Semantic-Guided Network for Zero-Shot Sketch-Based Image Retrieval

no code implementations3 Apr 2019 Hao Wang, Cheng Deng, Xinxu Xu, Wei Liu, Xinbo Gao, DaCheng Tao

Previous works mostly focus on a generative approach that takes a highly abstract and sparse sketch as input and then synthesizes the corresponding natural image.

Retrieval Sketch-Based Image Retrieval +1

Center and Scale Prediction: Anchor-free Approach for Pedestrian and Face Detection

2 code implementations CVPR 2019 Wei Liu, Irtiza Hasan, Shengcai Liao

Like edges, corners, blobs and other feature detectors, the proposed detector scans for feature points all over the image, for which the convolution is naturally suited.

Ranked #8 on Pedestrian Detection on Caltech (using extra training data)

Face Detection object-detection +2

Efficient Decision-based Black-box Adversarial Attacks on Face Recognition

no code implementations CVPR 2019 Yinpeng Dong, Hang Su, Baoyuan Wu, Zhifeng Li, Wei Liu, Tong Zhang, Jun Zhu

In this paper, we evaluate the robustness of state-of-the-art face recognition models in the decision-based black-box attack setting, where the attackers have no access to the model parameters and gradients, but can only acquire hard-label predictions by sending queries to the target model.

Face Recognition

MVF-Net: Multi-View 3D Face Morphable Model Regression

1 code implementation CVPR 2019 Fanzi Wu, Linchao Bao, Yajing Chen, Yonggen Ling, Yibing Song, Songnan Li, King Ngi Ngan, Wei Liu

The main ingredient of the view alignment loss is a differentiable dense optical flow estimator that can backpropagate the alignment errors between an input view and a synthetic rendering from another input view, which is projected to the target view through the 3D shape to be inferred.

Optical Flow Estimation regression

Decorrelated Adversarial Learning for Age-Invariant Face Recognition

1 code implementation CVPR 2019 Hao Wang, Dihong Gong, Zhifeng Li, Wei Liu

To reduce such a discrepancy, in this paper we propose a novel algorithm to remove age-related components from features mixed with both identity and age information.

Age-Invariant Face Recognition MORPH

Shared Predictive Cross-Modal Deep Quantization

no code implementations16 Apr 2019 Erkun Yang, Cheng Deng, Chao Li, Wei Liu, Jie Li, DaCheng Tao

In this paper, we propose a deep quantization approach, which is among the early attempts of leveraging deep neural networks into quantization-based cross-modal similarity search.

Quantization

Deep Spectral Clustering using Dual Autoencoder Network

no code implementations CVPR 2019 Xu Yang, Cheng Deng, Feng Zheng, Junchi Yan, Wei Liu

In this paper, we propose a joint learning framework for discriminative embedding and spectral clustering.

Clustering Deep Clustering +1

FaceShapeGene: A Disentangled Shape Representation for Flexible Face Image Editing

no code implementations6 May 2019 Sen-Zhe Xu, Hao-Zhi Huang, Shi-Min Hu, Wei Liu

On the basis of the FaceShapeGene, a novel part-wise face image editing system is developed, which contains a shape-remix network and a conditional label-to-face transformer.

Image Manipulation

DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs

no code implementations CVPR 2019 Erkun Yang, Tongliang Liu, Cheng Deng, Wei Liu, DaCheng Tao

To address this issue, we propose a novel deep unsupervised hashing model, dubbed DistillHash, which can learn a distilled data set consisted of data pairs, which have confidence similarity signals.

Deep Hashing Semantic Similarity +1

Spatio-temporal Video Re-localization by Warp LSTM

no code implementations CVPR 2019 Yang Feng, Lin Ma, Wei Liu, Jiebo Luo

The need for efficiently finding the video content a user wants is increasing because of the erupting of user-generated videos on the Web.

Retrieval Video Retrieval

Exact Adversarial Attack to Image Captioning via Structured Output Learning with Latent Variables

1 code implementation CVPR 2019 Yan Xu, Baoyuan Wu, Fumin Shen, Yanbo Fan, Yong Zhang, Heng Tao Shen, Wei Liu

Due to the sequential dependencies among words in a caption, we formulate the generation of adversarial noises for targeted partial captions as a structured output learning problem with latent variables.

Adversarial Attack Image Captioning

Electronic structure and $H$-$T$ phase diagram of Eu(Fe$_{1-x}$Rh$_x$)$_2$As$_2$

no code implementations28 May 2019 Shaozhu Xiao, Darren C. Peets, Wei Liu, Shiju Zhang, Ya Feng, Wen-He Jiao, Guang-Han Cao, Eike F. Schwier, Kenya Shimada, Cong Li, Xingjiang Zhou, Shaolong He

The iron-based superconductors represent a promising platform for high-temperature superconductivity, but the interactions underpinning their pairing present a puzzle.

Superconductivity Strongly Correlated Electrons

An Encoding Strategy Based Word-Character LSTM for Chinese NER

1 code implementation NAACL 2019 Wei Liu, Tongge Xu, Qinghua Xu, Jiayu Song, Yueran Zu

A recently proposed lattice model has demonstrated that words in character sequence can provide rich word boundary information for character-based Chinese NER model.

NER Segmentation

High-Level Semantic Feature Detection: A New Perspective for Pedestrian Detection

1 code implementation CVPR 2019 Wei Liu, Shengcai Liao, Weiqiang Ren, Weidong Hu, Yinan Yu

Like edges, corners, blobs and other feature detectors, the proposed detector scans for feature points all over the image, for which the convolution is naturally suited.

object-detection Object Detection +2

Understanding Distributional Ambiguity via Non-robust Chance Constraint

no code implementations3 Jun 2019 Qi Wu, Shumin Ma, Cheuk Hang Leung, Wei Liu, Nanbo Peng

Without the boundedness constraint, the CCO problem is shown to perform uniformly better than the DRO problem, irrespective of the radius of the ambiguity set, the choice of the divergence measure, or the tail heaviness of the center distribution.

Portfolio Optimization

Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning

no code implementations3 Jun 2019 Wei Zhang, Bairui Wang, Lin Ma, Wei Liu

Unlike previous video captioning work mainly exploiting the cues of video contents to make a language description, we propose a reconstruction network (RecNet) in a novel encoder-decoder-reconstructor architecture, which leverages both forward (video to sentence) and backward (sentence to video) flows for video captioning.

reinforcement-learning Reinforcement Learning (RL) +2

Global injectivity of differentiable maps via W-condition in R^2

no code implementations25 Jun 2019 Wei Liu

In this paper, we study the intrinsic relation between the global injectivity of differentiable local homeomorphisms $F$ and the rate that tends to zero of $Spec(F)$ in $\mathbb{R}^2$, where $Spec(F)$ denotes the set of all (complex) eigenvalues of $DF(x)$, for all $x\in \mathbb{R}^2$.

Functional Analysis Operator Algebras 14R15, 14E07, 14E09

A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing

1 code implementation23 Jul 2019 Wei Liu, Pingping Zhang, Yinjie Lei, Xiaolin Huang, Jie Yang, Ian Reid

In this paper, a non-convex non-smooth optimization framework is proposed to achieve diverse smoothing natures where even contradictive smoothing behaviors can be achieved.

image smoothing

Central Similarity Quantization for Efficient Image and Video Retrieval

1 code implementation CVPR 2020 Li Yuan, Tao Wang, Xiaopeng Zhang, Francis EH Tay, Zequn Jie, Wei Liu, Jiashi Feng

In this work, we propose a new \emph{global} similarity metric, termed as \emph{central similarity}, with which the hash codes of similar data pairs are encouraged to approach a common center and those for dissimilar pairs to converge to different centers, to improve hash learning efficiency and retrieval accuracy.

Quantization Retrieval +1

Cascaded Context Pyramid for Full-Resolution 3D Semantic Scene Completion

no code implementations ICCV 2019 Pingping Zhang, Wei Liu, Yinjie Lei, Huchuan Lu, Xiaoyun Yang

To address these issues, in this work we propose a novel deep learning framework, named Cascaded Context Pyramid Network (CCPNet), to jointly infer the occupancy and semantic labels of a volumetric 3D scene from a single depth image.

Ranked #5 on 3D Semantic Scene Completion on NYUv2 (using extra training data)

3D Semantic Scene Completion

Multi-Frame Content Integration with a Spatio-Temporal Attention Mechanism for Person Video Motion Transfer

no code implementations12 Aug 2019 Kun Cheng, Hao-Zhi Huang, Chun Yuan, Lingyiqing Zhou, Wei Liu

Specifically, we transfer the motion of one person in a target video to another person in a source video, while preserving the appearance of the source person.

Video Generation

Multimodal Emotion Recognition Using Deep Canonical Correlation Analysis

1 code implementation13 Aug 2019 Wei Liu, Jie-Lin Qiu, Wei-Long Zheng, Bao-liang Lu

We evaluate the performance of DCCA on five multimodal datasets: the SEED, SEED-IV, SEED-V, DEAP, and DREAMER datasets.

Binary Classification General Classification +1

Occlusion Robust Face Recognition Based on Mask Learning with PairwiseDifferential Siamese Network

1 code implementation17 Aug 2019 Lingxue Song, Dihong Gong, Zhifeng Li, Changsong Liu, Wei Liu

Deep Convolutional Neural Networks (CNNs) have been pushing the frontier of the face recognition research in the past years.

Face Recognition Robust Face Recognition

From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories

no code implementations20 Aug 2019 Songwei Ge, Curtis Xuan, Ruihua Song, Chao Zou, Wei Liu, Jin Zhou

In this paper, we address the problem of automatically adding sound effects to radio stories with a retrieval-based model.

Retrieval TAG

Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network

1 code implementation ICCV 2019 Bairui Wang, Lin Ma, Wei zhang, Wenhao Jiang, Jingwen Wang, Wei Liu

In this paper, we propose to guide the video caption generation with Part-of-Speech (POS) information, based on a gated fusion of multiple representations of input videos.

Caption Generation POS +2

DV3+HED+: A DCNNs-based Framework to Monitor Temporary Works and ESAs in Railway Construction Project Using VHR Satellite Images

1 code implementation29 Aug 2019 Rui Guo, Ronghua Liu, Na Li, Wei Liu

Current VHR(Very High Resolution) satellite images enable the detailed monitoring of the earth and can capture the ongoing works of railway construction.

Edge Detection Semantic Segmentation

Multi-lingual Wikipedia Summarization and Title Generation On Low Resource Corpus

no code implementations RANLP 2019 Wei Liu, Lei LI, Zuying Huang, Yinan Liu

MultiLing 2019 Headline Generation Task on Wikipedia Corpus raised a critical and practical problem: multilingual task on low resource corpus.

Extractive Summarization Headline Generation +3

ICDM 2019 Knowledge Graph Contest: Team UWA

2 code implementations4 Sep 2019 Michael Stewart, Majigsuren Enkhsaikhan, Wei Liu

We present an overview of our triple extraction system for the ICDM 2019 Knowledge Graph Contest.

graph construction

Chinese Street View Text: Large-scale Chinese Text Reading with Partially Supervised Learning

no code implementations ICCV 2019 Yipeng Sun, Jiaming Liu, Wei Liu, Junyu Han, Errui Ding, Jingtuo Liu

Most existing text reading benchmarks make it difficult to evaluate the performance of more advanced deep learning models in large vocabularies due to the limited amount of training data.

Spatiotemporal Co-attention Recurrent Neural Networks for Human-Skeleton Motion Prediction

no code implementations29 Sep 2019 Xiangbo Shu, Liyan Zhang, Guo-Jun Qi, Wei Liu, Jinhui Tang

To this end, we propose a novel Skeleton-joint Co-attention Recurrent Neural Networks (SC-RNN) to capture the spatial coherence among joints, and the temporal evolution among skeletons simultaneously on a skeleton-joint co-attention feature map in spatiotemporal space.

Human motion prediction motion prediction

Accelerating Federated Learning via Momentum Gradient Descent

no code implementations8 Oct 2019 Wei Liu, Li Chen, Yunfei Chen, Wenyi Zhang

The proposed momentum federated learning (MFL) uses momentum gradient descent (MGD) in the local update step of FL system.

BIG-bench Machine Learning Federated Learning

Deep Multiphase Level Set for Scene Parsing

no code implementations8 Oct 2019 Pingping Zhang, Wei Liu, Yinjie Lei, Hongyu Wang, Huchuan Lu

The proposed method consists of three modules, i. e., recurrent FCNs, adaptive multiphase level set, and deeply supervised learning.

Image Segmentation Scene Parsing +1

Context-Gated Convolution

1 code implementation ECCV 2020 Xudong Lin, Lin Ma, Wei Liu, Shih-Fu Chang

As such, being aware of the global context, the modulated convolution kernel of our proposed CGC can better extract representative local patterns and compose discriminative features.

Ranked #61 on Image Classification on ObjectNet (using extra training data)

Action Recognition Image Classification +1

Vatex Video Captioning Challenge 2020: Multi-View Features and Hybrid Reward Strategies for Video Captioning

no code implementations17 Oct 2019 Xinxin Zhu, Longteng Guo, Peng Yao, Shichen Lu, Wei Liu, Jing Liu

This report describes our solution for the VATEX Captioning Challenge 2020, which requires generating descriptions for the videos in both English and Chinese languages.

Video Captioning

Diversifying Topic-Coherent Response Generation for Natural Multi-turn Conversations

no code implementations24 Oct 2019 Fei Hu, Wei Liu, Ajmal Saeed Mian, Li Li

In this paper, we propose the Topic-coherent Hierarchical Recurrent Encoder-Decoder model (THRED) to diversify the generated responses without deviating the contextual topics for multi-turn conversations.

Response Generation

Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation

1 code implementation NeurIPS 2019 Qiming Zhang, Jing Zhang, Wei Liu, DaCheng Tao

Although there has been a progress in matching the marginal distributions between two domains, the classifier favors the source domain features and makes incorrect predictions on the target domain due to category-agnostic feature alignment.

Semantic Segmentation Synthetic-to-Real Translation +1

Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos

1 code implementation NeurIPS 2019 Yitian Yuan, Lin Ma, Jingwen Wang, Wei Liu, Wenwu Zhu

Temporal sentence grounding in videos aims to detect and localize one target video segment, which semantically corresponds to a given sentence.

Sentence Temporal Sentence Grounding

Word-level Lexical Normalisation using Context-Dependent Embeddings

no code implementations13 Nov 2019 Michael Stewart, Wei Liu, Rachel Cardell-Oliver

In this paper we introduce a word-level GRU-based LN model and investigate the effectiveness of recent embedding techniques on word-level LN.

Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression

20 code implementations19 Nov 2019 Zhaohui Zheng, Ping Wang, Wei Liu, Jinze Li, Rongguang Ye, Dongwei Ren

By incorporating DIoU and CIoU losses into state-of-the-art object detection algorithms, e. g., YOLO v3, SSD and Faster RCNN, we achieve notable performance gains in terms of not only IoU metric but also GIoU metric.

object-detection Object Detection +1

Empirical Autopsy of Deep Video Captioning Frameworks

no code implementations21 Nov 2019 Nayyer Aafaq, Naveed Akhtar, Wei Liu, Ajmal Mian

We perform extensive experiments by varying the constituent components of the video captioning framework, and quantify the performance gains that are possible by mere component selection.

Language Modelling Video Captioning +1

Multi-Task Driven Feature Models for Thermal Infrared Tracking

1 code implementation26 Nov 2019 Qiao Liu, Xin Li, Zhenyu He, Nana Fan, Di Yuan, Wei Liu, Yonsheng Liang

These two feature models are learned using a multi-task matching framework and are jointly optimized on the TIR tracking task.

Thermal Infrared Object Tracking

Learning Multi-level Weight-centric Features for Few-shot Learning

no code implementations28 Nov 2019 Mingjiang Liang, Shaoli Huang, Shirui Pan, Mingming Gong, Wei Liu

Few-shot learning is currently enjoying a considerable resurgence of interest, aided by the recent advance of deep learning.

Few-Shot Learning

Cross-Modal Learning with Adversarial Samples

1 code implementation NeurIPS 2019 Chao Li, Shangqian Gao, Cheng Deng, De Xie, Wei Liu

Extensive experiments on two cross-modal benchmark datasets show that the adversarial examples produced by our CMLA are efficient in fooling a target deep cross-modal hashing network.

Retrieval

Fast Stochastic Ordinal Embedding with Variance Reduction and Adaptive Step Size

no code implementations1 Dec 2019 Ke Ma, Jinshan Zeng, Qianqian Xu, Xiaochun Cao, Wei Liu, Yuan YAO

Learning representation from relative similarity comparisons, often called ordinal embedding, gains rising attention in recent years.

Potential Passenger Flow Prediction: A Novel Study for Urban Transportation Development

no code implementations7 Dec 2019 Yongshun Gong, Zhibin Li, Jian Zhang, Wei Liu, Jin-Feng Yi

In this paper, this specific problem is termed as potential passenger flow (PPF) prediction, which is a novel and important study connected with urban computing and intelligent transportation systems.

MULTI-VIEW LEARNING Recommendation Systems

Graph Inference Learning for Semi-supervised Classification

no code implementations ICLR 2020 Chunyan Xu, Zhen Cui, Xiaobin Hong, Tong Zhang, Jian Yang, Wei Liu

In this work, we address semi-supervised classification of graph data, where the categories of those unlabeled nodes are inferred from labeled nodes as well as graph structures.

Classification General Classification +1

Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization

no code implementations NeurIPS 2020 Yan Yan, Yi Xu, Qihang Lin, Wei Liu, Tianbao Yang

In this paper, we bridge this gap by providing a sharp analysis of epoch-wise stochastic gradient descent ascent method (referred to as Epoch-GDA) for solving strongly convex strongly concave (SCSC) min-max problems, without imposing any additional assumption about smoothness or the function's structure.

LEMMA

Adversarial Perturbations Prevail in the Y-Channel of the YCbCr Color Space

1 code implementation25 Feb 2020 Camilo Pestana, Naveed Akhtar, Wei Liu, David Glance, Ajmal Mian

Our results show that our approach achieves the best balance between defence against adversarial attacks such as FGSM, PGD and DDN and maintaining the original accuracies of VGG-16, ResNet50 and DenseNet121 on clean images.

An Improved DOA Estimation Method for a Mixture of Circular and Non-Circular Signals Based on Sparse Arrays

no code implementations11 Mar 2020 Jingjing Cai, Wei Liu, Ru Zong, Yangyang Dong

Sparse arrays have attracted a lot of interests recently for their capability of providing more degrees of freedom than traditional uniform linear arrays.

Towards Photo-Realistic Virtual Try-On by Adaptively Generating$\leftrightarrow$Preserving Image Content

3 code implementations12 Mar 2020 Han Yang, Ruimao Zhang, Xiaobao Guo, Wei Liu, WangMeng Zuo, Ping Luo

First, a semantic layout generation module utilizes semantic segmentation of the reference image to progressively predict the desired semantic layout after try-on.

Ranked #4 on Virtual Try-on on VITON (IS metric)

Semantic Segmentation Virtual Try-on

E2EET: From Pipeline to End-to-end Entity Typing via Transformer-Based Embeddings

no code implementations23 Mar 2020 Michael Stewart, Wei Liu

They are therefore sensitive to window size selection and are unable to incorporate the context of the entire document.

Entity Typing named-entity-recognition +3

Progressive Multi-Stage Learning for Discriminative Tracking

no code implementations1 Apr 2020 Weichao Li, Xi Li, Omar Elfarouk Bourahla, Fuxian Huang, Fei Wu, Wei Liu, Zhiheng Wang, Hongmin Liu

Visual tracking is typically solved as a discriminative learning problem that usually requires high-quality samples for online model adaptation.

Visual Tracking

Deblurring by Realistic Blurring

1 code implementation CVPR 2020 Kaihao Zhang, Wenhan Luo, Yiran Zhong, Lin Ma, Bjorn Stenger, Wei Liu, Hongdong Li

To address this problem, we propose a new method which combines two GAN models, i. e., a learning-to-Blur GAN (BGAN) and learning-to-DeBlur GAN (DBGAN), in order to learn a better model for image deblurring by primarily learning how to blur images.

Deblurring Image Deblurring

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation

1 code implementation EMNLP 2020 Dayiheng Liu, Yeyun Gong, Jie Fu, Wei Liu, Yu Yan, Bo Shao, Daxin Jiang, Jiancheng Lv, Nan Duan

Furthermore, we propose a simple and effective method to mine the keyphrases of interest in the news article and build a first large-scale keyphrase-aware news headline corpus, which contains over 180K aligned triples of $<$news article, headline, keyphrase$>$.

Headline Generation Sentence

Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation

no code implementations17 Apr 2020 Xiaocong Chen, Chaoran Huang, Lina Yao, Xianzhi Wang, Wei Liu, Wenjie Zhang

Interactive recommendation aims to learn from dynamic interactions between items and users to achieve responsiveness and accuracy.

Decision Making Knowledge-Aware Recommendation +3

Quantized Adam with Error Feedback

no code implementations29 Apr 2020 Congliang Chen, Li Shen, Hao-Zhi Huang, Wei Liu

In this paper, we present a distributed variant of adaptive stochastic gradient method for training deep neural networks in the parameter-server model.

Quantization

Energy Efficient User Clustering, Hybrid Precoding and Power Optimization in Terahertz MIMO-NOMA Systems

no code implementations3 May 2020 Haijun Zhang, Haisen Zhang, Wei Liu, Keping Long, Jiangbo Dong, Victor C. M. Leung

Considering the power consumption and implementation complexity, the hybrid precoding scheme based on the sub-connection structure is adopted.

Clustering

Communication-Efficient Distributed Stochastic AUC Maximization with Deep Neural Networks

1 code implementation ICML 2020 Zhishuai Guo, Mingrui Liu, Zhuoning Yuan, Li Shen, Wei Liu, Tianbao Yang

In this paper, we study distributed algorithms for large-scale AUC maximization with a deep neural network as a predictive model.

Distributed Optimization

Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation

6 code implementations7 May 2020 Zhaohui Zheng, Ping Wang, Dongwei Ren, Wei Liu, Rongguang Ye, QinGhua Hu, WangMeng Zuo

In this paper, we propose Complete-IoU (CIoU) loss and Cluster-NMS for enhancing geometric factors in both bounding box regression and Non-Maximum Suppression (NMS), leading to notable gains of average precision (AP) and average recall (AR), without the sacrifice of inference efficiency.

Clustering Instance Segmentation +6

Hierarchical Regression Network for Spectral Reconstruction from RGB Images

1 code implementation10 May 2020 Yuzhi Zhao, Lai-Man Po, Qiong Yan, Wei Liu, Tingyu Lin

Hyperspectral reconstruction from RGB images denotes a reverse process of hyperspectral imaging by discovering an inverse response function.

regression Spectral Reconstruction

CPOT: Channel Pruning via Optimal Transport

no code implementations21 May 2020 Yucong Shen, Li Shen, Hao-Zhi Huang, Xuan Wang, Wei Liu

Recent advances in deep neural networks (DNNs) lead to tremendously growing network parameters, making the deployments of DNNs on platforms with limited resources extremely difficult.

Image-to-Image Translation Translation

TCDesc: Learning Topology Consistent Descriptors

no code implementations5 Jun 2020 Honghu Pan, Fanyang Meng, Zhenyu He, Yongsheng Liang, Wei Liu

Then we define topology distance between descriptors as the difference of their topology vectors.

DFraud3- Multi-Component Fraud Detection freeof Cold-start

no code implementations10 Jun 2020 Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

In this research, instead of focusing only on one component, detecting either fraud reviews or fraud users (fraudsters), vector representations are learnt for each component, enabling multi-component classification.

Component Classification Fraud Detection +1

Real-time Universal Style Transfer on High-resolution Images via Zero-channel Pruning

no code implementations16 Jun 2020 Jie An, Tao Li, Hao-Zhi Huang, Li Shen, Xuan Wang, Yongyi Tang, Jinwen Ma, Wei Liu, Jiebo Luo

Extracting effective deep features to represent content and style information is the key to universal style transfer.

Style Transfer

Cannot find the paper you are looking for? You can Submit a new open access paper.