Search Results for author: Jian Yang

Found 277 papers, 113 papers with code

Regularized Robust Coding for Face Recognition

no code implementations20 Feb 2012 Meng Yang, Lei Zhang, Jian Yang, David Zhang

Recently the sparse representation based classification (SRC) has been proposed for robust face recognition (FR).

Face Recognition Robust Face Recognition +1

SHALE: An Efficient Algorithm for Allocation of Guaranteed Display Advertising

no code implementations16 Mar 2012 Vijay Bharadwaj, Peiji Chen, Wenjing Ma, Chandrashekhar Nagarajan, John Tomlin, Sergei Vassilvitskii, Erik Vee, Jian Yang

Motivated by the problem of optimizing allocation in guaranteed display advertising, we develop an efficient, lightweight method of generating a compact {\em allocation plan} that can be used to guide ad server decisions.

Data Structures and Algorithms

Unsupervised Pretraining Encourages Moderate-Sparseness

no code implementations20 Dec 2013 Jun Li, Wei Luo, Jian Yang, Xiao-Tong Yuan

It is well known that direct training of deep neural networks will generally lead to poor results.

Nuclear Norm based Matrix Regression with Applications to Face Recognition with Occlusion and Illumination Changes

no code implementations6 May 2014 Jian Yang, Jianjun Qian, Lei Luo, Fanlong Zhang, Yicheng Gao

Compared with the current regression methods, the proposed Nuclear Norm based Matrix Regression (NMR) model is more robust for alleviating the effect of illumination, and more intuitive and powerful for removing the structural noise caused by occlusion.

Face Recognition regression

Feature Selection in Conditional Random Fields for Map Matching of GPS Trajectories

no code implementations2 Sep 2014 Jian Yang, Liqiu Meng

Map matching of the GPS trajectory serves the purpose of recovering the original route on a road network from a sequence of noisy GPS observations.

feature selection

Feature Engineering for Map Matching of Low-Sampling-Rate GPS Trajectories in Road Network

no code implementations2 Sep 2014 Jian Yang, Liqiu Meng

Map matching of GPS trajectories from a sequence of noisy observations serves the purpose of recovering the original routes in a road network.

Feature Engineering

Sparse Deep Stacking Network for Image Classification

no code implementations5 Jan 2015 Jun Li, Heyou Chang, Jian Yang

Luckily, a simplified neural network module (SNNM) has been proposed to directly learn the discriminative dictionaries for avoiding the expensive inference.

Classification General Classification +1

Stochastic Behavior of the Nonnegative Least Mean Fourth Algorithm for Stationary Gaussian Inputs and Slow Learning

no code implementations24 Aug 2015 Jingen Ni, Jian Yang, Jie Chen, Cédric Richard, José Carlos M. Bermudez

Some system identification problems impose nonnegativity constraints on the parameters to estimate due to inherent physical characteristics of the unknown system.

Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy

no code implementations22 Oct 2015 Michael N. Katehakis, Jian Yang, Tingting Zhou

Inventory control with unknown demand distribution is considered, with emphasis placed on the case involving discrete nonperishable items.

A survey of sparse representation: algorithms and applications

no code implementations23 Feb 2016 Zheng Zhang, Yong Xu, Jian Yang, Xuelong. Li, David Zhang

The main purpose of this article is to provide a comprehensive study and an updated review on sparse representation and to supply a guidance for researchers.

Tensor Graphical Model: Non-convex Optimization and Statistical Inference

no code implementations15 Sep 2016 Xiang Lyu, Will Wei Sun, Zhaoran Wang, Han Liu, Jian Yang, Guang Cheng

We consider the estimation and inference of graphical models that characterize the dependency structure of high-dimensional tensor-valued data.

LightRNN: Memory and Computation-Efficient Recurrent Neural Networks

no code implementations NeurIPS 2016 Xiang Li, Tao Qin, Jian Yang, Tie-Yan Liu

Based on the 2-Component shared embedding, we design a new RNN algorithm and evaluate it using the language modeling task on several benchmark datasets.

Language Modelling Machine Translation

Large Margin Discriminant Dimensionality Reduction in Prediction Space

no code implementations NeurIPS 2016 Mohammad Saberian, Jose Costa Pereira, Can Xu, Jian Yang, Nuno Nvasconcelos

We argue that the intermediate mapping, e. g. boosting predictor, is preserving the discriminant aspects of the data and by controlling the dimension of this mapping it is possible to achieve discriminant low dimensional representations for the data.

Dimensionality Reduction General Classification +1

Image Super-Resolution via Deep Recursive Residual Network

1 code implementation CVPR 2017 Ying Tai, Jian Yang, Xiaoming Liu

Specifically, residual learning is adopted, both in global and local manners, to mitigate the difficulty of training very deep networks; recursive learning is used to control the model parameters while increasing the depth.

Image Super-Resolution Video Super-Resolution

Discriminative Block-Diagonal Representation Learning for Image Recognition

no code implementations12 Jul 2017 Zheng Zhang, Yong Xu, Ling Shao, Jian Yang

In particular, the elaborate BDLRR is formulated as a joint optimization problem of shrinking the unfavorable representation from off-block-diagonal elements and strengthening the compact block-diagonal representation under the semi-supervised framework of low-rank representation.

Representation Learning

Spectral Filter Tracking

no code implementations18 Jul 2017 Zhen Cui, You yi Cai, Wen ming Zheng, Jian Yang

Visual object tracking is a challenging computer vision task with numerous real-world applications.

Graph Matching regression +2

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

no code implementations1 Nov 2017 Yu Chen, Chunhua Shen, Hao Chen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang

In contrast, human vision is able to predict poses by exploiting geometric constraints of landmark point inter-connectivity.

Pose Estimation

Action-Attending Graphic Neural Network

no code implementations17 Nov 2017 Chaolong Li, Zhen Cui, Wenming Zheng, Chunyan Xu, Rongrong Ji, Jian Yang

The motion analysis of human skeletons is crucial for human action recognition, which is one of the most active topics in computer vision.

Action Analysis Action Recognition +3

FSRNet: End-to-End Learning Face Super-Resolution with Facial Priors

4 code implementations CVPR 2018 Yu Chen, Ying Tai, Xiaoming Liu, Chunhua Shen, Jian Yang

We present a novel deep end-to-end trainable Face Super-Resolution Network (FSRNet), which makes full use of the geometry prior, i. e., facial landmark heatmaps and parsing maps, to super-resolve very low-resolution (LR) face images without well-aligned requirement.

Face Alignment Generative Adversarial Network +1

Understanding the Disharmony between Dropout and Batch Normalization by Variance Shift

4 code implementations CVPR 2019 Xiang Li, Shuo Chen, Xiaolin Hu, Jian Yang

Theoretically, we find that Dropout would shift the variance of a specific neural unit when we transfer the state of that network from train to test.

Densely Connected Bidirectional LSTM with Applications to Sentence Classification

2 code implementations3 Feb 2018 Zixiang Ding, Rui Xia, Jianfei Yu, Xiang Li, Jian Yang

Deep neural networks have recently been shown to achieve highly competitive performance in many computer vision tasks due to their abilities of exploring in a much larger hypothesis space.

Classification General Classification +2

Mixed Link Networks

1 code implementation6 Feb 2018 Wenhai Wang, Xiang Li, Jian Yang, Tong Lu

Basing on the analysis by revealing the equivalence of modern networks, we find that both ResNet and DenseNet are essentially derived from the same "dense topology", yet they only differ in the form of connection -- addition (dubbed "inner link") vs. concatenation (dubbed "outer link").

Representation Learning

Adversarial Metric Learning

no code implementations9 Feb 2018 Shuo Chen, Chen Gong, Jian Yang, Xiang Li, Yang Wei, Jun Li

In distinguishment stage, a metric is exhaustively learned to try its best to distinguish both the adversarial pairs and the original training pairs.

Metric Learning

Spatio-Temporal Graph Convolution for Skeleton Based Action Recognition

no code implementations27 Feb 2018 Chaolong Li, Zhen Cui, Wenming Zheng, Chunyan Xu, Jian Yang

To encode dynamic graphs, the constructed multi-scale local graph convolution filters, consisting of matrices of local receptive fields and signal mappings, are recursively performed on structured graph data of temporal and spatial domain.

Action Recognition Skeleton Based Action Recognition +1

Provable Convex Co-clustering of Tensors

no code implementations17 Mar 2018 Eric C. Chi, Brian R. Gaines, Will Wei Sun, Hua Zhou, Jian Yang

Our convex co-clustering (CoCo) estimator enjoys stability guarantees and its computational and storage costs are polynomial in the size of the data.

Clustering Computational Efficiency

Walk-Steered Convolution for Graph Classification

no code implementations16 Apr 2018 Jiatao Jiang, Chunyan Xu, Zhen Cui, Tong Zhang, Wenming Zheng, Jian Yang

As an analogy to a standard convolution kernel on image, Gaussian models implicitly coordinate those unordered vertices/nodes and edges in a local receptive field after projecting to the gradient space of Gaussian parameters.

Clustering General Classification +2

Greedy Graph Searching for Vascular Tracking in Angiographic Image Sequences

no code implementations25 May 2018 Huihui Fang, Jian Yang, Jianjun Zhu, Danni Ai, Yong Huang, Yurong Jiang, Hong Song, Yongtian Wang

The vascular branch was described using a vascular centerline extraction method with multi-probability fusion-based topology optimization.

Dynamic Time Warping Image Registration

Occluded Pedestrian Detection Through Guided Attention in CNNs

no code implementations CVPR 2018 Shanshan Zhang, Jian Yang, Bernt Schiele

In this paper, we aim to propose a simple and compact method based on the FasterRCNN architecture for occluded pedestrian detection.

Pedestrian Detection

Shape Robust Text Detection with Progressive Scale Expansion Network

9 code implementations7 Jun 2018 Xiang Li, Wenhai Wang, Wenbo Hou, Ruo-Ze Liu, Tong Lu, Jian Yang

To address these problems, we propose a novel Progressive Scale Expansion Network (PSENet), designed as a segmentation-based detector with multiple predictions for each text instance.

Curved Text Detection Text Detection

When Work Matters: Transforming Classical Network Structures to Graph CNN

no code implementations7 Jul 2018 Wenting Zhao, Chunyan Xu, Zhen Cui, Tong Zhang, Jiatao Jiang, Zhen-Yu Zhang, Jian Yang

In this paper, we aim to give a comprehensive analysis of when work matters by transforming different classical network structures to graph CNN, particularly in the basic graph recognition problem.

Graph Classification Video Understanding

Person Search via A Mask-Guided Two-Stream CNN Model

no code implementations ECCV 2018 Di Chen, Shanshan Zhang, Wanli Ouyang, Jian Yang, Ying Tai

In this work, we tackle the problem of person search, which is a challenging task consisted of pedestrian detection and person re-identification~(re-ID).

Pedestrian Detection Person Re-Identification +2

Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation

no code implementations ECCV 2018 Zhen-Yu Zhang, Zhen Cui, Chunyan Xu, Zequn Jie, Xiang Li, Jian Yang

In this paper, we propose a novel joint Task-Recursive Learning (TRL) framework for the closing-loop semantic segmentation and monocular depth estimation tasks.

Monocular Depth Estimation Segmentation +1

Context-Dependent Diffusion Network for Visual Relationship Detection

no code implementations11 Sep 2018 Zhen Cui, Chunyan Xu, Wenming Zheng, Jian Yang

Visual relationship detection can bridge the gap between computer vision and natural language for scene understanding of images.

Object Object Recognition +2

Triple Attention Mixed Link Network for Single Image Super Resolution

no code implementations8 Oct 2018 Xi Cheng, Xiang Li, Jian Yang

Single image super resolution is of great importance as a low-level computer vision task.

Image Super-Resolution

DSFD: Dual Shot Face Detector

4 code implementations CVPR 2019 Jian Li, Yabiao Wang, Changan Wang, Ying Tai, Jianjun Qian, Jian Yang, Chengjie Wang, Jilin Li, Feiyue Huang

In this paper, we propose a novel face detection network with three novel contributions that address three key aspects of face detection, including better feature learning, progressive loss design and anchor assign based data augmentation, respectively.

Data Augmentation Occluded Face Detection

Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition

no code implementations1 Nov 2018 Xiangbo Shu, Jinhui Tang, Guo-Jun Qi, Wei Liu, Jian Yang

In a Co-LSTM unit, each sub-memory unit stores individual motion information, while this Co-LSTM unit selectively integrates and stores inter-related motion information between multiple interacting persons from multiple sub-memory units via the cell gate and co-memory cell, respectively.

Action Recognition Human Interaction Recognition +1

Gaussian-Induced Convolution for Graphs

no code implementations11 Nov 2018 Jiatao Jiang, Zhen Cui, Chunyan Xu, Jian Yang

In this work, we propose a Gaussian-induced convolution (GIC) framework to conduct local convolution filtering on irregular graphs.

Graph Classification Learning Representation On Graph

2017 Robotic Instrument Segmentation Challenge

3 code implementations18 Feb 2019 Max Allan, Alex Shvets, Thomas Kurmann, Zichen Zhang, Rahul Duggal, Yun-Hsuan Su, Nicola Rieke, Iro Laina, Niveditha Kalavakonda, Sebastian Bodenstedt, Luis Herrera, Wenqi Li, Vladimir Iglovikov, Huoling Luo, Jian Yang, Danail Stoyanov, Lena Maier-Hein, Stefanie Speidel, Mahdi Azizian

In mainstream computer vision and machine learning, public datasets such as ImageNet, COCO and KITTI have helped drive enormous improvements by enabling researchers to understand the strengths and limitations of different algorithms via performance comparison.

Benchmarking Person Re-Identification +2

Learning with Inadequate and Incorrect Supervision

no code implementations20 Feb 2019 Chen Gong, Hengmin Zhang, Jian Yang, DaCheng Tao

To address label insufficiency, we use a graph to bridge the data points so that the label information can be propagated from the scarce labeled examples to unlabeled examples along the graph edges.

Image Classification speech-recognition +2

Selective Kernel Networks

20 code implementations CVPR 2019 Xiang Li, Wenhai Wang, Xiaolin Hu, Jian Yang

A building block called Selective Kernel (SK) unit is designed, in which multiple branches with different kernel sizes are fused using softmax attention that is guided by the information in these branches.

Ranked #98 on Image Classification on CIFAR-100 (using extra training data)

Image Classification

Manifold Criterion Guided Transfer Learning via Intermediate Domain Generation

1 code implementation25 Mar 2019 Lei Zhang, Shan-Shan Wang, Guang-Bin Huang, WangMeng Zuo, Jian Yang, David Zhang

The merits of the proposed MCTL are four-fold: 1) the concept of manifold criterion (MC) is first proposed as a measure validating the distribution matching across domains, and domain adaptation is achieved if the MC is satisfied; 2) the proposed MC can well guide the generation of the intermediate domain sharing similar distribution with the target domain, by minimizing the local domain discrepancy; 3) a global generative discrepancy metric (GGDM) is presented, such that both the global and local discrepancy can be effectively and positively reduced; 4) a simplified version of MCTL called MCTL-S is presented under a perfect domain generation assumption for more generic learning scenario.

Transfer Learning Unsupervised Domain Adaptation

Sparse Tensor Additive Regression

no code implementations31 Mar 2019 Botao Hao, Boxiang Wang, Pengyuan Wang, Jingfei Zhang, Jian Yang, Will Wei Sun

Tensors are becoming prevalent in modern applications such as medical imaging and digital marketing.

Click-Through Rate Prediction Marketing +1

A Regularization Approach for Instance-Based Superset Label Learning

no code implementations5 Apr 2019 Chen Gong, Tongliang Liu, Yuanyan Tang, Jian Yang, Jie Yang, DaCheng Tao

As a result, the intrinsic constraints among different candidate labels are deployed, and the disambiguated labels generated by RegISL are more discriminative and accurate than those output by existing instance-based algorithms.

ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems

2 code implementations6 Apr 2019 Yuan Gong, Jian Yang, Jacob Huber, Mitchell MacKnight, Christian Poellabauer

This paper introduces a new database of voice recordings with the goal of supporting research on vulnerabilities and protection of voice-controlled systems (VCSs).

Voice Anti-spoofing

Ensemble Teaching for Hybrid Label Propagation

no code implementations8 Apr 2019 Chen Gong, DaCheng Tao, Xiaojun Chang, Jian Yang

More importantly, HyDEnT conducts propagation under the guidance of an ensemble of teachers.

Online Adaptation through Meta-Learning for Stereo Depth Estimation

no code implementations17 Apr 2019 Zhen-Yu Zhang, Stéphane Lathuilière, Andrea Pilzer, Nicu Sebe, Elisa Ricci, Jian Yang

Our proposal is evaluated on the wellestablished KITTI dataset, where we show that our online method is competitive withstate of the art algorithms trained in a batch setting.

Meta-Learning Stereo Depth Estimation

Multi-scale Dynamic Graph Convolutional Network for Hyperspectral Image Classification

1 code implementation14 May 2019 Sheng Wan, Chen Gong, Ping Zhong, Bo Du, Lefei Zhang, Jian Yang

To alleviate this shortcoming, we consider employing the recently proposed Graph Convolutional Network (GCN) for hyperspectral image classification, as it can conduct the convolution on arbitrarily structured non-Euclidean data and is applicable to the irregular image regions represented by graph topological information.

Classification General Classification +1

Spatial Group-wise Enhance: Improving Semantic Feature Learning in Convolutional Networks

3 code implementations23 May 2019 Xiang Li, Xiaolin Hu, Jian Yang

The Convolutional Neural Networks (CNNs) generate the feature representation of complex objects by collecting hierarchical and different parts of semantic sub-features.

Image Classification Object Detection

Robust Classification with Sparse Representation Fusion on Diverse Data Subsets

no code implementations10 Jun 2019 Chun-Mei Feng, Yong Xu, Zuoyong Li, Jian Yang

It performs Sparse Representation Fusion based on the Diverse Subset of training samples (SRFDS), which reduces the impact of randomness of the sample set and enhances the robustness of classification results.

General Classification Robust classification

Refined-Segmentation R-CNN: A Two-stage Convolutional Neural Network for Punctate White Matter Lesion Segmentation in Preterm Infants

1 code implementation24 Jun 2019 Yalong Liu, Jie Li, Ying Wang, Miaomiao Wang, Xianjun Li, Zhicheng Jiao, Jian Yang, Xingbo Gao

In this paper, we construct an efficient two-stage PWML semantic segmentation network based on the characteristics of the lesion, called refined segmentation R-CNN (RS RCNN).

Image Segmentation Lesion Segmentation +3

Image Formation Model Guided Deep Image Super-Resolution

1 code implementation18 Aug 2019 Jinshan Pan, Yang Liu, Deqing Sun, Jimmy Ren, Ming-Ming Cheng, Jian Yang, Jinhui Tang

We present a simple and effective image super-resolution algorithm that imposes an image formation constraint on the deep neural networks via pixel substitution.

Image Super-Resolution

Cross-X Learning for Fine-Grained Visual Categorization

no code implementations ICCV 2019 Wei Luo, Xitong Yang, Xianjie Mo, Yuheng Lu, Larry S. Davis, Jun Li, Jian Yang, Ser-Nam Lim

Recognizing objects from subcategories with very subtle differences remains a challenging task due to the large intra-class and small inter-class variation.

Ranked #18 on Fine-Grained Image Classification on NABirds (using extra training data)

Fine-Grained Image Classification Fine-Grained Visual Categorization

Multi-scale Dynamic Feature Encoding Network for Image Demoireing

1 code implementation26 Sep 2019 Xi Cheng, Zhen-Yong Fu, Jian Yang

The prevalence of digital sensors, such as digital cameras and mobile phones, simplifies the acquisition of photos.

Image Restoration

Hyperspectral Image Classification With Context-Aware Dynamic Graph Convolutional Network

no code implementations26 Sep 2019 Sheng Wan, Chen Gong, Ping Zhong, Shirui Pan, Guangyu Li, Jian Yang

In hyperspectral image (HSI) classification, spatial context has demonstrated its significance in achieving promising performance.

Classification General Classification +1

Low-Resource Response Generation with Template Prior

1 code implementation IJCNLP 2019 Ze Yang, Wei Wu, Jian Yang, Can Xu, Zhoujun Li

Since the paired data now is no longer enough to train a neural generation model, we consider leveraging the large scale of unpaired data that are much easier to obtain, and propose response generation with both paired and unpaired data.

Response Generation

Trident Segmentation CNN: A Spatiotemporal Transformation CNN for Punctate White Matter Lesions Segmentation in Preterm Neonates

1 code implementation22 Oct 2019 Yalong Liu, Jie Li, Miaomiao Wang, Zhicheng Jiao, Jian Yang, Xianjun Li

In this paper, a novel spatiotemporal transformation deep learning method called Trident Segmentation CNN (TS-CNN) is proposed to segment PWML in MR images.

Segmentation Specificity

Dual-Attention Graph Convolutional Network

no code implementations28 Nov 2019 Xueya Zhang, Tong Zhang, Wenting Zhao, Zhen Cui, Jian Yang

Graph convolutional networks (GCNs) have shown the powerful ability in text structure representation and effectively facilitate the task of text classification.

text-classification Text Classification

Curvilinear Distance Metric Learning

1 code implementation NeurIPS 2019 Shuo Chen, Lei Luo, Jian Yang, Chen Gong, Jun Li, Heng Huang

To address this issue, we first reveal that the traditional linear distance metric is equivalent to the cumulative arc length between the data pair's nearest points on the learned straight measurer lines.

Metric Learning

LiDAR Iris for Loop-Closure Detection

no code implementations9 Dec 2019 Ying Wang, Zezhou Sun, Cheng-Zhong Xu, Sanjay Sarma, Jian Yang, Hui Kong

In this paper, a global descriptor for a LiDAR point cloud, called LiDAR Iris, is proposed for fast and accurate loop-closure detection.

Loop Closure Detection

Graph Inference Learning for Semi-supervised Classification

no code implementations ICLR 2020 Chunyan Xu, Zhen Cui, Xiaobin Hong, Tong Zhang, Jian Yang, Wei Liu

In this work, we address semi-supervised classification of graph data, where the categories of those unlabeled nodes are inferred from labeled nodes as well as graph structures.

Classification General Classification +1

Network Cooperation with Progressive Disambiguation for Partial Label Learning

no code implementations22 Feb 2020 Yao Yao, Chen Gong, Jiehui Deng, Jian Yang

Partial Label Learning (PLL) aims to train a classifier when each training instance is associated with a set of candidate labels, among which only one is correct but is not accessible during the training phase.

Partial Label Learning

Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized Neural Networks

no code implementations4 Mar 2020 Jun Chen, Yong liu, Hao Zhang, Shengnan Hou, Jian Yang

Meanwhile, we propose a M-bit Inputs and N-bit Weights Network (MINW-Net) trained by AQE, a quantized neural network with 1-3 bits weights and activations.

Detecting Replay Attacks Using Multi-Channel Audio: A Neural Network-Based Method

2 code implementations18 Mar 2020 Yuan Gong, Jian Yang, Christian Poellabauer

With the rapidly growing number of security-sensitive systems that use voice as the primary input, it becomes increasingly important to address these systems' potential vulnerability to replay attacks.

Deep Learning for Community Detection: Progress, Challenges and Opportunities

1 code implementation17 May 2020 Fanzhen Liu, Shan Xue, Jia Wu, Chuan Zhou, Wenbin Hu, Cecile Paris, Surya Nepal, Jian Yang, Philip S. Yu

As communities represent similar opinions, similar functions, similar purposes, etc., community detection is an important and extremely useful tool in both scientific inquiry and data analytics.

Clustering Community Detection +1

Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection

7 code implementations NeurIPS 2020 Xiang Li, Wenhai Wang, Lijun Wu, Shuo Chen, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang

Specifically, we merge the quality estimation into the class prediction vector to form a joint representation of localization quality and classification, and use a vector to represent arbitrary distribution of box locations.

Dense Object Detection General Classification

Learning the Redundancy-free Features for Generalized Zero-Shot Object Recognition

no code implementations CVPR 2020 Zongyan Han, Zhen-Yong Fu, Jian Yang

Zero-shot object recognition or zero-shot learning aims to transfer the object recognition ability among the semantically related categories, such as fine-grained animal or bird species.

Generalized Zero-Shot Learning Object +1

Improving Neural Machine Translation with Soft Template Prediction

no code implementations ACL 2020 Jian Yang, Shuming Ma, Dong-dong Zhang, Zhoujun Li, Ming Zhou

Although neural machine translation (NMT) has achieved significant progress in recent years, most previous NMT models only depend on the source text to generate translation.

Machine Translation NMT +1

Progressive Point Cloud Deconvolution Generation Network

1 code implementation ECCV 2020 Le Hui, Rui Xu, Jin Xie, Jianjun Qian, Jian Yang

Starting from the low-resolution point clouds, with the bilateral interpolation and max-pooling operations, the deconvolution network can progressively output high-resolution local and global feature maps.

Point Cloud Generation

Graph Wasserstein Correlation Analysis for Movie Retrieval

no code implementations ECCV 2020 Xueya Zhang, Tong Zhang, Xiaobin Hong, Zhen Cui, Jian Yang

Spectral graph filtering is introduced to encode graph signals, which are then embedded as probability distributions in a Wasserstein space, called graph Wasserstein metric learning.

Metric Learning Retrieval

S2OSC: A Holistic Semi-Supervised Approach for Open Set Classification

no code implementations11 Aug 2020 Yang Yang, Zhen-Qiang Sun, Hui Xiong, Jian Yang

Open set classification (OSC) tackles the problem of determining whether the data are in-class or out-of-class during inference, when only provided with a set of in-class examples at training time.

General Classification Knowledge Distillation +1

Instance-Aware Graph Convolutional Network for Multi-Label Classification

no code implementations19 Aug 2020 Yun Wang, Tong Zhang, Zhen Cui, Chunyan Xu, Jian Yang

For label diffusion of instance-awareness in graph convolution, rather than using the statistical label correlation alone, an image-dependent label correlation matrix (LCM), fusing both the statistical LCM and an individual one of each image instance, is constructed for graph inference on labels to inject adaptive information of label-awareness into the learned features of the model.

Classification General Classification +2

Localizing Anomalies from Weakly-Labeled Videos

1 code implementation20 Aug 2020 Hui Lv, Chuanwei Zhou, Chunyan Xu, Zhen Cui, Jian Yang

In addition, in order to fully utilize the spatial context information, the immediate semantics are directly derived from the segment representations.

Anomaly Detection In Surveillance Videos Video Anomaly Detection

ICS-Assist: Intelligent Customer Inquiry Resolution Recommendation in Online Customer Service for Large E-Commerce Businesses

no code implementations22 Aug 2020 Min Fu, Jiwei Guan, Xi Zheng, Jie zhou, Jianchao Lu, Tianyi Zhang, Shoujie Zhuo, Lijun Zhan, Jian Yang

Existing solution recommendation methods for online customer service are unable to determine the best solutions at runtime, leading to poor satisfaction of end customers.

Learning Adaptive Embedding Considering Incremental Class

1 code implementation31 Aug 2020 Yang Yang, Zhen-Qiang Sun, HengShu Zhu, Yanjie Fu, Hui Xiong, Jian Yang

To this end, we propose a Class-Incremental Learning without Forgetting (CILF) framework, which aims to learn adaptive embedding for processing novel class detection and model update in a unified framework.

Class Incremental Learning Clustering +1

Spatial Transformer Point Convolution

no code implementations3 Sep 2020 Yuan Fang, Chunyan Xu, Zhen Cui, Yuan Zong, Jian Yang

In this paper, we propose a spatial transformer point convolution (STPC) method to achieve anisotropic convolution filtering on point clouds.

Dictionary Learning Semantic Segmentation

Frontier Detection and Reachability Analysis for Efficient 2D Graph-SLAM Based Active Exploration

1 code implementation7 Sep 2020 Zezhou Sun, Banghe Wu, Cheng-Zhong Xu, Sanjay E. Sarma, Jian Yang, Hui Kong

We propose an integrated approach to active exploration by exploiting the Cartographer method as the base SLAM module for submap creation and performing efficient frontier detection in the geometrically co-aligned submaps induced by graph optimization.

Contrastive and Generative Graph Convolutional Networks for Graph-based Semi-Supervised Learning

no code implementations15 Sep 2020 Sheng Wan, Shirui Pan, Jian Yang, Chen Gong

Graph-based Semi-Supervised Learning (SSL) aims to transfer the labels of a handful of labeled data to the remaining massive unlabeled data via a graph.

Multi-Level Graph Convolutional Network with Automatic Graph Learning for Hyperspectral Image Classification

no code implementations19 Sep 2020 Sheng Wan, Chen Gong, Shirui Pan, Jie Yang, Jian Yang

Nowadays, deep learning methods, especially the Graph Convolutional Network (GCN), have shown impressive performance in hyperspectral image (HSI) classification.

General Classification graph construction +2

Interest-Behaviour Multiplicative Network for Resource-limited Recommendation

no code implementations24 Sep 2020 Qianliang Wu, Tong Zhang, Zhen Cui, Jian Yang

In this paper, we aim to mine the cue of user preferences in resource-limited recommendation tasks, for which purpose we specifically build a large used car transaction dataset possessing resource-limitation characteristics.

Progressive Training of Multi-level Wavelet Residual Networks for Image Denoising

2 code implementations23 Oct 2020 Yali Peng, Yue Cao, Shigang Liu, Jian Yang, WangMeng Zuo

To cope with this issue, this paper presents a multi-level wavelet residual network (MWRN) architecture as well as a progressive training (PTMWRN) scheme to improve image denoising performance.

Image Denoising

Aspect Based Sentiment Analysis with Self-Attention and Gated Convolutional Networks

no code implementations4 Nov 2020 Jian Yang, Juan Yang

Therefore, to solve the problems above, we build a new model based on gating mechanism, combined with convolutional neural networks (CNN) and self-attention mechanism.

Aspect-Based Sentiment Analysis Aspect Category Sentiment Analysis +1

They are Not Completely Useless: Towards Recycling Transferable Unlabeled Data for Class-Mismatched Semi-Supervised Learning

no code implementations27 Nov 2020 Zhuo Huang, Ying Tai, Chengjie Wang, Jian Yang, Chen Gong

Semi-Supervised Learning (SSL) with mismatched classes deals with the problem that the classes-of-interests in the limited labeled data is only a subset of the classes in massive unlabeled data.

Domain Adaptation

Globally Optimal Relative Pose Estimation with Gravity Prior

no code implementations CVPR 2021 Yaqing Ding, Daniel Barath, Jian Yang, Hui Kong, Zuzana Kukelova

Smartphones, tablets and camera systems used, e. g., in cars and UAVs, are typically equipped with IMUs (inertial measurement units) that can measure the gravity vector accurately.

Pose Estimation

Pyramid Point Cloud Transformer for Large-Scale Place Recognition

1 code implementation ICCV 2021 Le Hui, Hang Yang, Mingmei Cheng, Jin Xie, Jian Yang

In order to obtain discriminative global descriptors, we construct a pyramid VLAD module to aggregate the multi-scale feature maps of point clouds into the global descriptors.

3D Place Recognition Point Cloud Retrieval +1

Wasserstein Coupled Graph Learning for Cross-Modal Retrieval

no code implementations ICCV 2021 Yun Wang, Tong Zhang, Xueya Zhang, Zhen Cui, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang

Then, a Wasserstein coupled dictionary, containing multiple pairs of counterpart graph keys with each key corresponding to one modality, is constructed for further feature learning.

Cross-Modal Retrieval Graph Embedding +2

Graph Deformer Network

no code implementations1 Jan 2021 Wenting Zhao, Yuan Fang, Zhen Cui, Tong Zhang, Jian Yang, Wei Liu

In this paper, we propose a simple yet effective graph deformer network (GDN) to fulfill anisotropic convolution filtering on graphs, analogous to the standard convolution operation on images.

Isomorphism Testing

Superpoint Network for Point Cloud Oversegmentation

1 code implementation ICCV 2021 Le Hui, Jia Yuan, Mingmei Cheng, Jin Xie, Xiaoya Zhang, Jian Yang

Specifically, in our clustering network, we first jointly learn a soft point-superpoint association map from the coordinate and feature spaces of point clouds, where each point is assigned to the superpoint with a learned weight.

Clustering Semantic Segmentation

Scribble-Supervised Semantic Segmentation Inference

no code implementations ICCV 2021 Jingshan Xu, Chuanwei Zhou, Zhen Cui, Chunyan Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang

In this paper, we propose a progressive segmentation inference (PSI) framework to tackle with scribble-supervised semantic segmentation.

Segmentation Semantic Segmentation

Efficient 3D Point Cloud Feature Learning for Large-Scale Place Recognition

1 code implementation7 Jan 2021 Le Hui, Mingmei Cheng, Jin Xie, Jian Yang

In this paper, we develop an efficient point cloud learning network (EPC-Net) to form a global descriptor for visual place recognition, which can obtain good performance and reduce computation memory and inference time.

Point Cloud Retrieval Retrieval +1

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

1 code implementation14 Jan 2021 Qizhou Wang, Bo Han, Tongliang Liu, Gang Niu, Jian Yang, Chen Gong

The drastic increase of data quantity often brings the severe decrease of data quality, such as incorrect label annotations, which poses a great challenge for robustly training Deep Neural Networks (DNNs).

Spatial-Temporal Tensor Graph Convolutional Network for Traffic Prediction

no code implementations10 Mar 2021 Xuran Xu, Tong Zhang, Chunyan Xu, Zhen Cui, Jian Yang

We further extend graph convolution into tensor space and propose a tensor graph convolution network to extract more discriminating features from spatial-temporal graph data.

Management Tensor Decomposition +1

MSCFNet: A Lightweight Network With Multi-Scale Context Fusion for Real-Time Semantic Segmentation

no code implementations24 Mar 2021 Guangwei Gao, Guoan Xu, Yi Yu, Jin Xie, Jian Yang, Dong Yue

In recent years, how to strike a good trade-off between accuracy and inference speed has become the core issue for real-time semantic segmentation applications, which plays a vital role in real-world scenarios such as autonomous driving systems and drones.

Autonomous Driving Real-Time Semantic Segmentation +1

JDSR-GAN: Constructing An Efficient Joint Learning Network for Masked Face Super-Resolution

no code implementations25 Mar 2021 Guangwei Gao, Lei Tang, Fei Wu, Huimin Lu, Jian Yang

In this work, we treat the mask occlusion as image noise and construct a joint and collaborative learning network, called JDSR-GAN, for the masked face super-resolution task.

Denoising Super-Resolution

Hierarchical Deep CNN Feature Set-Based Representation Learning for Robust Cross-Resolution Face Recognition

no code implementations25 Mar 2021 Guangwei Gao, Yi Yu, Jian Yang, Guo-Jun Qi, Meng Yang

(i) To learn more robust and discriminative features, we desire to adaptively fuse the contextual features from different layers.

Face Recognition Representation Learning

Contrastive Embedding for Generalized Zero-Shot Learning

3 code implementations CVPR 2021 Zongyan Han, ZhenYong Fu, Shuo Chen, Jian Yang

To tackle this issue, we propose to integrate the generation model with the embedding model, yielding a hybrid GZSL framework.

Generalized Zero-Shot Learning

Learning Normal Dynamics in Videos with Meta Prototype Network

1 code implementation CVPR 2021 Hui Lv, Chen Chen, Zhen Cui, Chunyan Xu, Yong Li, Jian Yang

Frame reconstruction (current or future frame) based on Auto-Encoder (AE) is a popular method for video anomaly detection.

Anomaly Detection Meta-Learning +1

SSPC-Net: Semi-supervised Semantic 3D Point Cloud Segmentation Network

1 code implementation16 Apr 2021 Mingmei Cheng, Le Hui, Jin Xie, Jian Yang

In order to reduce the number of annotated labels, we propose a semi-supervised semantic point cloud segmentation network, named SSPC-Net, where we train the semantic segmentation network by inferring the labels of unlabeled points from the few annotated 3D points.

Point Cloud Segmentation Scene Understanding +2

Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks

1 code implementation3 May 2021 Haobo Jiang, Jin Xie, Jian Yang

Finally, we use the maximum value in the second set of estimators to clip the action value of the chosen action in the first set of estimators and the clipped value is used for approximating the maximum expected action value.

Q-Learning

DONet: Dual-Octave Network for Fast MR Image Reconstruction

no code implementations12 May 2021 Chun-Mei Feng, Zhanyuan Yang, Huazhu Fu, Yong Xu, Jian Yang, Ling Shao

In this paper, we propose the Dual-Octave Network (DONet), which is capable of learning multi-scale spatial-frequency features from both the real and imaginary components of MR data, for fast parallel MR image reconstruction.

Image Reconstruction

A Comprehensive Survey on Community Detection with Deep Learning

no code implementations26 May 2021 Xing Su, Shan Xue, Fanzhen Liu, Jia Wu, Jian Yang, Chuan Zhou, Wenbin Hu, Cecile Paris, Surya Nepal, Di Jin, Quan Z. Sheng, Philip S. Yu

A community reveals the features and connections of its members that are different from those in other communities in a network.

Clustering Community Detection +3

Robotic Brain Storm Optimization: A Multi-target Collaborative Searching Paradigm for Swarm Robotics

no code implementations27 May 2021 Jian Yang, Yuhui Shi

Swarm intelligence optimization algorithms can be adopted in swarm robotics for target searching tasks in a 2-D or 3-D space by treating the target signal strength as fitness values.

Clustering

Attention-oriented Brain Storm Optimization for Multimodal Optimization Problems

1 code implementation27 May 2021 Jian Yang, Yuhui Shi

Rather than converge to a single global optimum, the proposed method can guide the search procedure to converge to multiple "salient" solutions.

Clustering

Smart-Start Decoding for Neural Machine Translation

no code implementations NAACL 2021 Jian Yang, Shuming Ma, Dongdong Zhang, Juncheng Wan, Zhoujun Li, Ming Zhou

Most current neural machine translation models adopt a monotonic decoding order of either left-to-right or right-to-left.

Machine Translation Translation

dFDA-VeD: A Dynamic Future Demand Aware Vehicle Dispatching System

no code implementations10 Jun 2021 Yang Guo, Tarique Anwar, Jian Yang, Jia Wu

As the process should be socially and economically profitable, the task of vehicle dispatching is highly challenging, specially due to the time-varying travel demands and traffic conditions.

A Comprehensive Survey on Graph Anomaly Detection with Deep Learning

1 code implementation14 Jun 2021 Xiaoxiao Ma, Jia Wu, Shan Xue, Jian Yang, Chuan Zhou, Quan Z. Sheng, Hui Xiong, Leman Akoglu

In this survey, we aim to provide a systematic and comprehensive review of the contemporary deep learning techniques for graph anomaly detection.

Graph Anomaly Detection

Graph Jigsaw Learning for Cartoon Face Recognition

1 code implementation14 Jul 2021 Yong Li, Lingjie Lao, Zhen Cui, Shiguang Shan, Jian Yang

To mitigate this issue, we propose the GraphJigsaw that constructs jigsaw puzzles at various stages in the classification network and solves the puzzles with the graph convolutional network (GCN) in a progressive manner.

Classification Face Recognition

RigNet: Repetitive Image Guided Network for Depth Completion

no code implementations29 Jul 2021 Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang

However, blurry guidance in the image and unclear structure in the depth still impede the performance of the image guided frameworks.

Depth Completion Depth Estimation +1

Multilingual Agreement for Multilingual Neural Machine Translation

no code implementations ACL 2021 Jian Yang, Yuwei Yin, Shuming Ma, Haoyang Huang, Dongdong Zhang, Zhoujun Li, Furu Wei

Although multilingual neural machine translation (MNMT) enables multiple language translations, the training process is based on independent multilingual objectives.

Machine Translation Translation

Planning with Learned Dynamic Model for Unsupervised Point Cloud Registration

no code implementations5 Aug 2021 Haobo Jiang, Jin Xie, Jianjun Qian, Jian Yang

By modeling the point cloud registration process as a Markov decision process (MDP), we develop a latent dynamic model of point clouds, consisting of a transformation network and evaluation network.

Point Cloud Registration

Learning Fair Face Representation With Progressive Cross Transformer

no code implementations11 Aug 2021 Yong Li, Yufei Sun, Zhen Cui, Shiguang Shan, Jian Yang

To mitigate racial bias and meantime preserve robust FR, we abstract face identity-related representation as a signal denoising problem and propose a progressive cross transformer (PCT) method for fair face recognition.

Denoising Face Recognition

FBSNet: A Fast Bilateral Symmetrical Network for Real-Time Semantic Segmentation

1 code implementation2 Sep 2021 Guangwei Gao, Guoan Xu, Juncheng Li, Yi Yu, Huimin Lu, Jian Yang

Specifically, FBSNet employs a symmetrical encoder-decoder structure with two branches, semantic information branch and spatial detail branch.

Autonomous Driving Drone navigation +1

Sampling Network Guided Cross-Entropy Method for Unsupervised Point Cloud Registration

1 code implementation ICCV 2021 Haobo Jiang, Yaqi Shen, Jin Xie, Jun Li, Jianjun Qian, Jian Yang

Based on the reward function, for each state, we then construct a fused score function to evaluate the sampled transformations, where we weight the current and future rewards of the transformations.

Point Cloud Registration

Student Helping Teacher: Teacher Evolution via Self-Knowledge Distillation

no code implementations1 Oct 2021 Zheng Li, Xiang Li, Lingfeng Yang, Jian Yang, Zhigeng Pan

Knowledge distillation usually transfers the knowledge from a pre-trained cumbersome teacher network to a compact student network, which follows the classical teacher-teaching-student paradigm.

Self-Knowledge Distillation

A Survey of Knowledge Enhanced Pre-trained Models

no code implementations1 Oct 2021 Jian Yang, Xinyu Hu, Gang Xiao, Yulong Shen

Pre-trained language models learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of natural language processing (NLP) after fine-tuning.

Logical Reasoning Representation Learning +1

Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning

no code implementations22 Oct 2021 Yang Yang, Hongchen Wei, HengShu Zhu, dianhai yu, Hui Xiong, Jian Yang

In detail, considering that the heterogeneous gap between modalities always leads to the supervision difficulty of using the global embedding directly, CPRC turns to transform both the raw image and corresponding generated sentence into the shared semantic space, and measure the generated sentence from two aspects: 1) Prediction consistency.

Image Captioning Informativeness +2

Neural BRDFs: Representation and Operations

no code implementations6 Nov 2021 Jiahui Fan, Beibei Wang, Miloš Hašan, Jian Yang, Ling-Qi Yan

Bidirectional reflectance distribution functions (BRDFs) are pervasively used in computer graphics to produce realistic physically-based appearance.

3D Siamese Voxel-to-BEV Tracker for Sparse Point Clouds

1 code implementation NeurIPS 2021 Le Hui, Lingpeng Wang, Mingmei Cheng, Jin Xie, Jian Yang

The Siamese shape-aware feature learning network can capture 3D shape information of the object to learn the discriminative features of the object so that the potential target from the background in sparse point clouds can be identified.

3D Object Tracking Object Tracking

Fine-Grained Image Analysis with Deep Learning: A Survey

no code implementations11 Nov 2021 Xiu-Shen Wei, Yi-Zhe Song, Oisin Mac Aodha, Jianxin Wu, Yuxin Peng, Jinhui Tang, Jian Yang, Serge Belongie

Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, and underpins a diverse set of real-world applications.

Fine-Grained Image Recognition Image Retrieval +1

Keypoint Message Passing for Video-based Person Re-Identification

1 code implementation16 Nov 2021 Di Chen, Andreas Doering, Shanshan Zhang, Jian Yang, Juergen Gall, Bernt Schiele

Video-based person re-identification (re-ID) is an important technique in visual surveillance systems which aims to match video snippets of people captured by different cameras.

Representation Learning Video-Based Person Re-Identification

Fast and Light-Weight Network for Single Frame Structured Illumination Microscopy Super-Resolution

no code implementations17 Nov 2021 Xi Cheng, Jun Li, Qiang Dai, ZhenYong Fu, Jian Yang

In our SF-SIM, we propose a noise estimator which can effectively suppress the noise in the image and enable our method to work under the low light and short exposure environment, without the need for stacking multiple frames for non-local denoising.

Denoising Super-Resolution

A$^2$-Net: Learning Attribute-Aware Hash Codes for Large-Scale Fine-Grained Image Retrieval

1 code implementation NeurIPS 2021 Xiu-Shen Wei, Yang shen, Xuhao Sun, Han-Jia Ye, Jian Yang

Specifically, based on the captured visual representations by attention, we develop an encoder-decoder structure network of a reconstruction task to unsupervisedly distill high-level attribute-specific vectors from the appearance-specific visual representations without attribute annotations.

Attribute Image Retrieval +1

Learning to Adapt via Latent Domains for Adaptive Semantic Segmentation

no code implementations NeurIPS 2021 Yunan Liu, Shanshan Zhang, Yang Li, Jian Yang

In this setting, we embed an additional pair of “latent-latent” to reduce the domain gap between the source and different latent domains, allowing the model to adapt well on multiple target domains simultaneously.

Domain Adaptation Meta-Learning +1

Universal Semi-Supervised Learning

no code implementations NeurIPS 2021 Zhuo Huang, Chao Xue, Bo Han, Jian Yang, Chen Gong

Universal Semi-Supervised Learning (UniSSL) aims to solve the open-set problem where both the class distribution (i. e., class set) and feature distribution (i. e., feature domain) are different between labeled dataset and unlabeled dataset.

Domain Adaptation

TransLog: A Unified Transformer-based Framework for Log Anomaly Detection

no code implementations31 Dec 2021 Hongcheng Guo, Xingyu Lin, Jian Yang, Yi Zhuang, Jiaqi Bai, Tieqiao Zheng, Bo Zhang, Zhoujun Li

Therefore, we propose a unified Transformer-based framework for log anomaly detection (\ourmethod{}), which is comprised of the pretraining and adapter-based tuning stage.

Anomaly Detection

Relative Pose From a Calibrated and an Uncalibrated Smartphone Image

no code implementations CVPR 2022 Yaqing Ding, Daniel Barath, Jian Yang, Zuzana Kukelova

In this paper, we propose a new minimal and a non-minimal solver for estimating the relative camera pose together with the unknown focal length of the second camera.

CVNet: Contour Vibration Network for Building Extraction

1 code implementation CVPR 2022 Ziqiang Xu, Chunyan Xu, Zhen Cui, Xiangwei Zheng, Jian Yang

The classic active contour model raises a great promising solution to polygon-based object extraction with the progress of deep learning recently.

Model Optimization

A Proposal-Based Paradigm for Self-Supervised Sound Source Localization in Videos

no code implementations CVPR 2022 Hanyu Xuan, Zhiliang Wu, Jian Yang, Yan Yan, Xavier Alameda-Pineda

Humans can easily recognize where and how the sound is produced via watching a scene and listening to corresponding audio cues.

Multiple Instance Learning

SMDT: Selective Memory-Augmented Neural Document Translation

no code implementations5 Jan 2022 Xu Zhang, Jian Yang, Haoyang Huang, Shuming Ma, Dongdong Zhang, Jinlong Li, Furu Wei

Existing document-level neural machine translation (NMT) models have sufficiently explored different context settings to provide guidance for target generation.

Document Level Machine Translation Document Translation +4

Synthesizing Tensor Transformations for Visual Self-attention

no code implementations5 Jan 2022 Xian Wei, Xihao Wang, Hai Lan, JiaMing Lei, Yanhui Huang, Hui Yu, Jian Yang

Self-attention shows outstanding competence in capturing long-range relationships while enhancing performance on vision tasks, such as image classification and image captioning.

Image Captioning Image Classification

PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation

no code implementations COLING 2022 Juncheng Wan, Jian Yang, Shuming Ma, Dongdong Zhang, Weinan Zhang, Yong Yu, Zhoujun Li

While end-to-end neural machine translation (NMT) has achieved impressive progress, noisy input usually leads models to become fragile and unstable.

Machine Translation NMT +1

Webly-Supervised Fine-Grained Recognition with Partial Label Learning

1 code implementation IJCAI 2022 Yu-Yan Xu, Yang shen, Xiu-Shen Wei, Jian Yang

The task of webly-supervised fne-grained recognition is to boost recognition accuracy of classifying subordinate categories (e. g., different bird species)by utilizing freely available but noisy web data. As the label noises signifcantly hurt the network training, it is desirable to distinguish and eliminate noisy images.

Partial Label Learning

Reliable Inlier Evaluation for Unsupervised Point Cloud Registration

1 code implementation23 Feb 2022 Yaqi Shen, Le Hui, Haobo Jiang, Jin Xie, Jian Yang

In this paper, we propose a neighborhood consensus based reliable inlier evaluation method for robust unsupervised point cloud registration.

Model Optimization Point Cloud Registration

Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information

1 code implementation CVPR 2022 Lingfeng Yang, Xiang Li, RenJie Song, Borui Zhao, Juntian Tao, Shihao Zhou, Jiajun Liang, Jian Yang

Therefore, it is helpful to leverage additional information, e. g., the locations and dates for data shooting, which can be easily accessible but rarely exploited.

Fine-Grained Image Classification

RecursiveMix: Mixed Learning with History

1 code implementation14 Mar 2022 Lingfeng Yang, Xiang Li, Borui Zhao, RenJie Song, Jian Yang

In semantic segmentation, RM also surpasses the baseline and CutMix by 1. 9 and 1. 1 mIoU points under UperNet on ADE20K, respectively.

object-detection Object Detection +1

Multi-Modal Masked Pre-Training for Monocular Panoramic Depth Completion

no code implementations18 Mar 2022 Zhiqiang Yan, Xiang Li, Kun Wang, Zhenyu Zhang, Jun Li, Jian Yang

To deal with the PDC task, we train a deep network that takes both depth and image as inputs for the dense panoramic depth recovery.

Depth Completion Transfer Learning

Action Candidate Driven Clipped Double Q-learning for Discrete and Continuous Action Tasks

1 code implementation22 Mar 2022 Haobo Jiang, Jin Xie, Jian Yang

Finally, we use the maximum value in the second set of estimators to clip the action value of the chosen action in the first set of estimators and the clipped value is used for approximating the maximum expected action value.

Q-Learning

Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation

1 code implementation CVPR 2022 Jinchao Yang, Fei Guo, Shuo Chen, Jun Li, Jian Yang

Given a source product, a target product, and an art style image, our method produces a neural warping field that warps the source shape to imitate the geometric style of the target and a neural texture transformation network that transfers the artistic style to the warped source product.

Style Transfer

OTFace: Hard Samples Guided Optimal Transport Loss for Deep Face Representation

no code implementations28 Mar 2022 Jianjun Qian, Shumin Zhu, Chaoyu Zhao, Jian Yang, Wai Keung Wong

To this end, some deep convolutional neural networks (CNNs) have been developed to learn discriminative feature by designing properly margin-based losses, which perform well on easy samples but fail on hard samples.

Towards Explainable Meta-Learning for DDoS Detection

no code implementations5 Apr 2022 Qianru Zhou, Rongzhen Li, Lei Xu, Arumugam Nallanathan, Jian Yang, Anmin Fu

With the ever increasing of new intrusions, intrusion detection task rely on Artificial Intelligence more and more.

Intrusion Detection Meta-Learning

Semantics-Guided Moving Object Segmentation with 3D LiDAR

no code implementations6 May 2022 Shuo Gu, Suling Yao, Jian Yang, Hui Kong

Instead of segmenting the moving objects directly, the network conducts single-scan-based semantic segmentation and multiple-scan-based moving object segmentation in turn.

Object Segmentation +1

Hyperspectral Image Classification With Contrastive Graph Convolutional Network

no code implementations11 May 2022 Wentao Yu, Sheng Wan, Guangyu Li, Jian Yang, Chen Gong

To enhance the feature representation ability, in this paper, a GCN model with contrastive learning is proposed to explore the supervision signals contained in both spectral information and spatial relations, which is termed Contrastive Graph Convolutional Network (ConGCN), for HSI classification.

Classification Contrastive Learning +2

Bi-level Alignment for Cross-Domain Crowd Counting

1 code implementation CVPR 2022 Shenjian Gong, Shanshan Zhang, Jian Yang, Dengxin Dai, Bernt Schiele

The main challenge for this task is to achieve high-quality manual annotations on a large amount of training data.

AutoML Crowd Counting +2

Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality

1 code implementation20 May 2022 Xiang Li, Wenhai Wang, Lingfeng Yang, Jian Yang

Masked AutoEncoder (MAE) has recently led the trends of visual self-supervision area by an elegant asymmetric encoder-decoder design, which significantly optimizes both the pre-training efficiency and fine-tuning accuracy.

Object Detection

Graph-level Neural Networks: Current Progress and Future Directions

no code implementations31 May 2022 Ge Zhang, Jia Wu, Jian Yang, Shan Xue, Wenbin Hu, Chuan Zhou, Hao Peng, Quan Z. Sheng, Charu Aggarwal

To frame this survey, we propose a systematic taxonomy covering GLNNs upon deep neural networks, graph neural networks, and graph pooling.

Towards Harnessing Feature Embedding for Robust Learning with Noisy Labels

no code implementations27 Jun 2022 Chuang Zhang, Li Shen, Jian Yang, Chen Gong

To exploit this effect, the model prediction-based methods have been widely adopted, which aim to exploit the outputs of DNNs in the early stage of learning to correct noisy labels.

Learning with noisy labels Memorization

Cross-receptive Focused Inference Network for Lightweight Image Super-Resolution

1 code implementation6 Jul 2022 Wenjie Li, Juncheng Li, Guangwei Gao, Jiantao Zhou, Jian Yang, Guo-Jun Qi

Recently, Transformer-based methods have shown impressive performance in single image super-resolution (SISR) tasks due to the ability of global feature extraction.

Image Super-Resolution

GCN-based Multi-task Representation Learning for Anomaly Detection in Attributed Networks

no code implementations8 Jul 2022 Venus Haghighi, Behnaz Soltani, Adnan Mahmood, Quan Z. Sheng, Jian Yang

Anomaly detection in attributed networks has received a considerable attention in recent years due to its applications in a wide range of domains such as finance, network security, and medicine.

Anomaly Detection Community Detection +2

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation

1 code implementation11 Jul 2022 Jian Yang, Yuwei Yin, Shuming Ma, Dongdong Zhang, Zhoujun Li, Furu Wei

Nonetheless, multilingual training is plagued by language interference degeneration in shared parameters because of the negative interference among different translation directions, especially on high-resource languages.

Machine Translation Translation

3D Siamese Transformer Network for Single Object Tracking on Point Clouds

1 code implementation25 Jul 2022 Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang

Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area.

3D Single Object Tracking Object Tracking

RA-Depth: Resolution Adaptive Self-Supervised Monocular Depth Estimation

1 code implementation25 Jul 2022 Mu He, Le Hui, Yikai Bian, Jian Ren, Jin Xie, Jian Yang

In this paper, we propose a resolution adaptive self-supervised monocular depth estimation method (RA-Depth) by learning the scale invariance of the scene depth.

Data Augmentation Monocular Depth Estimation

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation

1 code implementation29 Jul 2022 Jian Yang, Yuwei Yin, Liqun Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Furu Wei, Zhoujun Li

Transformer structure, stacked by a sequence of encoder and decoder network layers, achieves significant development in neural machine translation.

Machine Translation Translation

LogLG: Weakly Supervised Log Anomaly Detection via Log-Event Graph Construction

no code implementations23 Aug 2022 Hongcheng Guo, Yuhui Guo, Renjie Chen, Jian Yang, Jiaheng Liu, Zhoujun Li, Tieqiao Zheng, Weichao Hou, Liangfan Zheng, Bo Zhang

Experiments on five benchmarks validate the effectiveness of LogLG for detecting anomalies on unlabeled log data and demonstrate that LogLG, as the state-of-the-art weakly supervised method, achieves significant performance improvements compared to existing methods.

Anomaly Detection graph construction +1

Point Cloud Registration-Driven Robust Feature Matching for 3D Siamese Object Tracking

no code implementations14 Sep 2022 Haobo Jiang, Kaihao Lan, Le Hui, Guangyu Li, Jin Xie, Jian Yang

The core of Siamese feature matching is how to assign high feature similarity on the corresponding points between the template and search area for precise object localization.

Object Localization Object Tracking +1

Grouped Adaptive Loss Weighting for Person Search

no code implementations23 Sep 2022 Yanling Tian, Di Chen, Yunan Liu, Shanshan Zhang, Jian Yang

A straightforward solution is to manually assign different weights to different tasks, compensating for the diverse convergence rates.

Model Optimization Multi-Task Learning +2

Spatio-Temporal Relation Learning for Video Anomaly Detection

no code implementations27 Sep 2022 Hui Lv, Zhen Cui, Biao Wang, Jian Yang

Anomaly identification is highly dependent on the relationship between the object and the scene, as different/same object actions in same/different scenes may lead to various degrees of normality and anomaly.

Anomaly Detection Knowledge Graph Embedding +5

SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval

4 code implementations28 Sep 2022 Yang shen, Xuhao Sun, Xiu-Shen Wei, Qing-Yuan Jiang, Jian Yang

In this paper, we propose Suppression-Enhancing Mask based attention and Interactive Channel transformatiON (SEMICON) to learn binary hash codes for dealing with large-scale fine-grained image retrieval tasks.

Image Retrieval Retrieval

DAGAD: Data Augmentation for Graph Anomaly Detection

1 code implementation18 Oct 2022 Fanzhen Liu, Xiaoxiao Ma, Jia Wu, Jian Yang, Shan Xue, Amin Beheshti, Chuan Zhou, Hao Peng, Quan Z. Sheng, Charu C. Aggarwal

To bridge the gaps, this paper devises a novel Data Augmentation-based Graph Anomaly Detection (DAGAD) framework for attributed graphs, equipped with three specially designed modules: 1) an information fusion module employing graph neural network encoders to learn representations, 2) a graph data augmentation module that fertilizes the training set with generated samples, and 3) an imbalance-tailored learning module to discriminate the distributions of the minority (anomalous) and majority (normal) classes.

Data Augmentation Graph Anomaly Detection

LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation

no code implementations19 Oct 2022 Hongcheng Guo, Jiaheng Liu, Haoyang Huang, Jian Yang, Zhoujun Li, Dongdong Zhang, Zheng Cui, Furu Wei

To this end, we first propose the Multilingual MMT task by establishing two new Multilingual MMT benchmark datasets covering seven languages.

Multimodal Machine Translation Translation

DesNet: Decomposed Scale-Consistent Network for Unsupervised Depth Completion

no code implementations20 Nov 2022 Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang

Unsupervised depth completion aims to recover dense depth from the sparse one without using the ground-truth annotation.

Depth Completion Depth Estimation +2

Curriculum Temperature for Knowledge Distillation

1 code implementation29 Nov 2022 Zheng Li, Xiang Li, Lingfeng Yang, Borui Zhao, RenJie Song, Lei Luo, Jun Li, Jian Yang

In this paper, we propose a simple curriculum-based technique, termed Curriculum Temperature for Knowledge Distillation (CTKD), which controls the task difficulty level during the student's learning career through a dynamic and learnable temperature.

Image Classification Knowledge Distillation

Feature Aggregation and Propagation Network for Camouflaged Object Detection

1 code implementation2 Dec 2022 Tao Zhou, Yi Zhou, Chen Gong, Jian Yang, Yu Zhang

In this paper, we propose a novel Feature Aggregation and Propagation Network (FAP-Net) for camouflaged object detection.

Object object-detection +1

One-Stage Cascade Refinement Networks for Infrared Small Target Detection

1 code implementation16 Dec 2022 Yimian Dai, Xiang Li, Fei Zhou, Yulei Qian, Yaohong Chen, Jian Yang

Finally, we present a new research benchmark for infrared small target detection, consisting of the SIRST-V2 dataset of real-world, high-resolution single-frame targets, the normalized contrast evaluation metric, and the DeepInfrared toolkit for detection.

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator

1 code implementation20 Dec 2022 Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Furu Wei, Zhoujun Li

Inspired by the idea of Generative Adversarial Networks (GANs), we propose a GAN-style model for encoder-decoder pre-training by introducing an auxiliary discriminator, unifying the ability of language understanding and generation in a single model.

Denoising Sentence +1

Mining User-aware Multi-relations for Fake News Detection in Large Scale Online Social Networks

1 code implementation21 Dec 2022 Xing Su, Jian Yang, Jia Wu, Yuchen Zhang

In this paper, we construct a dual-layer graph (i. e., the news layer and the user layer) to extract multiple relations of news and users in social networks to derive rich information for detecting fake news.

Fake News Detection

Demystifying Advertising Campaign Bid Recommendation: A Constraint target CPA Goal Optimization

no code implementations26 Dec 2022 Deguang Kong, Konstantin Shmakov, Jian Yang

In cost-per-click (CPC) or cost-per-impression (CPM) advertising campaigns, advertisers always run the risk of spending the budget without getting enough conversions.

Do not Waste Money on Advertising Spend: Bid Recommendation via Concavity Changes

no code implementations26 Dec 2022 Deguang Kong, Konstantin Shmakov, Jian Yang

In computational advertising, a challenging problem is how to recommend the bid for advertisers to achieve the best return on investment (ROI) given budget constraint.

Robust Consensus Clustering and its Applications for Advertising Forecasting

no code implementations27 Dec 2022 Deguang Kong, Miao Lu, Konstantin Shmakov, Jian Yang

Consensus clustering aggregates partitions in order to find a better fit by reconciling clustering results from different sources/executions.

Clustering

Efficient Image Super-Resolution with Feature Interaction Weighted Hybrid Network

no code implementations29 Dec 2022 Wenjie Li, Juncheng Li, Guangwei Gao, Weihong Deng, Jian Yang, Guo-Jun Qi, Chia-Wen Lin

Recently, great progress has been made in single-image super-resolution (SISR) based on deep learning technology.

Image Super-Resolution

Few-shot Continual Infomax Learning

no code implementations ICCV 2023 Ziqi Gu, Chunyan Xu, Jian Yang, Zhen Cui

Further, considering that the learned knowledge in the human brain is a generalization of actual information and exists in a certain relational structure, we perform continual structure infomax learning to relieve the catastrophic forgetting problem in the continual learning process.

Continual Learning Few-Shot Learning

Center-Based Decoupled Point-cloud Registration for 6D Object Pose Estimation

no code implementations ICCV 2023 Haobo Jiang, Zheng Dang, Shuo Gu, Jin Xie, Mathieu Salzmann, Jian Yang

Our method decouples the translation from the entire transformation by predicting the object center and estimating the rotation in a center-aware manner.

6D Pose Estimation using RGB Object +2

Efficient LiDAR Point Cloud Oversegmentation Network

no code implementations ICCV 2023 Le Hui, Linghua Tang, Yuchao Dai, Jin Xie, Jian Yang

Then, to generate homogeneous superpoints from the sparse LiDAR point cloud, we propose a LiDAR point grouping algorithm that simultaneously considers the similarity of point embeddings and the Euclidean distance of points in 3D space.

LIDAR Semantic Segmentation Semantic Segmentation

Clothed Human Performance Capture With a Double-Layer Neural Radiance Fields

no code implementations CVPR 2023 Kangkan Wang, Guofeng Zhang, Suxu Cong, Jian Yang

Previous methods capture the performance of full humans with a personalized template or recover the garments from a single frame with static human poses.

Revisiting the P3P Problem

1 code implementation CVPR 2023 Yaqing Ding, Jian Yang, Viktor Larsson, Carl Olsson, Kalle Åström

One of the classical multi-view geometry problems is the so called P3P problem, where the absolute pose of a calibrated camera is determined from three 2D-to-3D correspondences.

Multilingual Entity and Relation Extraction from Unified to Language-specific Training

no code implementations11 Jan 2023 Zixiang Wang, Jian Yang, Tongliang Li, Jiaheng Liu, Ying Mo, Jiaqi Bai, Longtao He, Zhoujun Li

In this paper, we propose a two-stage multilingual training method and a joint model called Multilingual Entity and Relation Extraction framework (mERE) to mitigate language interference across languages.

Relation Relation Extraction +1

State of the Art and Potentialities of Graph-level Learning

no code implementations14 Jan 2023 Zhenyu Yang, Ge Zhang, Jia Wu, Jian Yang, Quan Z. Sheng, Shan Xue, Chuan Zhou, Charu Aggarwal, Hao Peng, Wenbin Hu, Edwin Hancock, Pietro Liò

Traditional approaches to learning a set of graphs heavily rely on hand-crafted features, such as substructures.

Graph Learning

Recurrent Structure Attention Guidance for Depth Super-Resolution

no code implementations31 Jan 2023 Jiayi Yuan, Haobo Jiang, Xiang Li, Jianjun Qian, Jun Li, Jian Yang

Second, instead of the coarse concatenation guidance, we propose a recurrent structure attention block, which iteratively utilizes the latest depth estimation and the image features to jointly select clear patterns and boundaries, aiming at providing refined guidance for accurate depth recovery.

Depth Estimation Super-Resolution

Cannot find the paper you are looking for? You can Submit a new open access paper.