Search Results for author: Yang Wang

Found 267 papers, 69 papers with code

Learning a discriminative hidden part model for human action recognition

no code implementations NeurIPS 2008 Yang Wang, Greg Mori

In particular, our experimental results demonstrate that combining large-scale global features and local patch features performs significantly better than directly applying hCRF on local patches alone.

Action Recognition Object +2

Kernel Latent SVM for Visual Recognition

no code implementations NeurIPS 2012 Weilong Yang, Yang Wang, Arash Vahdat, Greg Mori

Latent SVMs (LSVMs) are a class of powerful tools that have been successfully applied to many applications in computer vision.

Invertibility and Robustness of Phaseless Reconstruction

no code implementations21 Aug 2013 Radu Balan, Yang Wang

This paper is concerned with the question of reconstructing a vector in a finite-dimensional real Hilbert space when only the magnitudes of the coefficients of the vector under a redundant linear map are known.

Stable Learning in Coding Space for Multi-Class Decoding and Its Extension for Multi-Class Hypothesis Transfer Learning

no code implementations CVPR 2014 Bang Zhang, Yi Wang, Yang Wang, Fang Chen

Many prevalent multi-class classification approaches can be unified and generalized by the output coding framework which usually consists of three phases: (1) coding, (2) learning binary classifiers, and (3) decoding.

General Classification Multi-class Classification +1

Multiple Authors Detection: A Quantitative Analysis of Dream of the Red Chamber

1 code implementation19 Dec 2014 Xianfeng Hu, Yang Wang, Qiang Wu

Inspired by the authorship controversy of Dream of the Red Chamber and the application of machine learning in the study of literary stylometry, we develop a rigorous new method for the mathematical analysis of authorship by testing for a so-called chrono-divide in writing styles.

Authorship Attribution

A new approach for physiological time series

no code implementations23 Apr 2015 Dong Mao, Yang Wang, Qiang Wu

We developed a new approach for the analysis of physiological time series.

Time Series Time Series Analysis

Weakly Supervised Localization of Novel Objects Using Appearance Transfer

no code implementations CVPR 2015 Mrigank Rochan, Yang Wang

We propose a method for transferring the appearance models of the familiar objects to the unseen object.

Object

rnn : Recurrent Library for Torch

1 code implementation24 Nov 2015 Nicholas Léonard, Sagar Waghmare, Yang Wang, Jin-Hwa Kim

The rnn package provides components for implementing a wide range of Recurrent Neural Networks.

Improving Human Action Recognition by Non-action Classification

no code implementations CVPR 2016 Yang Wang, Minh Hoai

In this paper we consider the task of recognizing human actions in realistic video where human actions are dominated by irrelevant factors.

Action Classification Action Recognition +3

Stochastic Patching Process

no code implementations23 May 2016 Xuhui Fan, Bin Li, Yi Wang, Yang Wang, Fang Chen

Due to constraints of partition strategy, existing models may cause unnecessary dissections in sparse regions when fitting data in dense regions.

Iterative Views Agreement: An Iterative Low-Rank based Structured Optimization Method to Multi-View Spectral Clustering

no code implementations19 Aug 2016 Yang Wang, Wenjie Zhang, Lin Wu, Xuemin Lin, Meng Fang, Shirui Pan

Multi-view spectral clustering, which aims at yielding an agreement or consensus data objects grouping across multi-views with their graph laplacian matrices, is a fundamental clustering problem.

Clustering

PrivLogit: Efficient Privacy-preserving Logistic Regression by Tailoring Numerical Optimizers

no code implementations3 Nov 2016 Wei Xie, Yang Wang, Steven M. Boker, Donald E. Brown

Leveraging this new method, we propose two new secure protocols for conducting logistic regression in a privacy-preserving and distributed manner.

BIG-bench Machine Learning Privacy Preserving +1

Robust Hashing for Multi-View Data: Jointly Learning Low-Rank Kernelized Similarity Consensus and Hash Functions

no code implementations17 Nov 2016 Lin Wu, Yang Wang

To learn robust hash functions, a latent low-rank kernel function is used to construct hash functions in order to accommodate linearly inseparable data.

graph construction

Infinite Hidden Semi-Markov Modulated Interaction Point Process

no code implementations NeurIPS 2016 Matt Zhang, Peng Lin, Ting Guo, Yang Wang, Fang Chen

The proposed approach can simultaneously model both the observations and arrival times of temporal events, and determine the number of latent states from data.

Personalized Video Recommendation Using Rich Contents from Videos

1 code implementation21 Dec 2016 Xingzhong Du, Hongzhi Yin, Ling Chen, Yang Wang, Yi Yang, Xiaofang Zhou

In the existing video recommender systems, the models make the recommendations based on the user-video interactions and single specific content features.

Recommendation Systems

MARTA GANs: Unsupervised Representation Learning for Remote Sensing Image Classification

no code implementations28 Dec 2016 Daoyu Lin, Kun fu, Yang Wang, Guangluan Xu, Xian Sun

With the development of deep learning, supervised learning has frequently been adopted to classify remotely sensed images using convolutional networks (CNNs).

Classification General Classification +3

Effective Multi-Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval

no code implementations18 Jan 2017 Yang Wang, Xuemin Lin, Lin Wu, Wenjie Zhang

Given a query photo issued by a user (q-user), the landmark retrieval is to return a set of photos with their landmarks similar to those of the query, while the existing studies on the landmark retrieval focus on exploiting geometries of landmarks for similarity matches between candidate photos and a query photo.

Collaborative Filtering Retrieval

Structured Deep Hashing with Convolutional Neural Networks for Fast Person Re-identification

no code implementations14 Feb 2017 Lin Wu, Yang Wang

Given a pedestrian image as a query, the purpose of person re-identification is to identify the correct match from a large collection of gallery images depicting the same person captured by disjoint camera views.

Deep Hashing Person Re-Identification

Evolution-Preserving Dense Trajectory Descriptors

no code implementations14 Feb 2017 Yang Wang, Vinh Tran, Minh Hoai

Recently Trajectory-pooled Deep-learning Descriptors were shown to achieve state-of-the-art human action recognition results on a number of datasets.

Action Recognition Temporal Action Localization

Label Refinement Network for Coarse-to-Fine Semantic Segmentation

no code implementations1 Mar 2017 Md Amirul Islam, Shujon Naha, Mrigank Rochan, Neil Bruce, Yang Wang

We propose a novel network architecture called the label refinement network that predicts segmentation labels in a coarse-to-fine fashion at several resolutions.

Image Segmentation Segmentation +1

Finding Modes by Probabilistic Hypergraphs Shifting

no code implementations12 Apr 2017 Yang Wang, Lin Wu

Unlike the existing techniques to seek graph modes by shifting vertices based on pair-wise edges (i. e, an edge with $2$ ends), our paradigm is based on shifting high-order edges (hyperedges) to deliver graph modes.

Clustering Graph Matching

Call Attention to Rumors: Deep Attention Based Recurrent Neural Networks for Early Rumor Detection

no code implementations20 Apr 2017 Tong Chen, Lin Wu, Xue Li, Jun Zhang, Hongzhi Yin, Yang Wang

The proposed model delves soft-attention into the recurrence to simultaneously pool out distinct features with particular focus and produce hidden representations that capture contextual variations of relevant posts over time.

Deep Attention

Flexible and Creative Chinese Poetry Generation Using Neural Memory

no code implementations ACL 2017 Jiyuan Zhang, Yang Feng, Dong Wang, Yang Wang, Andrew Abel, Shiyue Zhang, Andi Zhang

It has been shown that Chinese poems can be successfully generated by sequence-to-sequence neural models, particularly with the attention mechanism.

Unsupervised Learning Layers for Video Analysis

no code implementations24 May 2017 Liang Zhao, Yang Wang, Yi Yang, Wei Xu

This paper presents two unsupervised learning layers (UL layers) for label-free video analysis: one for fully connected layers, and the other for convolutional ones.

Object Localization

Deep Adaptive Feature Embedding with Local Sample Distributions for Person Re-identification

no code implementations10 Jun 2017 Lin Wu, Yang Wang, Junbin Gao, Xue Li

To this end, a novel objective function is proposed to jointly optimize similarity metric learning, local positive mining and robust deep embedding.

Metric Learning Person Re-Identification

Gated Feedback Refinement Network for Dense Image Labeling

no code implementations CVPR 2017 Md Amirul Islam, Mrigank Rochan, Neil D. B. Bruce, Yang Wang

Effective integration of local and global contextual information is crucial for dense labeling problems.

What-and-Where to Match: Deep Spatially Multiplicative Integration Networks for Person Re-identification

no code implementations21 Jul 2017 Lin Wu, Yang Wang, Xue Li, Junbin Gao

To address \emph{what} to match, our deep network emphasizes common local patterns by learning joint representations in a multiplicative way.

Person Re-Identification

Beyond Low-Rank Representations: Orthogonal Clustering Basis Reconstruction with Optimized Graph Structure for Multi-view Spectral Clustering

no code implementations4 Aug 2017 Yang Wang, Lin Wu

Low-Rank Representation (LRR) is arguably one of the most powerful paradigms for Multi-view spectral clustering, which elegantly encodes the multi-view local graph/manifold structures into an intrinsic low-rank self-expressive data similarity embedded in high-dimensional space, to yield a better graph partition than their single-view counterparts.

Clustering

Eigen Evolution Pooling for Human Action Recognition

no code implementations17 Aug 2017 Yang Wang, Vinh Tran, Minh Hoai

We introduce Eigen Evolution Pooling, an efficient method to aggregate a sequence of feature vectors.

Action Recognition Temporal Action Localization

Multi-View Spectral Clustering via Structured Low-Rank Matrix Factorization

no code implementations5 Sep 2017 Yang Wang, Lin Wu

However, as we observed, such classical paradigm still suffers from (1) overlooking the flexible local manifold structure, caused by (2) enforcing the low-rank data correlation agreement among all views; worse still, (3) LRR is not intuitively flexible to capture the latent data clustering structures.

Clustering

Where to Focus: Deep Attention-based Spatially Recurrent Bilinear Networks for Fine-Grained Visual Recognition

no code implementations18 Sep 2017 Lin Wu, Yang Wang

Given an image, two different Convolutional Neural Networks (CNNs) are constructed, where the outputs of two CNNs are correlated through bilinear pooling to simultaneously focus on discriminative regions and extract relevant features.

Deep Attention Fine-Grained Image Classification +2

When Point Process Meets RNNs: Predicting Fine-Grained User Interests with Mutual Behavioral Infectivity

no code implementations14 Oct 2017 Tong Chen, Lin Wu, Yang Wang, Jun Zhang, Hongxu Chen, Xue Li

Inspired by point process in modeling temporal point process, in this paper we present a deep prediction method based on two recurrent neural networks (RNNs) to jointly model each user's continuous browsing history and asynchronous event sequences in the context of inter-user behavioral mutual infectivity.

Occlusion Aware Unsupervised Learning of Optical Flow

no code implementations CVPR 2018 Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu

Especially on KITTI dataset where abundant unlabeled samples exist, our unsupervised method outperforms its counterpart trained with supervised learning.

Optical Flow Estimation

Crossing Generative Adversarial Networks for Cross-View Person Re-identification

no code implementations4 Jan 2018 Chengyuan Zhang, Lin Wu, Yang Wang

Given a pair of person images, the proposed model consists of the variational auto-encoder to encode the pair into respective latent variables, a proposed cross-view alignment to reduce the view disparity, and an adversarial layer to seek the joint distribution of latent representations.

Cross-Modal Person Re-Identification Generative Adversarial Network

Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images

no code implementations19 Jan 2018 Jing Zhang, Yang Cao, Yang Wang, Chenglin Wen, Chang Wen Chen

Specifically, we propose to randomly shuffle the pixels in the origin images and leverage the shuffled image as input to make CNN more concerned with the statistical properties.

Color Constancy Image Dehazing

Learning Semantic Segmentation with Diverse Supervision

no code implementations1 Feb 2018 Linwei Ye, Zhi Liu, Yang Wang

Models based on deep convolutional neural networks (CNN) have significantly improved the performance of semantic segmentation.

object-detection Object Detection +2

LEGO: Learning Edge with Geometry all at Once by Watching Videos

1 code implementation CVPR 2018 Zhenheng Yang, Peng Wang, Yang Wang, Wei Xu, Ram Nevatia

In our framework, the predicted depths, normals and edges are forced to be consistent all the time.

BigDL: A Distributed Deep Learning Framework for Big Data

1 code implementation16 Apr 2018 Jason Dai, Yiheng Wang, Xin Qiu, Ding Ding, Yao Zhang, Yanzhang Wang, Xianyan Jia, Cherry Zhang, Yan Wan, Zhichao Li, Jiao Wang, Shengsheng Huang, Zhongyuan Wu, Yang Wang, Yuhao Yang, Bowen She, Dongjie Shi, Qi Lu, Kai Huang, Guoqiong Song

This paper presents BigDL (a distributed deep learning framework for Apache Spark), which has been used by a variety of users in the industry for building deep learning applications on production big data platforms.

Fraud Detection Management +1

Cycle-Consistent Deep Generative Hashing for Cross-Modal Retrieval

no code implementations30 Apr 2018 Lin Wu, Yang Wang, Ling Shao

In this paper, we propose a novel deep generative approach to cross-modal retrieval to learn hash functions in the absence of paired training samples through the cycle consistency loss.

Cross-Modal Retrieval Retrieval +1

Deep Co-attention based Comparators For Relative Representation Learning in Person Re-identification

1 code implementation30 Apr 2018 Lin Wu, Yang Wang, Junbin Gao, DaCheng Tao

Recent effective methods are developed in a pair-wise similarity learning system to detect a fixed set of features from distinct regions which are mapped to their vector embeddings for the distance measuring.

Foveation Person Re-Identification +1

OMG - Emotion Challenge Solution

no code implementations30 Apr 2018 Yuqi Cui, Xiao Zhang, Yang Wang, Chenfeng Guo, Dongrui Wu

This short paper describes our solution to the 2018 IEEE World Congress on Computational Intelligence One-Minute Gradual-Emotional Behavior Challenge, whose goal was to estimate continuous arousal and valence values from short videos.

regression

Video Summarization by Learning from Unpaired Data

no code implementations CVPR 2019 Mrigank Rochan, Yang Wang

Our model aims to learn a mapping function $F : V \rightarrow S$ such that the distribution of resultant summary videos from $F(V)$ is similar to the distribution of $S$ with the help of an adversarial objective.

Video Summarization

Every Pixel Counts: Unsupervised Geometry Learning with Holistic 3D Motion Understanding

no code implementations27 Jun 2018 Zhenheng Yang, Peng Wang, Yang Wang, Wei Xu, Ram Nevatia

The four types of information, i. e. 2D flow, camera pose, segment mask and depth maps, are integrated into a differentiable holistic 3D motion parser (HMP), where per-pixel 3D motion for rigid background and moving objects are recovered.

Depth And Camera Motion Optical Flow Estimation +1

Gated Feedback Refinement Network for Coarse-to-Fine Dense Semantic Image Labeling

no code implementations29 Jun 2018 Md Amirul Islam, Mrigank Rochan, Shujon Naha, Neil D. B. Bruce, Yang Wang

In order to address this issue, we also propose Gated Feedback Refinement Network (G-FRNet) that addresses this limitation.

Segmentation Semantic Segmentation

Manifold: A Model-Agnostic Framework for Interpretation and Diagnosis of Machine Learning Models

no code implementations1 Aug 2018 Jiawei Zhang, Yang Wang, Piero Molino, Lezhi Li, David S. Ebert

We present Manifold, a framework that utilizes visual analysis techniques to support interpretation, debugging, and comparison of machine learning models in a more transparent and interactive manner.

BIG-bench Machine Learning

Where-and-When to Look: Deep Siamese Attention Networks for Video-based Person Re-identification

no code implementations3 Aug 2018 Lin Wu, Yang Wang, Junbin Gao, Xue Li

Video-based person re-identification (re-id) is a central application in surveillance systems with significant concern in security.

Metric Learning Video-Based Person Re-Identification

Unbiased LambdaMART: An Unbiased Pairwise Learning-to-Rank Algorithm

1 code implementation16 Sep 2018 Ziniu Hu, Yang Wang, Qu Peng, Hang Li

Although click data is widely used in search systems in practice, so far the inherent bias, most notably position bias, has prevented it from being used in training of a ranker for search, i. e., learning-to-rank.

Learning-To-Rank Position

Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos

1 code implementation8 Oct 2018 Yang Wang, Zhenheng Yang, Peng Wang, Yi Yang, Chenxu Luo, Wei Xu

Then the whole scene is decomposed into moving foreground and static background by compar- ing the estimated optical flow and rigid flow derived from the depth and ego-motion.

Motion Estimation Optical Flow Estimation

Every Pixel Counts ++: Joint Learning of Geometry and Motion with 3D Holistic Understanding

1 code implementation14 Oct 2018 Chenxu Luo, Zhenheng Yang, Peng Wang, Yang Wang, Wei Xu, Ram Nevatia, Alan Yuille

Performance on the five tasks of depth estimation, optical flow estimation, odometry, moving object segmentation and scene flow estimation shows that our approach outperforms other SoTA methods.

Depth Estimation Optical Flow Estimation +2

Data-Driven Tight Frame for Cryo-EM Image Denoising and Conformational Classification

1 code implementation20 Oct 2018 Yin Xian, Hanlin Gu, Wei Wang, Xuhui Huang, Yuan YAO, Yang Wang, Jian-Feng Cai

We introduce the use of data-driven tight frame (DDTF) algorithm for cryo-EM image denoising.

Computation Image and Video Processing

Spelling Error Correction Using a Nested RNN Model and Pseudo Training Data

no code implementations1 Nov 2018 Hao Li, Yang Wang, Xinyu Liu, Zhichao Sheng, Si Wei

We propose a nested recurrent neural network (nested RNN) model for English spelling error correction and generate pseudo data based on phonetic similarity to train it.

Feature Engineering

Multi-view Laplacian Eigenmaps Based on Bag-of-Neighbors For RGBD Human Emotion Recognition

no code implementations8 Nov 2018 Shenglan Liu, Shuai Guo, Hong Qiao, Yang Wang, Bin Wang, Wenbo Luo, Mingming Zhang, Keye Zhang, Bixuan Du

As RGB view and depth view lie in different spaces, a new distance metric bag of neighbors (BON) used in MvLE can get the similar distributions of the two views.

Emotion Recognition

3D PersonVLAD: Learning Deep Global Representations for Video-based Person Re-identification

no code implementations26 Dec 2018 Lin Wu, Yang Wang, Ling Shao, Meng Wang

In this paper, we introduce a global video representation to video-based person re-identification (re-ID) that aggregates local 3D features across the entire video extent.

Video-Based Person Re-Identification

A Remote Sensing Image Dataset for Cloud Removal

2 code implementations3 Jan 2019 Daoyu Lin, Guangluan Xu, Xiaoke Wang, Yang Wang, Xian Sun, Kun fu

Removing clouds is an indispensable pre-processing step in remote sensing image analysis.

Change Detection Cloud Removal +1

GIF2Video: Color Dequantization and Temporal Interpolation of GIF images

no code implementations CVPR 2019 Yang Wang, Haibin Huang, Chuan Wang, Tong He, Jue Wang, Minh Hoai

In this paper, we propose GIF2Video, the first learning-based method for enhancing the visual quality of GIFs in the wild.

Quantization

Deep Generative Learning via Variational Gradient Flow

1 code implementation24 Jan 2019 Yuan Gao, Yuling Jiao, Yang Wang, Yao Wang, Can Yang, Shunkang Zhang

We propose a general framework to learn deep generative models via \textbf{V}ariational \textbf{Gr}adient Fl\textbf{ow} (VGrow) on probability spaces.

Binary Classification

Deep Discriminative Representation Learning with Attention Map for Scene Classification

no code implementations21 Feb 2019 Jun Li, Daoyu Lin, Yang Wang, Guangluan Xu, Chibiao Ding

However, most recent approaches to remote sensing scene classification are based on Convolutional Neural Networks (CNNs).

Classification Face Recognition +3

Wasserstein-Wasserstein Auto-Encoders

no code implementations25 Feb 2019 Shunkang Zhang, Yuan Gao, Yuling Jiao, Jin Liu, Yang Wang, Can Yang

To address the challenges in learning deep generative models (e. g., the blurriness of variational auto-encoder and the instability of training generative adversarial networks, we propose a novel deep generative model, named Wasserstein-Wasserstein auto-encoders (WWAE).

Few-Shot Deep Adversarial Learning for Video-based Person Re-identification

no code implementations29 Mar 2019 Lin Wu, Yang Wang, Hongzhi Yin, Meng Wang, Ling Shao

Video-based person re-identification (re-ID) refers to matching people across camera views from arbitrary unaligned video footages.

Time Series Time Series Analysis +1

Cross-Entropy Adversarial View Adaptation for Person Re-identification

no code implementations3 Apr 2019 Lin Wu, Richang Hong, Yang Wang, Meng Wang

The main contribution is to learn coupled asymmetric mappings regarding view characteristics which are adversarially trained to address the view discrepancy by optimising the cross-entropy view confusion objective.

Person Re-Identification

When AWGN-based Denoiser Meets Real Noises

2 code implementations6 Apr 2019 Yuqian Zhou, Jianbo Jiao, Haibin Huang, Yang Wang, Jue Wang, Honghui Shi, Thomas Huang

In this paper, we propose a novel approach to boost the performance of a real image denoiser which is trained only with synthetic pixel-independent noise data dominated by AWGN.

Denoising

Convolutional Temporal Attention Model for Video-based Person Re-identification

no code implementations9 Apr 2019 Tanzila Rahman, Mrigank Rochan, Yang Wang

A common approach for person re-identification is to first extract image features for all frames in the video, then aggregate all the features to form a video-level feature.

Semantic Segmentation Video-Based Person Re-Identification

Knowledge Distillation for Human Action Anticipation

no code implementations9 Apr 2019 Vinh Tran, Yang Wang, Minh Hoai

In this paper, we propose a novel knowledge distillation framework that uses an action recognition network to supervise the training of an action anticipation network, guiding the latter to attend to the relevant information needed for correctly anticipating the future actions.

Action Anticipation Action Recognition +3

Contextual Attention for Hand Detection in the Wild

1 code implementation ICCV 2019 Supreeth Narasimhaswamy, Zhengwei Wei, Yang Wang, Justin Zhang, Minh Hoai

We also conduct ablation studies on hand detection to show the effectiveness of the proposed contextual attention module.

Hand Detection object-detection +1

Attentive Action and Context Factorization

no code implementations10 Apr 2019 Yang Wang, Vinh Tran, Gedas Bertasius, Lorenzo Torresani, Minh Hoai

This is a challenging task due to the subtlety of human actions in video and the co-occurrence of contextual elements.

Action Recognition Temporal Action Localization

Visualizing and Understanding the Semantics of Embedding Spaces via Algebraic Formulae

no code implementations ICLR 2019 Piero Molino, Yang Wang, Jiawei Zhang

Embeddings are a fundamental component of many modern machine learning and natural language processing models.

Efficient EM-Variational Inference for Hawkes Process

no code implementations29 May 2019 Feng Zhou, Zhidong Li, Xuhui Fan, Yang Wang, Arcot Sowmya, Fang Chen

In classical Hawkes process, the baseline intensity and triggering kernel are assumed to be a constant and parametric function respectively, which limits the model flexibility.

Variational Inference

UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos

1 code implementation CVPR 2019 Yang Wang, Peng Wang, Zhenheng Yang, Chenxu Luo, Yi Yang, Wei Xu

In this paper, we propose UnOS, an unified system for unsupervised optical flow and stereo depth estimation using convolutional neural network (CNN) by taking advantages of their inherent geometrical consistency based on the rigid-scene assumption.

Motion Segmentation Optical Flow Estimation +2

Optimal low rank tensor recovery

no code implementations12 Jun 2019 Jian-Feng Cai, Lizhang Miao, Yang Wang, Yin Xian

We investigate the sample size requirement for exact recovery of a high order tensor of low rank from a subset of its entries.

Riemannian optimization

A Targeted Acceleration and Compression Framework for Low bit Neural Networks

no code implementations9 Jul 2019 Biao Qian, Yang Wang

In this paper, we propose a novel Targeted Acceleration and Compression (TAC) framework to improve the performance of 1 bit deep neural networks W e consider that the acceleration and compression effects of binarizing fully connected layer s are not sufficient to compensate for the accuracy loss caused by it In the proposed framework, t he convolutional and fully connected layer are separated and optimized i ndividually .

Binarization Computational Efficiency +2

Learning Structured Twin-Incoherent Twin-Projective Latent Dictionary Pairs for Classification

no code implementations21 Aug 2019 Zhao Zhang, Yulin Sun, Zheng Zhang, Yang Wang, Guangcan Liu, Meng Wang

In this setting, our TP-DPL integrates the twin-incoherence based latent flexible DPL and the joint embedding of codes as well as salient features by twin-projection into a unified model in an adaptive neighborhood-preserving manner.

General Classification

Adaptive Structure-constrained Robust Latent Low-Rank Coding for Image Recovery

no code implementations21 Aug 2019 Zhao Zhang, Lei Wang, Sheng Li, Yang Wang, Zheng Zhang, Zheng-Jun Zha, Meng Wang

Specifically, AS-LRC performs the latent decomposition of given data into a low-rank reconstruction by a block-diagonal codes matrix, a group sparse locality-adaptive salient feature part and a sparse error part.

Representation Learning

Multi-Task Deep Learning with Dynamic Programming for Embryo Early Development Stage Classification from Time-Lapse Videos

no code implementations22 Aug 2019 Zihan Liu, Bo Huang, Yuqi Cui, Yifan Xu, Bo Zhang, Lixia Zhu, Yang Wang, Lei Jin, Dongrui Wu

Accurate classification of embryo early development stages can provide embryologists valuable information for assessing the embryo quality, and hence is critical to the success of IVF.

General Classification

A Convolutional Neural Network with Mapping Layers for Hyperspectral Image Classification

no code implementations26 Aug 2019 Rui Li, Zhibin Pan, Yang Wang, Ping Wang

In this paper, we propose a convolutional neural network with mapping layers (MCNN) for hyperspectral image (HSI) classification.

Classification General Classification +1

Future Frame Prediction Using Convolutional VRNN for Anomaly Detection

no code implementations5 Sep 2019 Yiwei Lu, Mahesh Kumar Krishna Reddy, Seyed shahabeddin Nabavi, Yang Wang

Anomaly detection in videos aims at reporting anything that does not conform the normal behaviour or distribution.

Anomaly Detection

Region Mutual Information Loss for Semantic Segmentation

2 code implementations NeurIPS 2019 Shuai Zhao, Yang Wang, Zheng Yang, Deng Cai

In this paper, we develop a region mutual information (RMI) loss to model the dependencies among pixels more simply and efficiently.

Semantic Segmentation

Scalable Inference for Nonparametric Hawkes Process Using Pólya-Gamma Augmentation

no code implementations29 Oct 2019 Feng Zhou, Zhidong Li, Xuhui Fan, Yang Wang, Arcot Sowmya, Fang Chen

In this paper, we consider the sigmoid Gaussian Hawkes process model: the baseline intensity and triggering kernel of Hawkes process are both modeled as the sigmoid transformation of random trajectories drawn from Gaussian processes (GP).

Bayesian Inference Gaussian Processes +1

Kernelized Multiview Subspace Analysis by Self-weighted Learning

no code implementations23 Nov 2019 Huibing Wang, Yang Wang, Zhao Zhang, Xianping Fu, Zhuo Li, Mingliang Xu, Meng Wang

With the popularity of multimedia technology, information is always represented or transmitted from multiple views.

Dimensionality Reduction Image Retrieval +1

Diversifying Inference Path Selection: Moving-Mobile-Network for Landmark Recognition

no code implementations1 Dec 2019 Biao Qian, Yang Wang, Zhao Zhang, Richang Hong, Meng Wang, Ling Shao

We intuitively find that M$^2$Net can essentially promote the diversity of the inference path (selected blocks subset) selection, so as to enhance the recognition accuracy.

Landmark Recognition

Learning to Recommend via Meta Parameter Partition

no code implementations4 Dec 2019 Liang Zhao, Yang Wang, daxiang dong, Hao Tian

The fixed part, capturing user invariant features, is shared by all users and is learned during offline meta learning stage.

Meta-Learning

Multilayer Collaborative Low-Rank Coding Network for Robust Deep Subspace Discovery

no code implementations13 Dec 2019 Xianzhen Li, Zhao Zhang, Yang Wang, Guangcan Liu, Shuicheng Yan, Meng Wang

In this paper, we explore the deep multi-subspace recovery problem by designing a multilayer architecture for latent LRR.

Clustering Representation Learning

Fully-Convolutional Intensive Feature Flow Neural Network for Text Recognition

no code implementations13 Dec 2019 Zhao Zhang, Zemin Tang, Zheng Zhang, Yang Wang, Jie Qin, Meng Wang

But existing CNNs based frameworks still have several drawbacks: 1) the traditaional pooling operation may lose important feature information and is unlearnable; 2) the tradi-tional convolution operation optimizes slowly and the hierar-chical features from different layers are not fully utilized.

DerainCycleGAN: Rain Attentive CycleGAN for Single Image Deraining and Rainmaking

1 code implementation15 Dec 2019 Yanyan Wei, Zhao Zhang, Yang Wang, Mingliang Xu, Yi Yang, Shuicheng Yan, Meng Wang

However, in practice it is rather common to have no un-paired images in real deraining task, in such cases how to remove the rain streaks in an unsupervised way will be a very challenging task due to lack of constraints between images and hence suffering from low-quality recovery results.

Single Image Deraining

Compressed DenseNet for Lightweight Character Recognition

no code implementations15 Dec 2019 Zhao Zhang, Zemin Tang, Yang Wang, Haijun Zhang, Shuicheng Yan, Meng Wang

LDB is a convolutional block similarly as dense block, but it can reduce the computation cost and weight size to (1/L, 2/L), compared with original ones, where L is the number of layers in blocks.

Convolutional Dictionary Pair Learning Network for Image Representation Learning

no code implementations17 Dec 2019 Zhao Zhang, Yulin Sun, Yang Wang, Zheng-Jun Zha, Shuicheng Yan, Meng Wang

To address this issue, we propose a novel generalized end-to-end representation learning architecture, dubbed Convolutional Dictionary Pair Learning Network (CDPL-Net) in this paper, which integrates the learning schemes of the CNN and dictionary pair learning into a unified framework.

Dictionary Learning Representation Learning

Learning Hybrid Representation by Robust Dictionary Learning in Factorized Compressed Space

no code implementations26 Dec 2019 Jiahuan Ren, Zhao Zhang, Sheng Li, Yang Wang, Guangcan Liu, Shuicheng Yan, Meng Wang

Specifically, J-RFDL performs the robust representation by DL in a factorized compressed space to eliminate the negative effects of noise and outliers on the results, which can also make the DL process efficient.

Dictionary Learning

Dense Residual Network: Enhancing Global Dense Feature Flow for Character Recognition

no code implementations23 Jan 2020 Zhao Zhang, Zemin Tang, Yang Wang, Zheng Zhang, Choujun Zhan, ZhengJun Zha, Meng Wang

To construct FDRN, we propose a new fast residual dense block (f-RDB) to retain the ability of local feature fusion and local residual learning of original RDB, which can reduce the computing efforts at the same time.

Semi-DerainGAN: A New Semi-supervised Single Image Deraining Network

no code implementations23 Jan 2020 Yanyan Wei, Zhao Zhang, Yang Wang, Haijun Zhang, Mingbo Zhao, Mingliang Xu, Meng Wang

Although supervised deep deraining networks have obtained impressive results on synthetic datasets, they still cannot obtain satisfactory results on real images due to weak generalization of rain removal capacity, i. e., the pre-trained models usually cannot handle new shapes and directions that may lead to over-derained/under-derained results.

Single Image Deraining

Deep Learning-based Image Compression with Trellis Coded Quantization

no code implementations26 Jan 2020 Binglin Li, Mohammad Akbari, Jie Liang, Yang Wang

Recently many works attempt to develop image compression models based on deep learning architectures, where the uniform scalar quantizer (SQ) is commonly applied to the feature maps between the encoder and decoder.

Image Compression Quantization

Dual Convolutional LSTM Network for Referring Image Segmentation

no code implementations30 Jan 2020 Linwei Ye, Zhi Liu, Yang Wang

Given an input image and a referring expression in the form of a natural language sentence, the goal is to segment the object of interest in the image referred by the linguistic query.

Image Segmentation Natural Language Understanding +4

RiskOracle: A Minute-level Citywide Traffic Accident Forecasting Framework

no code implementations19 Feb 2020 Zhengyang Zhou, Yang Wang, Xike Xie, Lianliang Chen, Hengchang Liu

Real-time traffic accident forecasting is increasingly important for public safety and urban management (e. g., real-time safe route planning and emergency response deployment).

Management

LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

no code implementations COLING 2020 Yihuan Mao, Yujing Wang, Chufan Wu, Chen Zhang, Yang Wang, Yaming Yang, Quanlu Zhang, Yunhai Tong, Jing Bai

BERT is a cutting-edge language representation model pre-trained by a large corpus, which achieves superior performances on various natural language understanding tasks.

Blocking Knowledge Distillation +2

SimpleMKKM: Simple Multiple Kernel K-means

1 code implementation11 May 2020 Xinwang Liu, En Zhu, Jiyuan Liu, Timothy Hospedales, Yang Wang, Meng Wang

We propose a simple yet effective multiple kernel clustering algorithm, termed simple multiple kernel k-means (SimpleMKKM).

Clustering

Try This Instead: Personalized and Interpretable Substitute Recommendation

no code implementations19 May 2020 Tong Chen, Hongzhi Yin, Guanhua Ye, Zi Huang, Yang Wang, Meng Wang

Then, by treating attributes as the bridge between users and items, we can thoroughly model the user-item preferences (i. e., personalization) and item-item relationships (i. e., substitution) for recommendation.

Attribute Collaborative Filtering +1

Approximation in shift-invariant spaces with deep ReLU neural networks

no code implementations25 May 2020 Yunfei Yang, Zhen Li, Yang Wang

We also give lower bounds of the $L^p (1\le p \le \infty)$ approximation error for Sobolev spaces, which show that our construction of neural network is asymptotically optimal up to a logarithmic factor.

Deep Degradation Prior for Low-Quality Image Classification

no code implementations CVPR 2020 Yang Wang, Yang Cao, Zheng-Jun Zha, Jing Zhang, Zhiwei Xiong

Since the statistical properties are independent to image content, deep degradation prior can be learned on a training set of limited images without supervision of semantic labels and served in a form of "plugging-in" module of the existing classification networks to improve their performance on degraded images.

Classification General Classification +1

Utilizing machine learning to prevent water main breaks by understanding pipeline failure drivers

no code implementations5 Jun 2020 Dilusha Weeraddana, Bin Liang, Zhidong Li, Yang Wang, Fang Chen, Livia Bonazzi, Dean Phillips, Nitin Saxena

Data61 and Western Water worked collaboratively to apply engineering expertise and Machine Learning tools to find a cost-effective solution to the pipe failure problem in the region west of Melbourne, where on average 400 water main failures occur per year.

BIG-bench Machine Learning

Survey on Deep Multi-modal Data Analytics: Collaboration, Rivalry and Fusion

1 code implementation15 Jun 2020 Yang Wang

In this paper, we provide a substantial overview of the existing state-of-the-arts on the filed of multi-modal data analytics from shallow to deep spaces.

Unsupervised Vehicle Re-identification with Progressive Adaptation

no code implementations20 Jun 2020 Jinjia Peng, Yang Wang, Huibing Wang, Zhao Zhang, Xianping Fu, Meng Wang

For PAL, a data adaptation module is employed for source domain, which generates the images with similar data distribution to unlabeled target domain as ``pseudo target samples''.

Unsupervised Vehicle Re-Identification Vehicle Re-Identification

Recovering Accurate Labeling Information from Partially Valid Data for Effective Multi-Label Learning

no code implementations20 Jun 2020 Xi-Ming Li, Yang Wang

Partial Multi-label Learning (PML) aims to induce the multi-label predictor from datasets with noisy supervision, where each training instance is associated with several candidate labels but only partially valid.

Multi-Label Learning valid

Recurrent Relational Memory Network for Unsupervised Image Captioning

no code implementations24 Jun 2020 Dan Guo, Yang Wang, Peipei Song, Meng Wang

Unsupervised image captioning with no annotations is an emerging challenge in computer vision, where the existing arts usually adopt GAN (Generative Adversarial Networks) models.

Computational Efficiency Image Captioning +2

Cross-Modal Weighting Network for RGB-D Salient Object Detection

2 code implementations ECCV 2020 Gongyang Li, Zhi Liu, Linwei Ye, Yang Wang, Haibin Ling

In this paper, we propose a novel Cross-Modal Weighting (CMW) strategy to encourage comprehensive interactions between RGB and depth channels for RGB-D SOD.

object-detection Object Localization +3

Few-shot Scene-adaptive Anomaly Detection

1 code implementation ECCV 2020 Yiwei Lu, Frank Yu, Mahesh Kumar Krishna Reddy, Yang Wang

In this paper, we propose a novel few-shot scene-adaptive anomaly detection problem to address the limitations of previous approaches.

Anomaly Detection Meta-Learning

Adaptive Video Highlight Detection by Learning from User History

1 code implementation ECCV 2020 Mrigank Rochan, Mahesh Kumar Krishna Reddy, Linwei Ye, Yang Wang

In this paper, we propose a simple yet effective framework that learns to adapt highlight detection to a user by exploiting the user's history in the form of highlights that the user has previously created.

Highlight Detection

Ontology-based annotation and analysis of COVID-19 phenotypes

no code implementations5 Aug 2020 Yang Wang, Fengwei Zhang, Hong Yu, Xianwei Ye, Yongqun He

The commonly occurring 17 phenotypes were classified into different groups based on the Human Phenotype Ontology (HPO).

A Mathematical Introduction to Generative Adversarial Nets (GAN)

no code implementations1 Sep 2020 Yang Wang

This paper attempts to provide an overview of GANs from a mathematical point of view.

Dual-constrained Deep Semi-Supervised Coupled Factorization Network with Enriched Prior

no code implementations8 Sep 2020 Yan Zhang, Zhao Zhang, Yang Wang, Zheng Zhang, Li Zhang, Shuicheng Yan, Meng Wang

Nonnegative matrix factorization is usually powerful for learning the "shallow" parts-based representation, but it clearly fails to discover deep hierarchical information within both the basis and representation spaces.

Clustering Graph Learning +1

Where is the Model Looking At?--Concentrate and Explain the Network Attention

no code implementations29 Sep 2020 Wenjia Xu, Jiuniu Wang, Yang Wang, Guangluan Xu, Wei Dai, Yirong Wu

We generate attribute-based textual explanations for the network and ground the attributes on the image to show visual explanations.

Attribute Image Classification +1

High Quality Remote Sensing Image Super-Resolution Using Deep Memory Connected Network

no code implementations1 Oct 2020 Wenjia Xu, Guangluan Xu, Yang Wang, Xian Sun, Daoyu Lin, Yirong Wu

Single image super-resolution is an effective way to enhance the spatial resolution of remote sensing image, which is crucial for many applications such as target detection and image classification.

Image Classification Image Super-Resolution

Deep-HOSeq: Deep Higher Order Sequence Fusion for Multimodal Sentiment Analysis

1 code implementation16 Oct 2020 Sunny Verma, Jiwei Wang, Zhefeng Ge, Rujia Shen, Fan Jin, Yang Wang, Fang Chen, Wei Liu

In this research, we first propose a common network to discover both intra-modal and inter-modal dynamics by utilizing basic LSTMs and tensor based convolution networks.

Multimodal Sentiment Analysis Sentiment Classification

AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting

1 code implementation23 Oct 2020 Mahesh Kumar Krishna Reddy, Mrigank Rochan, Yiwei Lu, Yang Wang

In particular, we propose a new problem called unlabeled scene-adaptive crowd counting.

Crowd Counting

Long-Term Pipeline Failure Prediction Using Nonparametric Survival Analysis

no code implementations11 Nov 2020 Dilusha Weeraddana, Sudaraka MallawaArachchi, Tharindu Warnakula, Zhidong Li, Yang Wang

We applied Machine Learning techniques to find a cost-effective solution to the pipe failure problem in these Australian cities, where on average 1500 of water main failures occur each year.

BIG-bench Machine Learning Survival Analysis

Learning Hybrid Representations for Automatic 3D Vessel Centerline Extraction

no code implementations14 Dec 2020 Jiafa He, Chengwei Pan, Can Yang, Ming Zhang, Yang Wang, Xiaowei Zhou, Yizhou Yu

The main idea is to use CNNs to learn local appearances of vessels in image crops while using another point-cloud network to learn the global geometry of vessels in the entire image.

Representation Learning

On the capacity of deep generative networks for approximating distributions

no code implementations29 Jan 2021 Yunfei Yang, Zhen Li, Yang Wang

Furthermore, it is shown that the approximation error in Wasserstein distance grows at most linearly on the ambient dimension and that the approximation order only depends on the intrinsic dimension of the target distribution.

VSEGAN: Visual Speech Enhancement Generative Adversarial Network

no code implementations4 Feb 2021 Xinmeng Xu, Yang Wang, Dongxiang Xu, Yiyuan Peng, Cong Zhang, Jie Jia, Binbin Chen

This paper proposes a novel frameworkthat involves visual information for speech enhancement, by in-corporating a Generative Adversarial Network (GAN).

Generative Adversarial Network Speech Enhancement

Wasserstein Graph Neural Networks for Graphs with Missing Attributes

no code implementations6 Feb 2021 Zhixian Chen, Tengfei Ma, Yangqiu Song, Yang Wang

In this paper, we propose an innovative node representation learning framework, Wasserstein Graph Neural Network (WGNN), to mitigate the problem.

Attribute Graph Representation Learning +3

STUaNet: Understanding uncertainty in spatiotemporal collective human mobility

no code implementations9 Feb 2021 Zhengyang Zhou, Yang Wang, Xike Xie, Lei Qiao, Yuantao Li

The high dynamics and heterogeneous interactions in the complicated urban systems have raised the issue of uncertainty quantification in spatiotemporal human mobility, to support critical decision-makings in risk-aware web applications such as urban event prediction where fluctuations are of significant interests.

Uncertainty Quantification

Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network

no code implementations9 Feb 2021 Linwei Ye, Mrigank Rochan, Zhi Liu, Xiaoqin Zhang, Yang Wang

In this paper, we propose a cross-modal self-attention (CMSA) module to utilize fine details of individual words and the input image or video, which effectively captures the long-range dependencies between linguistic and visual features.

Ranked #5 on Referring Expression Segmentation on J-HMDB (Precision@0.9 metric)

Referring Expression Referring Expression Segmentation +3

DynACPD Embedding Algorithm for Prediction Tasks in Dynamic Networks

no code implementations12 Mar 2021 Chris Connell, Yang Wang

Classical network embeddings create a low dimensional representation of the learned relationships between features across nodes.

Link Prediction Node Classification

Quaternion Factorization Machines: A Lightweight Solution to Intricate Feature Interaction Modelling

no code implementations5 Apr 2021 Tong Chen, Hongzhi Yin, Xiangliang Zhang, Zi Huang, Yang Wang, Meng Wang

As a well-established approach, factorization machine (FM) is capable of automatically learning high-order interactions among features to make predictions without the need for manual feature engineering.

Feature Engineering

Dual-side Sparse Tensor Core

no code implementations20 May 2021 Yang Wang, Chen Zhang, Zhiqiang Xie, Cong Guo, Yunxin Liu, Jingwen Leng

We demonstrate the feasibility of our design with minimal changes to the existing production-scale inner-product-based Tensor Core.

An error analysis of generative adversarial networks for learning distributions

no code implementations27 May 2021 Jian Huang, Yuling Jiao, Zhen Li, Shiao Liu, Yang Wang, Yunfei Yang

This paper studies how well generative adversarial networks (GANs) learn probability distributions from finite samples.

Learning Elastic Embeddings for Customizing On-Device Recommenders

no code implementations4 Jun 2021 Tong Chen, Hongzhi Yin, Yujia Zheng, Zi Huang, Yang Wang, Meng Wang

The core idea is to compose elastic embeddings for each item, where an elastic embedding is the concatenation of a set of embedding blocks that are carefully chosen by an automated search function.

Recommendation Systems

Generalized Linear Bandits with Local Differential Privacy

1 code implementation NeurIPS 2021 Yuxuan Han, Zhipeng Liang, Yang Wang, Jiheng Zhang

In this paper, we design LDP algorithms for stochastic generalized linear bandits to achieve the same regret bound as in non-privacy settings.

Decision Making Multi-Armed Bandits

DECORE: Deep Compression with Reinforcement Learning

no code implementations CVPR 2022 Manoj Alwani, Yang Wang, Vashisht Madhavan

For a larger dataset like ImageNet with just 30 epochs of training, it can compress the ResNet-50 architecture by 44. 7% and reduce FLOPs by 42. 3%, with just a 0. 69% drop on Top-5 accuracy of the uncompressed model.

reinforcement-learning Reinforcement Learning (RL)

Data Augmentation for Graph Convolutional Network on Semi-Supervised Classification

no code implementations16 Jun 2021 Zhengzheng Tang, Ziyue Qiao, Xuehai Hong, Yang Wang, Fayaz Ali Dharejo, Yuanchun Zhou, Yi Du

However, data augmentation for graph-based models remains a challenging problem, as graph data is more complex than traditional data, which consists of two features with different properties: graph topology and node attributes.

Classification Data Augmentation +1

Deep Generative Learning via Schrödinger Bridge

no code implementations19 Jun 2021 Gefei Wang, Yuling Jiao, Qian Xu, Yang Wang, Can Yang

At the sample level, we derive our Schr\"{o}dinger Bridge algorithm by plugging the drift term estimated by a deep score estimator and a deep density ratio estimator into the Euler-Maruyama method.

Image Inpainting

Test-Time Fast Adaptation for Dynamic Scene Deblurring via Meta-Auxiliary Learning

no code implementations CVPR 2021 Zhixiang Chi, Yang Wang, Yuanhao Yu, Jin Tang

Therefore, we are able to exploit the internal information at test time via the auxiliary task to enhance the performance of deblurring.

Auxiliary Learning Deblurring +1

Image Change Captioning by Learning From an Auxiliary Task

no code implementations CVPR 2021 Mehrdad Hosseinzadeh, Yang Wang

Inspired by the success of multi-task learning, we formulate a training scheme that uses an auxiliary task to improve the training of the change captioning network.

Image Retrieval Multi-Task Learning +2

Visualizing Graph Neural Networks with CorGIE: Corresponding a Graph to Its Embedding

1 code implementation24 Jun 2021 Zipeng Liu, Yang Wang, Jürgen Bernard, Tamara Munzner

Graph neural networks (GNNs) are a class of powerful machine learning tools that model node relations for making predictions of nodes or links.

Bias-Tolerant Fair Classification

no code implementations7 Jul 2021 Yixuan Zhang, Feng Zhou, Zhidong Li, Yang Wang, Fang Chen

Therefore, we propose a Bias-TolerantFAirRegularizedLoss (B-FARL), which tries to regain the benefits using data affected by label bias and selection bias.

Classification Fairness +2

Data-Driven Constitutive Relation Reveals Scaling Law for Hydrodynamic Transport Coefficients

no code implementations1 Aug 2021 Candi Zheng, Yang Wang, Shiyi Chen

We further proposed a constitutive relation model based on scaling law and tested it on the calculation of Rayleigh scattering spectra.

regression Relation +2

Spatio-Temporal Self-Attention Network for Video Saliency Prediction

1 code implementation24 Aug 2021 Ziqiang Wang, Zhi Liu, Gongyang Li, Yang Wang, Tianhong Zhang, Lihua Xu, Jijun Wang

3D convolutional neural networks have achieved promising results for video tasks in computer vision, including video saliency prediction that is explored in this paper.

Saliency Prediction Video Saliency Prediction

NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset

1 code implementation Findings (EMNLP) 2021 Qiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim

While diverse question answering (QA) datasets have been proposed and contributed significantly to the development of deep learning models for QA tasks, the existing datasets fall short in two aspects.

Graph Question Answering Question Answering

FedDrop: Trajectory-weighted Dropout for Efficient Federated Learning

no code implementations29 Sep 2021 Dongping Liao, Xitong Gao, Yiren Zhao, Hao Dai, Li Li, Kafeng Wang, Kejiang Ye, Yang Wang, Cheng-Zhong Xu

Federated learning (FL) enables edge clients to train collaboratively while preserving individual's data privacy.

Federated Learning

Graph Information Matters: Understanding Graph Filters from Interaction Probability

no code implementations29 Sep 2021 Zhixian Chen, Tengfei Ma, Yang Wang

We show that the homophily degree of graphs significantly affects the prediction error of graph filters.

Graph Learning Node Classification

Non-Asymptotic Error Bounds for Bidirectional GANs

no code implementations NeurIPS 2021 Shiao Liu, Yunfei Yang, Jian Huang, Yuling Jiao, Yang Wang

Our results are also applicable to the Wasserstein bidirectional GAN if the target distribution is assumed to have a bounded support.

Self-supervised Spatiotemporal Representation Learning by Exploiting Video Continuity

no code implementations11 Dec 2021 Hanwen Liang, Niamul Quader, Zhixiang Chi, Lizhe Chen, Peng Dai, Juwei Lu, Yang Wang

Recent self-supervised video representation learning methods have found significant success by exploring essential properties of videos, e. g. speed, temporal order, etc.

Action Localization Action Recognition +3

Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading

no code implementations CVPR 2022 Ganchao Tan, Yang Wang, Han Han, Yang Cao, Feng Wu, Zheng-Jun Zha

To recognize words from the event data, we propose a novel Multi-grained Spatio-Temporal Features Perceived Network (MSTP) to perceive fine-grained spatio-temporal features from microsecond time-resolved event data.

Action Recognition Lip Reading

Contrastive Learning for Unsupervised Video Highlight Detection

no code implementations CVPR 2022 Taivanbat Badamdorj, Mrigank Rochan, Yang Wang, Li Cheng

Our framework encodes a video into a vector representation by learning to pick video clips that help to distinguish it from other videos via a contrastive objective using dropout noise.

Contrastive Learning Highlight Detection

Exposure Normalization and Compensation for Multiple-Exposure Correction

no code implementations CVPR 2022 Jie Huang, Yajing Liu, Xueyang Fu, Man Zhou, Yang Wang, Feng Zhao, Zhiwei Xiong

However, the procedures of correcting underexposure and overexposure to normal exposures are much different from each other, leading to large discrepancies for the network in correcting multiple exposures, thus resulting in poor performance.

Image Enhancement

Dreaming To Prune Image Deraining Networks

no code implementations CVPR 2022 Weiqi Zou, Yang Wang, Xueyang Fu, Yang Cao

It is based on our observation that deep degradation representations can be clustered by degradation characteristics (types of rain) while independent of image content.

Model Compression Rain Removal

MetaFSCIL: A Meta-Learning Approach for Few-Shot Class Incremental Learning

no code implementations CVPR 2022 Zhixiang Chi, Li Gu, Huan Liu, Yang Wang, Yuanhao Yu, Jin Tang

The learning objective of these methods is often hand-engineered and is not directly tied to the objective (i. e. incrementally learning new classes) during testing.

Few-Shot Class-Incremental Learning Incremental Learning +1

Approximation bounds for norm constrained neural networks with applications to regression and GANs

no code implementations24 Jan 2022 Yuling Jiao, Yang Wang, Yunfei Yang

This paper studies the approximation capacity of ReLU neural networks with norm constraint on the weights.

regression

When Does A Spectral Graph Neural Network Fail in Node Classification?

no code implementations16 Feb 2022 Zhixian Chen, Tengfei Ma, Yang Wang

Although graph filters provide theoretical foundations for model explanations, it is unclear when a spectral GNN will fail.

Graph Learning Node Classification

S-Rocket: Selective Random Convolution Kernels for Time Series Classification

1 code implementation7 Mar 2022 Hojjat Salehinejad, Yang Wang, Yuanhao Yu, Tang Jin, Shahrokh Valaee

A population-based optimization algorithm evolves the population in order to find a best state vector which minimizes the number of active kernels while maximizing the accuracy of the classifier.

Combinatorial Optimization regression +3

ProgressiveMotionSeg: Mutually Reinforced Framework for Event-Based Motion Segmentation

no code implementations22 Mar 2022 Jinze Chen, Yang Wang, Yang Cao, Feng Wu, Zheng-Jun Zha

Dynamic Vision Sensor (DVS) can asynchronously output the events reflecting apparent motion of objects with microsecond resolution, and shows great application potential in monitoring and other fields.

Denoising Motion Estimation +1

Recent Few-Shot Object Detection Algorithms: A Survey with Performance Comparison

no code implementations27 Mar 2022 Tianying Liu, Lu Zhang, Yang Wang, Jihong Guan, Yanwei Fu, Jiajia Zhao, Shuigeng Zhou

To this end, the Few-Shot Object Detection (FSOD) has been topical recently, as it mimics the humans' ability of learning to learn, and intelligently transfers the learned generic object knowledge from the common heavy-tailed, to the novel long-tailed object classes.

Few-Shot Object Detection Meta-Learning +3

BigDL 2.0: Seamless Scaling of AI Pipelines from Laptops to Distributed Cluster

1 code implementation CVPR 2022 Jason Dai, Ding Ding, Dongjie Shi, Shengsheng Huang, Jiao Wang, Xin Qiu, Kai Huang, Guoqiong Song, Yang Wang, Qiyuan Gong, Jiaming Song, Shan Yu, Le Zheng, Yina Chen, Junwei Deng, Ge Song

To address this challenge, we have open sourced BigDL 2. 0 at https://github. com/intel-analytics/BigDL/ under Apache 2. 0 license (combining the original BigDL and Analytics Zoo projects); using BigDL 2. 0, users can simply build conventional Python notebooks on their laptops (with possible AutoML support), which can then be transparently accelerated on a single node (with up-to 9. 6x speedup in our experiments), and seamlessly scaled out to a large cluster (across several hundreds servers in real-world use cases).

AutoML Distributed Computing +1

Thinking inside The Box: Learning Hypercube Representations for Group Recommendation

1 code implementation6 Apr 2022 Tong Chen, Hongzhi Yin, Jing Long, Quoc Viet Hung Nguyen, Yang Wang, Meng Wang

Such user and group preferences are commonly represented as points in the vector space (i. e., embeddings), where multiple user embeddings are compressed into one to facilitate ranking for group-item pairs.

Decision Making

Transformer-Empowered 6G Intelligent Networks: From Massive MIMO Processing to Semantic Communication

no code implementations8 May 2022 Yang Wang, Zhen Gao, Dezhi Zheng, Sheng Chen, Deniz Gündüz, H. Vincent Poor

It is anticipated that 6G wireless networks will accelerate the convergence of the physical and cyber worlds and enable a paradigm-shift in the way we deploy and exploit communication networks.

On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits

1 code implementation16 Jun 2022 Yuxuan Han, Zhicong Liang, Zhipeng Liang, Yang Wang, Yuan YAO, Jiheng Zhang

To address such a challenge as the online convex optimization with privacy protection, we propose a private variant of online Frank-Wolfe algorithm with recursive gradients for variance reduction to update and reveal the parameters upon each data.

Multi-Armed Bandits

DGraph: A Large-Scale Financial Dataset for Graph Anomaly Detection

no code implementations30 Jun 2022 Xuanwen Huang, Yang Yang, Yang Wang, Chunping Wang, Zhisheng Zhang, Jiarong Xu, Lei Chen, Michalis Vazirgiannis

Since GAD emphasizes the application and the rarity of anomalous samples, enriching the varieties of its datasets is fundamental work.

Graph Anomaly Detection

Improving Visual Speech Enhancement Network by Learning Audio-visual Affinity with Multi-head Attention

no code implementations30 Jun 2022 Xinmeng Xu, Yang Wang, Jie Jia, Binbin Chen, Dejun Li

The proposed model alleviates these drawbacks by a) applying a model that fuses audio and visual features layer by layer in encoding phase, and that feeds fused audio-visual features to each corresponding decoder layer, and more importantly, b) introducing a 2-stage multi-head cross attention (MHCA) mechanism to infer audio-visual speech enhancement for balancing the fused audio-visual features and eliminating irrelevant features.

Speech Enhancement

Learning Resolution-Adaptive Representations for Cross-Resolution Person Re-Identification

no code implementations9 Jul 2022 Lin Wu, Lingqiao Liu, Yang Wang, Zheng Zhang, Farid Boussaid, Mohammed Bennamoun

It is a challenging and practical problem since the query images often suffer from resolution degradation due to the different capturing conditions from real-world cameras.

Person Re-Identification Super-Resolution

De-biased Representation Learning for Fairness with Unreliable Labels

no code implementations1 Aug 2022 Yixuan Zhang, Feng Zhou, Zhidong Li, Yang Wang, Fang Chen

In other words, the fair pre-processing methods ignore the discrimination encoded in the labels either during the learning procedure or the evaluation stage.

Fairness Representation Learning

PECCO: A Profit and Cost-oriented Computation Offloading Scheme in Edge-Cloud Environment with Improved Moth-flame Optimisation

no code implementations9 Aug 2022 Jiashu Wu, Hao Dai, Yang Wang, Shigen Shen, Chengzhong Xu

With the fast growing quantity of data generated by smart devices and the exponential surge of processing demand in the Internet of Things (IoT) era, the resource-rich cloud centres have been utilised to tackle these challenges.

Trajectory Tracking Control of the Bionic Joint Actuated by Pneumatic Artificial Muscle Based on Robust Modeling

no code implementations10 Aug 2022 Yang Wang, Qiang Zhang, Xiao-hui Xiao

Then, a hybrid model is established based on the two models (the nonlinear model and the LTI model) and corresponding to it, a cascaded controller is developed, the outer loop of which is an H-infinite controller for the angular position tracking designed by loop-shaping design procedure (LSDP) and the inner loop is a nonlinear controller based on the feedback linearization theory for the PAM driving pressure control.

Position

Instance Image Retrieval by Learning Purely From Within the Dataset

no code implementations12 Aug 2022 Zhongyan Zhang, Lei Wang, Yang Wang, Luping Zhou, Jianjia Zhang, Peng Wang, Fang Chen

Although achieving promising results, this approach is restricted by two issues: 1) the domain gap between benchmark datasets and the dataset of a given retrieval task; 2) the required auxiliary dataset cannot be readily obtained.

Image Retrieval Retrieval +2

Grasping Core Rules of Time Series through Pure Models

no code implementations15 Aug 2022 Gedi Liu, Yifeng Jiang, Yi Ouyang, Keyang Zhong, Yang Wang

Time series underwent the transition from statistics to deep learning, as did many other machine learning fields.

Time Series Time Series Analysis

Towards Learning in Grey Spatiotemporal Systems: A Prophet to Non-consecutive Spatiotemporal Dynamics

no code implementations17 Aug 2022 Zhengyang Zhou, Yang Kuo, Wei Sun, Binwu Wang, Min Zhou, Yunan Zong, Yang Wang

To infer region-wise proximity under flexible factor-wise combinations and enable dynamic neighborhood aggregations, we further disentangle compounded influences of exogenous factors on region-wise proximity and learn to aggregate them.

Uncertainty Quantification

Switchable Online Knowledge Distillation

1 code implementation12 Sep 2022 Biao Qian, Yang Wang, Hongzhi Yin, Richang Hong, Meng Wang

Instead of focusing on the accuracy gap at test phase by the existing arts, the core idea of SwitOKD is to adaptively calibrate the gap at training phase, namely distillation gap, via a switching strategy between two modes -- expert mode (pause the teacher while keep the student learning) and learning mode (restart the teacher).

Knowledge Distillation

Delving Globally into Texture and Structure for Image Inpainting

1 code implementation17 Sep 2022 Haipeng Liu, Yang Wang, Meng Wang, Yong Rui

Our model is orthogonal to the fashionable arts, such as Convolutional Neural Networks (CNNs), Attention and Transformer model, from the perspective of texture and structure information for image inpainting.

Image Inpainting

Spatio-Temporal Contrastive Learning Enhanced GNNs for Session-based Recommendation

1 code implementation23 Sep 2022 Zhongwei Wan, Xin Liu, Benyou Wang, Jiezhong Qiu, Boyu Li, Ting Guo, Guangyong Chen, Yang Wang

The idea is to supplement the GNN-based main supervised recommendation task with the temporal representation via an auxiliary cross-view contrastive learning mechanism.

Collaborative Filtering Contrastive Learning +1

Hierarchical Few-Shot Object Detection: Problem, Benchmark and Method

1 code implementation8 Oct 2022 Lu Zhang, Yang Wang, Jiaogen Zhou, Chenbo Zhang, Yinglu Zhang, Jihong Guan, Yatao Bian, Shuigeng Zhou

In this paper, we propose and solve a new problem called hierarchical few-shot object detection (Hi-FSOD), which aims to detect objects with hierarchical categories in the FSOD paradigm.

Contrastive Learning Few-Shot Object Detection +2

Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts

1 code implementation8 Oct 2022 Tao Zhong, Zhixiang Chi, Li Gu, Yang Wang, Yuanhao Yu, Jin Tang

Most existing methods perform training on multiple source domains using a single model, and the same trained model is used on all unseen target domains.

Domain Generalization Knowledge Distillation +3

FAQS: Communication-efficient Federate DNN Architecture and Quantization Co-Search for personalized Hardware-aware Preferences

no code implementations16 Oct 2022 Hongjiang Chen, Yang Wang, Leibo Liu, Shaojun Wei, Shouyi Yin

Due to user privacy and regulatory restrictions, federate learning (FL) is proposed as a distributed learning framework for training deep neural networks (DNN) on decentralized data clients.

Neural Architecture Search Quantization

HQNAS: Auto CNN deployment framework for joint quantization and architecture search

no code implementations16 Oct 2022 Hongjiang Chen, Yang Wang, Leibo Liu, Shaojun Wei, Shouyi Yin

Deep learning applications are being transferred from the cloud to edge with the rapid development of embedded computing systems.

Neural Architecture Search Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.