Search Results for author: Wei Li

Found 357 papers, 130 papers with code

Locally Aligned Feature Transforms across Views

no code implementations CVPR 2013 Wei Li, Xiaogang Wang

In this paper, we propose a new approach for matching images observed in different camera views with complex cross-view transforms and apply it to person reidentification.

Clustering Metric Learning +1

Exploiting Structure in Weighted Model Counting Approaches to Probabilistic Inference

no code implementations16 Jan 2014 Wei Li, Pascal Poupart, Peter van Beek

Previous studies have demonstrated that encoding a Bayesian network into a SAT formula and then performing weighted model counting using a backtracking search algorithm can be an effective method for exact inference.

A Novel Face Recognition Method using Nearest Line Projection

no code implementations24 Feb 2014 Huanguo Zhang, Sha Lv, Wei Li, Xun Qu

Instead of projecting an image to its nearest image, we try to project it to its nearest line spanned by two different face images.

Face Recognition

Real-time Decolorization using Dominant Colors

no code implementations10 Apr 2014 Wei Hu, Wei Li, Fan Zhang, Qian Du

Decolorization is the process to convert a color image or video to its grayscale version, and it has received great attention in recent years.

DeepReID: Deep Filter Pairing Neural Network for Person Re-Identification

no code implementations CVPR 2014 Wei Li, Rui Zhao, Tong Xiao, Xiaogang Wang

In this paper, we propose a novel filter pairing neural network (FPNN) to jointly handle misalignment, photometric and geometric transforms, occlusions and background clutter.

Person Re-Identification

Macroblock Classification Method for Video Applications Involving Motions

no code implementations28 Feb 2015 Weiyao Lin, Ming-Ting Sun, Hongxiang Li, Zhenzhong Chen, Wei Li, Bing Zhou

We demonstrate that this low-computation-complexity method can efficiently catch the characteristics of the frame.

Change Detection Classification +2

Analytic connectivity of $k$-uniform hypergraphs

no code implementations10 Jul 2015 An Chang, Joshua Cooper, Wei Li

In this paper, we study the analytic connectivity of a $k$-uniform hypergraph $H$, denoted by $\alpha(H)$.

Combinatorics 05C65 (Primary), 05C40, 05B05, 26D15 (Secondary)

Bearing fault diagnosis based on spectrum images of vibration signals

no code implementations8 Nov 2015 Wei Li, Mingquan Qiu, Zhencai Zhu, Bo Wu, Gongbo Zhou

Bearing fault diagnosis has been a challenge in the monitoring activities of rotating machinery, and it's receiving more and more attention.

General Classification

Turing learning: a metric-free approach to inferring behavior and its application to swarms

no code implementations15 Mar 2016 Wei Li, Melvin Gauci, Roderich Gross

We present two case studies with swarms of simulated robots and prove that the underlying behaviors cannot be inferred by a metric-based system identification method.

Towards an "In-the-Wild" Emotion Dataset Using a Game-based Framework

no code implementations10 Jul 2016 Wei Li, Farnaz Abtahi, Christina Tsangouri, Zhigang Zhu

To evaluate the dataset, we compared the performance of two deep learning models trained on both GaMo and CIFE.

Recycle deep features for better object detection

no code implementations18 Jul 2016 Wei Li, Matthias Breier, Dorit Merhof

Aiming at improving the performance of existing detection algorithms developed for different applications, we propose a region regression-based multi-stage class-agnostic detection pipeline, whereby the existing algorithms are employed for providing the initial detection proposals.

Object object-detection +2

Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering

3 code implementations21 Jul 2016 Peng Li, Wei Li, Zhengyan He, Xuguang Wang, Ying Cao, Jie zhou, Wei Xu

While question answering (QA) with neural network, i. e. neural QA, has achieved promising results in recent years, lacking of large scale real-word QA dataset is still a challenge for developing and evaluating neural QA system.

Answer Generation Question Answering

CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016

1 code implementation2 Aug 2016 Yuanjun Xiong, Li-Min Wang, Zhe Wang, Bo-Wen Zhang, Hang Song, Wei Li, Dahua Lin, Yu Qiao, Luc van Gool, Xiaoou Tang

This paper presents the method that underlies our submission to the untrimmed video classification task of ActivityNet Challenge 2016.

General Classification Video Classification

A Recursive Framework for Expression Recognition: From Web Images to Deep Models to Game Dataset

no code implementations4 Aug 2016 Wei Li, Christina Tsangouri, Farnaz Abtahi, Zhigang Zhu

In order to increase the expression recognition accuracy, we also fine-tune the CNN model and thus obtain a better CNN facial expression recognition model.

Facial Expression Recognition Facial Expression Recognition (FER)

Tuning parameter calibration for $\ell_1$-regularized logistic regression

no code implementations1 Oct 2016 Wei Li, Johannes Lederer

Feature selection is a standard approach to understanding and modeling high-dimensional classification data, but the corresponding statistical methods hinge on tuning parameters that are difficult to calibrate.

feature selection General Classification +1

Learning and Fusing Multimodal Features from and for Multi-task Facial Computing

no code implementations14 Oct 2016 Wei Li, Zhigang Zhu

We have found that features trained for one task can be used for other related tasks.

Face Recognition

Recurrent Neural Network Language Model Adaptation Derived Document Vector

no code implementations1 Nov 2016 Wei Li, Brian Kan Wing Mak

In many natural language processing (NLP) tasks, a document is commonly modeled as a bag of words using the term frequency-inverse document frequency (TF-IDF) vector.

General Classification Genre classification +1

Abstractive News Summarization based on Event Semantic Link Network

no code implementations COLING 2016 Wei Li, Lei He, Hai Zhuge

This paper studies the abstractive multi-document summarization for event-oriented news texts through event information extraction and abstract representation.

Abstractive Text Summarization Document Summarization +2

Fast color transfer from multiple images

no code implementations28 Dec 2016 Asad Khan, Luo Jiang, Wei Li, Ligang Liu

Our algorithm is not restricted to one-to-one image color transfer and can make use of more than one target images to transfer the color in different regions in the source image.

EAC-Net: A Region-based Deep Enhancing and Cropping Approach for Facial Action Unit Detection

no code implementations9 Feb 2017 Wei Li, Farnaz Abtahi, Zhigang Zhu, Lijun Yin

For the enhancing layers, we designed an attention map based on facial landmark features and applied it to a pretrained neural network to conduct enhanced learning (The E-Net).

Action Unit Detection Facial Action Unit Detection

Prostate Cancer Diagnosis using Deep Learning with 3D Multiparametric MRI

no code implementations12 Mar 2017 Saifeng Liu, Huaixiu Zheng, Yesu Feng, Wei Li

A novel deep learning architecture (XmasNet) based on convolutional neural networks was developed for the classification of prostate cancer lesions, using the 3D multiparametric MRI data provided by the PROSTATEx challenge.

BIG-bench Machine Learning Data Augmentation +1

Derivation of Document Vectors from Adaptation of LSTM Language Model

no code implementations EACL 2017 Wei Li, Brian Mak

In many natural language processing (NLP) tasks, a document is commonly modeled as a bag of words using the term frequency-inverse document frequency (TF-IDF) vector.

General Classification Genre classification +1

Person Re-Identification by Deep Joint Learning of Multi-Loss Classification

no code implementations12 May 2017 Wei Li, Xiatian Zhu, Shaogang Gong

Existing person re-identification (re-id) methods rely mostly on either localised or global feature representation alone.

feature selection General Classification +1

WebVision Challenge: Visual Learning and Understanding With Web Data

no code implementations16 May 2017 Wen Li, Li-Min Wang, Wei Li, Eirikur Agustsson, Jesse Berent, Abhinav Gupta, Rahul Sukthankar, Luc van Gool

The 2017 WebVision challenge consists of two tracks, the image classification task on WebVision test set, and the transfer learning task on PASCAL VOC 2012 dataset.

Benchmarking Image Classification +1

R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

1 code implementation29 Jun 2017 Yingying Jiang, Xiangyu Zhu, Xiaobing Wang, Shuli Yang, Wei Li, Hua Wang, Pei Fu, Zhenbo Luo

In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrary-oriented texts in natural scene images.

Region Proposal Scene Text Detection +1

SAR Target Recognition Using the Multi-aspect-aware Bidirectional LSTM Recurrent Neural Networks

no code implementations25 Jul 2017 Fan Zhang, Chen Hu, Qiang Yin, Wei Li, Heng-Chao Li, Wen Hong

However, there is a limitation in current deep learning based ATR solution that each learning process only handle one SAR image, namely learning the static scattering information, while missing the space-varying information.

Dimensionality Reduction

WebVision Database: Visual Learning and Understanding from Web Data

no code implementations9 Aug 2017 Wen Li, Li-Min Wang, Wei Li, Eirikur Agustsson, Luc van Gool

Our new WebVision database and relevant studies in this work would benefit the advance of learning state-of-the-art visual models with minimum supervision based on web data.

Domain Adaptation

Hierarchical Gated Recurrent Neural Tensor Network for Answer Triggering

no code implementations17 Sep 2017 Wei Li, Yunfang Wu

In this paper, we focus on the problem of answer triggering ad-dressed by Yang et al. (2015), which is a critical component for a real-world question answering system.

Question Answering

Attribute Recognition by Joint Recurrent Learning of Context and Correlation

no code implementations ICCV 2017 Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li

Recognising semantic pedestrian attributes in surveillance images is a challenging task for computer vision, particularly when the imaging quality is poor with complex background clutter and uncontrolled viewing conditions, and the number of labelled training data is small.

Attribute Multi-Label Image Classification +1

Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models

no code implementations26 Oct 2017 Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw

We develop streaming keyword spotting systems using a recurrent neural network transducer (RNN-T) model: an all-neural, end-to-end trained, sequence-to-sequence model which jointly learns acoustic and language model components.

General Classification Language Modelling +1

Distributed Representation for Traditional Chinese Medicine Herb via Deep Learning Models

no code implementations6 Nov 2017 Wei Li, Zheng Yang

Traditional Chinese Medicine (TCM) has accumulated a big amount of precious resource in the long history of development.

Language Modelling

Training Simplification and Model Simplification for Deep Learning: A Minimal Effort Back Propagation Method

3 code implementations17 Nov 2017 Xu Sun, Xuancheng Ren, Shuming Ma, Bingzhen Wei, Wei Li, Jingjing Xu, Houfeng Wang, Yi Zhang

Based on the sparsified gradients, we further simplify the model by eliminating the rows or columns that are seldom updated, which will reduce the computational cost both in the training and decoding, and potentially accelerate decoding in real-world applications.

Appearance-and-Relation Networks for Video Classification

1 code implementation CVPR 2018 Limin Wang, Wei Li, Wen Li, Luc van Gool

Specifically, SMART blocks decouple the spatiotemporal learning module into an appearance branch for spatial modeling and a relation branch for temporal modeling.

Action Classification Action Recognition +6

Generalizing GANs: A Turing Perspective

no code implementations NeurIPS 2017 Roderich Gross, Yue Gu, Wei Li, Melvin Gauci

In this paper we examine how these algorithms relate to the Turing test, and derive what - from a Turing perspective - can be considered their defining features.

Exploration on Generating Traditional Chinese Medicine Prescription from Symptoms with an End-to-End method

no code implementations27 Jan 2018 Wei Li, Zheng Yang, Xu sun

Traditional Chinese Medicine (TCM) is an influential form of medical treatment in China and surrounding areas.

Improving Word Vector with Prior Knowledge in Semantic Dictionary

no code implementations27 Jan 2018 Wei Li, Yunfang Wu, Xueqiang Lv

Using low dimensional vector space to represent words has been very effective in many NLP tasks.

NER

Harmonious Attention Network for Person Re-Identification

1 code implementation CVPR 2018 Wei Li, Xiatian Zhu, Shaogang Gong

Existing person re-identification (re-id) methods either assume the availability of well-aligned person bounding box images as model input or rely on constrained attention selection mechanisms to calibrate misaligned images.

Person Re-Identification

Automatic Translating between Ancient Chinese and Contemporary Chinese with Limited Aligned Corpora

no code implementations5 Mar 2018 Zhiyuan Zhang, Wei Li, Qi Su

In this paper, we propose to build an end-to-end neural model to automatically translate between ancient and contemporary Chinese.

Sentence Translation

SeqFace: Make full use of sequence information for face recognition

1 code implementation17 Mar 2018 Wei Hu, Yangyu Huang, Fan Zhang, Ruirui Li, Wei Li, Guodong Yuan

Deep convolutional neural networks (CNNs) have greatly improved the Face Recognition (FR) performance in recent years.

Face Recognition Face Verification

Transferable Joint Attribute-Identity Deep Learning for Unsupervised Person Re-Identification

no code implementations CVPR 2018 Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li

Most existing person re-identification (re-id) methods require supervised model learning from a separate large set of pairwise labelled training data for every single camera pair.

Attribute Unsupervised Domain Adaptation +1

Adapting Blockchain Technology for Scientific Computing

no code implementations23 Apr 2018 Wei Li

Blockchain stores information into a chain of blocks, whose integrity is usually guaranteed by Proof of Work (PoW).

Cryptography and Security Distributed, Parallel, and Cluster Computing

Generative Model for Heterogeneous Inference

no code implementations26 Apr 2018 Honggang Zhou, Yunchun Li, Hailong Yang, Wei Li, Jie Jia

However, the learning and inference of BN model are NP-hard thus the number of stochastic variables in BN is highly constrained.

Denoising Image Inpainting

TreeSegNet: Adaptive Tree CNNs for Subdecimeter Aerial Image Segmentation

no code implementations29 Apr 2018 Kai Yue, Lei Yang, Ruirui Li, Wei Hu, Fan Zhang, Wei Li

For the task of subdecimeter aerial imagery segmentation, fine-grained semantic segmentation results are usually difficult to obtain because of complex remote sensing content and optical conditions.

Image Segmentation Segmentation +1

Adversarial adaptive 1-D convolutional neural networks for bearing fault diagnosis under varying working condition

no code implementations1 May 2018 Bo Zhang, Wei Li, Jie Hao, Xiao-Li Li, Meng Zhang

The layers between the source and target feature extractor are partially untied during the training stage to take both training efficiency and domain adaptation into consideration.

Domain Adaptation

Automatic Academic Paper Rating Based on Modularized Hierarchical Convolutional Neural Network

1 code implementation ACL 2018 Pengcheng Yang, Xu sun, Wei Li, Shuming Ma

As more and more academic papers are being submitted to conferences and journals, evaluating all these papers by professionals is time-consuming and can cause inequality due to the personal factors of the reviewers.

SGM: Sequence Generation Model for Multi-label Classification

1 code implementation COLING 2018 Pengcheng Yang, Xu sun, Wei Li, Shuming Ma, Wei Wu, Houfeng Wang

Further analysis of experimental results demonstrates that the proposed methods not only capture the correlations between labels, but also select the most informative words automatically when predicting different labels.

Classification General Classification +1

A Unified Approximation Framework for Compressing and Accelerating Deep Neural Networks

no code implementations26 Jul 2018 Yuzhe Ma, Ran Chen, Wei Li, Fanhua Shang, Wenjian Yu, Minsik Cho, Bei Yu

To address this issue, various approximation techniques have been investigated, which seek for a light weighted network with little performance degradation in exchange of smaller model size or faster inference.

General Classification Image Classification +1

NMT-based Cross-lingual Document Embeddings

no code implementations29 Jul 2018 Wei Li, Brian Mak

This paper further adds a distance constraint to the training objective function of NV so that the two embeddings of a parallel document are required to be as close as possible.

Cross-Lingual Document Classification Document Classification +5

Primal Meaning Recommendation via On-line Encyclopedia

no code implementations14 Aug 2018 Zhiyuan Zhang, Wei Li, Jingjing Xu, Xu sun

We define the primal meaning of an expression to be a frequently used sense of that expression from which its other frequent senses can be deduced.

Sememe Prediction: Learning Semantic Knowledge from Unstructured Textual Wiki Descriptions

no code implementations16 Aug 2018 Wei Li, Xuancheng Ren, Damai Dai, Yunfang Wu, Houfeng Wang, Xu sun

In the experiments, we take a real-world sememe knowledge base HowNet and the corresponding descriptions of the words in Baidu Wiki for training and evaluation.

Learning Universal Sentence Representations with Mean-Max Attention Autoencoder

1 code implementation EMNLP 2018 Minghua Zhang, Yunfang Wu, Weikang Li, Wei Li

In the encoding we propose a mean-max strategy that applies both mean and max pooling operations over the hidden vectors to capture diverse information of the input.

Sentence

Stochastic Answer Networks for SQuAD 2.0

5 code implementations24 Sep 2018 Xiaodong Liu, Wei Li, Yuwei Fang, Aerin Kim, Kevin Duh, Jianfeng Gao

This paper presents an extension of the Stochastic Answer Network (SAN), one of the state-of-the-art machine reading comprehension models, to be able to judge whether a question is unanswerable or not.

Machine Reading Comprehension Question Answering

Knowing Where to Look? Analysis on Attention of Visual Question Answering System

no code implementations9 Oct 2018 Wei Li, Zehuan Yuan, Xiangzhong Fang, Changhu Wang

Attention mechanisms have been widely used in Visual Question Answering (VQA) solutions due to their capacity to model deep cross-domain interactions.

Question Answering Visual Question Answering

The Effect of Explicit Structure Encoding of Deep Neural Networks for Symbolic Music Generation

1 code implementation20 Nov 2018 Ke Chen, Weilin Zhang, Shlomo Dubnov, Gus Xia, Wei Li

With recent breakthroughs in artificial neural networks, deep generative models have become one of the leading techniques for computational creativity.

Music Generation

Learning to discover and localize visual objects with open vocabulary

no code implementations25 Nov 2018 Keren Ye, Mingda Zhang, Wei Li, Danfeng Qin, Adriana Kovashka, Jesse Berent

To alleviate the cost of obtaining accurate bounding boxes for training today's state-of-the-art object detection models, recent weakly supervised detection work has proposed techniques to learn from image-level labels.

Object object-detection +1

Improved Expressivity Through Dendritic Neural Networks

no code implementations NeurIPS 2018 Xundong Wu, Xiangwen Liu, Wei Li, Qing Wu

In this study, we model such local nonlinearity of dendritic trees with our dendritic neural network (DENN) structure and apply this structure to typical machine learning tasks.

BIG-bench Machine Learning

DeepBillboard: Systematic Physical-World Testing of Autonomous Driving Systems

no code implementations27 Dec 2018 Husheng Zhou, Wei Li, Yuankun Zhu, Yuqun Zhang, Bei Yu, Lingming Zhang, Cong Liu

Furthermore, DeepBillboard is sufficiently robust and resilient for generating physical-world adversarial billboard tests for real-world driving under various weather conditions.

Autonomous Driving DNN Testing

AADS: Augmented Autonomous Driving Simulation using Data-driven Algorithms

1 code implementation23 Jan 2019 Wei Li, Chengwei Pan, Rong Zhang, Jiaping Ren, Yuexin Ma, Jin Fang, Feilong Yan, Qichuan Geng, Xinyu Huang, Huajun Gong, Weiwei Xu, Guoping Wang, Dinesh Manocha, Ruigang Yang

Our augmented approach combines the flexibility in a virtual environment (e. g., vehicle movements) with the richness of the real world to allow effective simulation of anywhere on earth.

Autonomous Driving

ORSIm Detector: A Novel Object Detection Framework in Optical Remote Sensing Imagery Using Spatial-Frequency Channel Features

no code implementations23 Jan 2019 Xin Wu, Danfeng Hong, Jiaojiao Tian, Jocelyn Chanussot, Wei Li, Ran Tao

To this end, we propose a novel object detection framework, called optical remote sensing imagery detector (ORSIm detector), integrating diverse channel features extraction, feature learning, fast image pyramid matching, and boosting strategy.

Novel Object Detection object-detection +1

Multi-Interest Network with Dynamic Routing for Recommendation at Tmall

5 code implementations17 Apr 2019 Chao Li, Zhiyuan Liu, Mengmeng Wu, Yuchi Xu, Pipei Huang, Huan Zhao, Guoliang Kang, Qiwei Chen, Wei Li, Dik Lun Lee

Industrial recommender systems usually consist of the matching stage and the ranking stage, in order to handle the billion-scale of users and items.

Clustering Information Retrieval +1

High-Resolution Network for Photorealistic Style Transfer

4 code implementations25 Apr 2019 Ming Li, Chunyang Ye, Wei Li

Photorealistic style transfer aims to transfer the style of one image to another, but preserves the original structure and detail outline of the content image, which makes the content image still look like a real shot after the style transfer.

Image Generation Vocal Bursts Intensity Prediction

Spatial-Spectral Feature Extraction via Deep ConvLSTM Neural Networks for Hyperspectral Image Classification

no code implementations9 May 2019 Wen-Shuai Hu, Heng-Chao Li, Lei Pan, Wei Li, Ran Tao, Qian Du

Particularly, long short-term memory (LSTM), as a special deep learning structure, has shown great ability in modeling long-term dependencies in the time dimension of video or the spectral dimension of HSIs.

General Classification Hyperspectral Image Classification

Multiple Policy Value Monte Carlo Tree Search

2 code implementations31 May 2019 Li-Cheng Lan, Wei Li, Ting-Han Wei, I-Chen Wu

Many of the strongest game playing programs use a combination of Monte Carlo tree search (MCTS) and deep neural networks (DNN), where the DNNs are used as policy or value evaluators.

Coherent Comment Generation for Chinese Articles with a Graph-to-Sequence Model

1 code implementation4 Jun 2019 Wei Li, Jingjing Xu, Yancheng He, ShengLi Yan, Yunfang Wu, Xu sun

In this paper, we propose to generate comments with a graph-to-sequence model that models the input news as a topic interaction graph.

Comment Generation Graph-to-Sequence

Monotonic Infinite Lookback Attention for Simultaneous Machine Translation

no code implementations ACL 2019 Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, Chung-Cheng Chiu, Semih Yavuz, Ruoming Pang, Wei Li, Colin Raffel

Simultaneous machine translation begins to translate each source sentence before the source speaker is finished speaking, with applications to live and streaming scenarios.

Machine Translation NMT +2

Coherent Comments Generation for Chinese Articles with a Graph-to-Sequence Model

1 code implementation ACL 2019 Wei Li, Jingjing Xu, Yancheng He, ShengLi Yan, Yunfang Wu, Xu sun

In this paper, we propose to generate comments with a graph-to-sequence model that models the input news as a topic interaction graph.

Graph-to-Sequence

Deformable Tube Network for Action Detection in Videos

no code implementations3 Jul 2019 Wei Li, Zehuan Yuan, Dashan Guo, Lei Huang, Xiangzhong Fang, Changhu Wang

To perform action detection, we design a 3D convolution network with skip connections for tube classification and regression.

Action Detection Action Recognition

Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection

1 code implementation ICCV 2019 Keren Ye, Mingda Zhang, Adriana Kovashka, Wei Li, Danfeng Qin, Jesse Berent

Learning to localize and name object instances is a fundamental problem in vision, but state-of-the-art approaches rely on expensive bounding box supervision.

Object object-detection +1

Toward better boundary preserved supervoxel segmentation for 3D point clouds

1 code implementation ISPRS Journal of Photogrammetry and Remote Sensing 2019 Yangbin Lin, Cheng Wang, Dawei Zhai, Wei Li, Jonathan Li

In this paper, we present a simple but effective supervoxel segmentation method for point clouds, which formalizes supervoxel segmentation as a subset selection problem.

Point Cloud Segmentation Segmentation

Neural Operator Search

no code implementations25 Sep 2019 Wei Li, Shaogang Gong, Xiatian Zhu

We address this limitation by additionally exploiting feature self-calibration operations, resulting in a heterogeneous search space.

Neural Architecture Search

A Three-dimensional Convolutional-Recurrent Network for Convective Storm Nowcasting

no code implementations1 Oct 2019 Wei Zhang, Wei Li, Lei Han

Very short-term convective storm forecasting, termed nowcasting, has long been an important issue and has attracted substantial interest.

Feature Engineering

Dynamic Upsampling of Smoke through Dictionary-based Learning

no code implementations21 Oct 2019 Kai Bai, Wei Li, Mathieu Desbrun, Xiaopei Liu

We propose a novel dictionary-based neural network which learns both a fast evaluation of sparse patch encoding and a dictionary of corresponding coarse and fine patches from a sequence of example simulations computed with any numerical solver.

Graphics

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

51 code implementations arXiv 2019 Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP).

Answer Generation Common Sense Reasoning +11

AutoRemover: Automatic Object Removal for Autonomous Driving Videos

1 code implementation28 Nov 2019 Rong Zhang, Wei Li, Peng Wang, Chenye Guan, Jin Fang, Yuhang Song, Jinhui Yu, Baoquan Chen, Weiwei Xu, Ruigang Yang

To deal with shadows, we build up an autonomous driving shadow dataset and design a deep neural network to detect shadows automatically.

Autonomous Driving Object +1

High Temporal Resolution Rainfall Runoff Modelling Using Long-Short-Term-Memory (LSTM) Networks

no code implementations7 Feb 2020 Wei Li, Amin Kiaghadi, Clint N. Dawson

Accurate and efficient models for rainfall runoff (RR) simulations are crucial for flood risk management.

Management

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

no code implementations28 Mar 2020 Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alex Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirko Visontai, Yonghui Wu, Yu Zhang, Ding Zhao

Thus far, end-to-end (E2E) models have not been shown to outperform state-of-the-art conventional models with respect to both quality, i. e., word error rate (WER), and latency, i. e., the time the hypothesis is finalized after the user stops speaking.

Sentence

Investigation of Singing Voice Separation for Singing Voice Detection in Polyphonic Music

no code implementations8 Apr 2020 Yifu Sun, xulong Zhang, Yi Yu, Xi Chen, Wei Li

Singing voice detection (SVD), to recognize vocal parts in the song, is an essential task in music information retrieval (MIR).

Information Retrieval Melody Extraction +2

Jointly Modeling Aspect and Sentiment with Dynamic Heterogeneous Graph Neural Networks

2 code implementations14 Apr 2020 Shu Liu, Wei Li, Yunfang Wu, Qi Su, Xu sun

Target-Based Sentiment Analysis aims to detect the opinion aspects (aspect extraction) and the sentiment polarities (sentiment detection) towards them.

Aspect Extraction Sentiment Analysis

Query-Variant Advertisement Text Generation with Association Knowledge

1 code implementation14 Apr 2020 Siyu Duan, Wei Li, Cai Jing, Yancheng He, Yunfang Wu, Xu sun

In this paper, we propose the query-variant advertisement text generation task that aims to generate candidate advertisement texts for different web search queries with various needs based on queries and item keywords.

Text Generation

Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech

no code implementations19 May 2020 Wenjie Li, Benlai Tang, Xiang Yin, Yushi Zhao, Wei Li, Kang Wang, Hao Huang, Yuxuan Wang, Zejun Ma

Accent conversion (AC) transforms a non-native speaker's accent into a native accent while maintaining the speaker's voice timbre.

Leveraging Graph to Improve Abstractive Multi-Document Summarization

2 code implementations ACL 2020 Wei Li, Xinyan Xiao, Jiachen Liu, Hua Wu, Haifeng Wang, Junping Du

Graphs that capture relations between textual units have great benefits for detecting salient information from multiple documents and generating overall coherent summaries.

Document Summarization Multi-Document Summarization

BiERU: Bidirectional Emotional Recurrent Unit for Conversational Sentiment Analysis

1 code implementation31 May 2020 Wei Li, Wei Shao, Shaoxiong Ji, Erik Cambria

Sentiment analysis in conversations has gained increasing attention in recent years for the growing amount of applications it can serve, e. g., sentiment analysis, recommender systems, and human-robot interaction.

Emotion Recognition in Conversation Sentence +1

Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution

no code implementations CVPR 2020 Xibin Song, Yuchao Dai, Dingfu Zhou, Liu Liu, Wei Li, Hongdng Li, Ruigang Yang

Second, we propose a new framework for real-world DSR, which consists of four modules : 1) An iterative residual learning module with deep supervision to learn effective high-frequency components of depth maps in a coarse-to-fine manner; 2) A channel attention strategy to enhance channels with abundant high-frequency components; 3) A multi-stage fusion module to effectively re-exploit the results in the coarse-to-fine process; and 4) A depth refinement module to improve the depth map by TGV regularization and input loss.

Benchmarking Depth Map Super-Resolution

Modeling the Stock Relation with Graph Network for Overnight Stock Movement Prediction

no code implementations26 Jun 2020 Wei Li, Ruihan Bao, Keiko Harimoto, Deli Chen, Jingjing Xu and Qi Su

Further analysis shows that the introduction of the graph enables our model to predict the movement of stocks that are not directly associated with news as well as the whole market, which is not available in most previous methods.

Relation

PerMO: Perceiving More at Once from a Single Image for Autonomous Driving

no code implementations16 Jul 2020 Feixiang Lu, Zongdai Liu, Xibin Song, Dingfu Zhou, Wei Li, Hui Miao, Miao Liao, Liangjun Zhang, Bin Zhou, Ruigang Yang, Dinesh Manocha

We present a novel approach to detect, segment, and reconstruct complete textured 3D models of vehicles from a single image for autonomous driving.

3D Reconstruction Autonomous Driving +3

Vehicle Detection of Multi-source Remote Sensing Data Using Active Fine-tuning Network

no code implementations16 Jul 2020 Xin Wu, Wei Li, Danfeng Hong, Jiaojiao Tian, Ran Tao, Qian Du

In addition, the generalization ability of Ms-AFt in dense remote sensing scenes is further verified on stereo aerial imagery of a large camping site.

Transfer Learning

DVI: Depth Guided Video Inpainting for Autonomous Driving

2 code implementations ECCV 2020 Miao Liao, Feixiang Lu, Dingfu Zhou, Sibo Zhang, Wei Li, Ruigang Yang

To get clear street-view and photo-realistic simulation in autonomous driving, we present an automatic video inpainting algorithm that can remove traffic agents from videos and synthesize missing regions with the guidance of depth/point cloud.

Autonomous Driving Image Inpainting +2

TEAM: We Need More Powerful Adversarial Examples for DNNs

1 code implementation31 Jul 2020 Ya-guan Qian, Ximin Zhang, Bin Wang, Wei Li, Zhaoquan Gu, Haijiang Wang, Wassim Swaileh

In this paper, we propose a novel method (TEAM, Taylor Expansion-Based Adversarial Methods) to generate more powerful adversarial examples than previous methods.

Exploring the Impacts from Datasets to Monocular Depth Estimation (MDE) Models with MineNavi

no code implementations19 Aug 2020 Xiangtong Wang, Binbin Liang, Menglong Yang, Wei Li

Current computer vision tasks based on deep learning require a huge amount of data with annotations for model training or testing, especially in some dense estimation tasks, such as optical flow segmentation and depth estimation.

Monocular Depth Estimation Optical Flow Estimation +2

Transformer based Multilingual document Embedding model

no code implementations19 Aug 2020 Wei Li, Brian Mak

One of the current state-of-the-art multilingual document embedding model LASER is based on the bidirectional LSTM neural machine translation model.

Document Embedding Machine Translation +3

Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition

no code implementations30 Aug 2020 Wei Li, James Qin, Chung-Cheng Chiu, Ruoming Pang, Yanzhang He

The 2nd-pass model plays a key role in the quality improvement of the end-to-end model to surpass the conventional model.

speech-recognition Speech Recognition

Adversarial Privacy Preserving Graph Embedding against Inference Attack

1 code implementation30 Aug 2020 Kaiyang Li, Guangchun Luo, Yang Ye, Wei Li, Shihao Ji, Zhipeng Cai

In this paper, we propose Adversarial Privacy Graph Embedding (APGE), a graph adversarial training framework that integrates the disentangling and purging mechanisms to remove users' private information from learned node representations.

Graph Embedding Inference Attack +4

Temporal optical neurons for serial deep learning

no code implementations4 Sep 2020 Zhixing Lin, Shuqian Sun, Jose Azana, Wei Li, Ninghua Zhu, Ming Li

This concept represents a novel one-dimensional realization of artificial neural networks, enabling an efficient application of optical deep learning methods to the analysis and processing of serial data signals, while offering a new overall perspective for the temporal signal processing.

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

1 code implementation9 Sep 2020 Quan Wang, Ignacio Lopez Moreno, Mert Saglam, Kevin Wilson, Alan Chiao, Renjie Liu, Yanzhang He, Wei Li, Jason Pelecanos, Marily Nika, Alexander Gruenstein

We introduce VoiceFilter-Lite, a single-channel source separation model that runs on the device to preserve only the speech signals from a target user, as part of a streaming speech recognition system.

speech-recognition Speech Recognition

Hidden Markov Models for Pipeline Damage Detection Using Piezoelectric Transducers

no code implementations30 Sep 2020 Mingchi Zhang, Xuemin Chen, Wei Li

However, the negative pressure waves or guided stress waves may not be easily detected with environmental interference, e. g., the oil and gas pipelines in offshore environment.

SMOT: Single-Shot Multi Object Tracking

1 code implementation30 Oct 2020 Wei Li, Yuanjun Xiong, Shuo Yang, Siqi Deng, Wei Xia

We combine this scheme with SSD detectors by proposing a novel tracking anchor assignment module.

Multi-Object Tracking Object

Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks

1 code implementation CVPR 2021 Xiaoxiao Long, Lingjie Liu, Wei Li, Christian Theobalt, Wenping Wang

We present a novel method for multi-view depth estimation from a single video, which is a critical task in various applications, such as perception, reconstruction and robot navigation.

Depth Estimation Robot Navigation

Decoupled Self Attention for Accurate One Stage Object Detection

1 code implementation14 Dec 2020 Kehe WU, Zuge Chen, Qi Ma, Xiaoliang Zhang, Wei Li

When DSA module and object confidence task are applied in RetinaNet together, the detection performances based on ResNet50 and ResNet101 can be increased by 1. 0% AP and 1. 4% AP respectively.

Object object-detection +2

CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth

no code implementations18 Dec 2020 Xingxing Zuo, Nathaniel Merrill, Wei Li, Yong liu, Marc Pollefeys, Guoquan Huang

In this work, we present a lightweight, tightly-coupled deep depth network and visual-inertial odometry (VIO) system, which can provide accurate state estimates and dense depth maps of the immediate surroundings.

Depth Estimation Depth Prediction +1

Gradient Descent Averaging and Primal-dual Averaging for Strongly Convex Optimization

no code implementations29 Dec 2020 Wei Tao, Wei Li, Zhisong Pan, Qing Tao

In order to remove this factor, we first develop gradient descent averaging (GDA), which is a general projection-based dual averaging algorithm in the strongly convex setting.

Detection Booster Training: A detection booster training method for improving the accuracy of classifiers.

no code implementations1 Jan 2021 Ali Ghobadzadeh, Deepak Sridhar, Juwei Lu, Wei Li

In this paper, we probe this direction by deriving a relationship between the estimation of unknown parameters of the probability density function (pdf) of input data and classification accuracy.

Face Recognition Image Classification

Rethinking Graph Neural Networks for Graph Coloring

no code implementations1 Jan 2021 Wei Li, Ruxuan Li, Yuzhe ma, Siu On Chan, Bei Yu

To characterize the power of GNNs for the graph coloring problem, we first formalize the discrimination power of GNNs as the capability to assign nodes different colors.

A Simple Feature Augmentation for Domain Generalization

no code implementations ICCV 2021 Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, Timothy M. Hospedales

The topical domain generalization (DG) problem asks trained models to perform well on an unseen target domain with different data statistics from the source training domains.

Data Augmentation Domain Generalization

Day-ahead electricity price prediction applying hybrid models of LSTM-based deep learning methods and feature selection algorithms under consideration of market coupling

no code implementations13 Jan 2021 Wei Li, Denis Mike Becker

In the context of trade liberalisation and market harmonisation in the European markets, accurate price forecasting becomes difficult for electricity market participants to obtain because electricity forecasting requires the consideration of features from ever-growing coupling markets.

Feature Importance feature selection +2

Correlated interaction effects in three-dimensional semi-Dirac semimetal

no code implementations14 Jan 2021 Jing-Rong Wang, Wei Li, Chang-Jin Zhang

The physical essences of the quantum critical points are determined by analyzing the susceptibility exponents for all of the source terms in particle-hole and particle-particle channels.

Strongly Correlated Electrons Materials Science

Phase Diagram of Triangular Lattice Quantum Ising Model under External Field

no code implementations27 Jan 2021 Yuan Da Liao, Han Li, Zheng Yan, Hao-Tian Wei, Wei Li, Yang Qi, Zi Yang Meng

Quantum Ising model on a triangular lattice hosts a finite temperature Berezinskii-Kosterlitz-Thouless (BKT) phase with emergent U(1) symmetry, and it will transit into an up-up-down (UUD) phase with $C_3$ symmetry breaking upon an infinitesimal external field along the longitudinal direction, but the overall phase diagram spanned by the axes of external field and temperature remains opaque due to the lack of systematic invesitgations with controlled methodologies.

Strongly Correlated Electrons Statistical Mechanics

LEAD: LiDAR Extender for Autonomous Driving

no code implementations16 Feb 2021 Jianing Zhang, Wei Li, Honggang Gou, Lu Fang, Ruigang Yang

In this paper, we propose LEAD, i. e., LiDAR Extender for Autonomous Driving, to extend the MEMS LiDAR by coupled image w. r. t both FoV and range.

Autonomous Driving Depth Completion +1

Significant Inverse Magnetocaloric Effect induced by Quantum Criticality

no code implementations17 Feb 2021 Tao Liu, Xin-Yang Liu, Yuan Gao, Hai Jin, Jun He, Xian-Lei Sheng, Wentao Jin, Ziyu Chen, Wei Li

Strong fluctuations in the low-$T$ quantum critical regime can give rise to a large thermal entropy change and thus significant cooling effect when approaching the QCP.

Strongly Correlated Electrons

Do Transformer Modifications Transfer Across Implementations and Applications?

1 code implementation EMNLP 2021 Sharan Narang, Hyung Won Chung, Yi Tay, William Fedus, Thibault Fevry, Michael Matena, Karishma Malkan, Noah Fiedel, Noam Shazeer, Zhenzhong Lan, Yanqi Zhou, Wei Li, Nan Ding, Jake Marcus, Adam Roberts, Colin Raffel

The research community has proposed copious modifications to the Transformer architecture since it was introduced over three years ago, relatively few of which have seen widespread adoption.

A Universal Urbach Rule for Disordered Organic Semiconductors

no code implementations25 Feb 2021 Christina Kaiser, Oskar J. Sandberg, Nasim Zarrabi, Wei Li, Paul Meredith, Ardalan Armin

A simple model is presented that explains absorption line-shapes of disordered systems, and we also provide a strategy to determine the excitonic disorder energy.

Optics Disordered Systems and Neural Networks

Modelling brain based on canonical ensemble with functional MRI: A thermodynamic exploration on neural system

no code implementations26 Feb 2021 Chenxi Zhou, Bin Yang, Wenliang Fan, Wei Li

(3) The detection of neural disease was demonstrated to be benefit from thermodynamic model, implying the immense potential of thermodynamics in auxiliary diagnosis.

Path-specific Underwater Acoustic Channel Tracking and its Application in Passive Time Reversal Mirror

no code implementations1 Mar 2021 Xiuqing Li, Wei Li, Xinlin Yi, Qihang Huang, Yuhang Wang, Chenzhe Ye

With the path-specific parameters obtained by the proposed channel tracking, the proposed PTRM can not only match the time dispersion as conventional PTRM, but also the doubly-spread channel, since the path-specific delay and Doppler scaler factor can help to match the channel in both time and frequency domain.

Dynamic Underwater Acoustic Channel Tracking for Correlated Rapidly Time-varying Channels

no code implementations1 Mar 2021 Qihang Huang, Wei Li, Weicheng Zhan, Yuhang Wang, Rongrong Guo

A model based on the underwater acoustic channel's correlation can be used as the state-space model in the Kalman filter to improve the underwater acoustic channel tracking compared that without a model.

Transferable Semantic Augmentation for Domain Adaptation

1 code implementation CVPR 2021 Shuang Li, Mixue Xie, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Wei Li

To remedy this, we propose a Transferable Semantic Augmentation (TSA) approach to enhance the classifier adaptation ability through implicitly generating source features towards target semantics.

Domain Adaptation

Dynamic Domain Adaptation for Efficient Inference

1 code implementation CVPR 2021 Shuang Li, Jinming Zhang, Wenxuan Ma, Chi Harold Liu, Wei Li

Domain adaptation (DA) enables knowledge transfer from a labeled source domain to an unlabeled target domain by reducing the cross-domain distribution discrepancy.

Domain Generalization Transfer Learning

Discover the Hidden Attack Path in Multi-domain Cyberspace Based on Reinforcement Learning

no code implementations15 Apr 2021 Lei Zhang, Wei Bai, Wei Li, Shiming Xia, Qibin Zheng

To achieve these results, we pose discovering attack paths as a Reinforcement Learning (RL) problem and train an agent to discover multi-domain cyberspace attack paths.

Reinforcement Learning (RL)

ASFM-Net: Asymmetrical Siamese Feature Matching Network for Point Completion

1 code implementation19 Apr 2021 Yaqi Xia, Yan Xia, Wei Li, Rui Song, Kailang Cao, Uwe Stilla

We tackle the problem of object completion from point clouds and propose a novel point cloud completion network employing an Asymmetrical Siamese Feature Matching strategy, termed as ASFM-Net.

Point Cloud Completion

Temporal Knowledge Graph Reasoning Based on Evolutional Representation Learning

1 code implementation21 Apr 2021 Zixuan Li, Xiaolong Jin, Wei Li, Saiping Guan, Jiafeng Guo, HuaWei Shen, Yuanzhuo Wang, Xueqi Cheng

To capture these properties effectively and efficiently, we propose a novel Recurrent Evolution network based on Graph Convolution Network (GCN), called RE-GCN, which learns the evolutional representations of entities and relations at each timestamp by modeling the KG sequence recurrently.

Representation Learning

Multi-scale PIIFD for Registration of Multi-source Remote Sensing Images

1 code implementation26 Apr 2021 Chenzhong Gao, Wei Li

This paper aims at providing multi-source remote sensing images registered in geometric space for image fusion.

Image Registration

MUSE: Multi-faceted Attention for Signed Network Embedding

no code implementations29 Apr 2021 Dengcheng Yan, Youwen Zhang, Wei Li, Yiwen Zhang

Signed network embedding is an approach to learn low-dimensional representations of nodes in signed networks with both positive and negative links, which facilitates downstream tasks such as link prediction with general data mining frameworks.

Link Prediction Network Embedding

BASS: Boosting Abstractive Summarization with Unified Semantic Graph

no code implementations ACL 2021 Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Ziqiang Cao, Sujian Li, Hua Wu, Haifeng Wang

Abstractive summarization for long-document or multi-document remains challenging for the Seq2Seq architecture, as Seq2Seq is not good at analyzing long-distance relations in text.

Abstractive Text Summarization Document Summarization +2

DiaKG: an Annotated Diabetes Dataset for Medical Knowledge Graph Construction

1 code implementation31 May 2021 Dejie Chang, Mosha Chen, Chaozhen Liu, LiPing Liu, Dongdong Li, Wei Li, Fei Kong, Bangchang Liu, Xiaobin Luo, Ji Qi, Qiao Jin, Bin Xu

In order to accelerate the research for domain-specific knowledge graphs in the medical domain, we introduce DiaKG, a high-quality Chinese dataset for Diabetes knowledge graph, which contains 22, 050 entities and 6, 890 relations in total.

graph construction Knowledge Graphs +4

Search from History and Reason for Future: Two-stage Reasoning on Temporal Knowledge Graphs

no code implementations ACL 2021 Zixuan Li, Xiaolong Jin, Saiping Guan, Wei Li, Jiafeng Guo, Yuanzhuo Wang, Xueqi Cheng

Specifically, at the clue searching stage, CluSTeR learns a beam search policy via reinforcement learning (RL) to induce multiple clues from historical facts.

Knowledge Graphs Reinforcement Learning (RL)

Generative Adversarial Networks: A Survey Towards Private and Secure Applications

no code implementations7 Jun 2021 Zhipeng Cai, Zuobin Xiong, Honghui Xu, Peng Wang, Wei Li, Yi Pan

Generative Adversarial Networks (GAN) have promoted a variety of applications in computer vision, natural language processing, etc.

MST: Masked Self-Supervised Transformer for Visual Representation

no code implementations NeurIPS 2021 Zhaowen Li, Zhiyang Chen, Fan Yang, Wei Li, Yousong Zhu, Chaoyang Zhao, Rui Deng, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang

More importantly, the masked tokens together with the remaining tokens are further recovered by a global image decoder, which preserves the spatial information of the image and is more friendly to the downstream dense prediction tasks.

Language Modelling Masked Language Modeling +3

Toward Less Hidden Cost of Code Completion with Acceptance and Ranking Models

no code implementations26 Jun 2021 Jingxuan Li, Rui Huang, Wei Li, Kai Yao, Weiguo Tan

We integrate this ranking scheme with two frequency models and a GPT-2 styled language model, along with the acceptance model to yield 27. 80% and 37. 64% increase in TOP1 and TOP5 accuracy, respectively.

Code Completion Language Modelling

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

no code implementations6 Jul 2021 Wei Li, Yuanjun Xiong, Shuo Yang, Mingze Xu, Yongxin Wang, Wei Xia

We design a new instance-to-track matching objective to learn appearance embedding that compares a candidate detection to the embedding of the tracks persisted in the tracker.

Multiple Object Tracking Object +1

Video 3D Sampling for Self-supervised Representation Learning

no code implementations8 Jul 2021 Wei Li, Dezhao Luo, Bo Fang, Yu Zhou, Weiping Wang

As a result, we can leverage the spatial information (the size of objects), temporal information (the direction and magnitude of motions) as our learning target.

Action Recognition Representation Learning +2

Residual Attention Based Network for Automatic Classification of Phonation Modes

no code implementations18 Jul 2021 Xiaoheng Sun, Yiliang Jiang, Wei Li

Phonation mode is an essential characteristic of singing style as well as an important expression of performance.

Classification Feature Engineering +3

A Data-driven Explainable Case-based Reasoning Approach for Financial Risk Detection

no code implementations19 Jul 2021 Wei Li, Florentina Paraschiv, Georgios Sermpinis

The rapid development of artificial intelligence methods contributes to their wide applications for forecasting various financial risks in recent years.

Wavelet-Based Network For High Dynamic Range Imaging

1 code implementation3 Aug 2021 Tianhong Dai, Wei Li, Xilei Cao, Jianzhuang Liu, Xu Jia, Ales Leonardis, Youliang Yan, Shanxin Yuan

The frequency-guided upsampling module reconstructs details from multiple frequency-specific components with rich details.

Optical Flow Estimation Vocal Bursts Intensity Prediction

Semantic Concentration for Domain Adaptation

1 code implementation ICCV 2021 Shuang Li, Mixue Xie, Fangrui Lv, Chi Harold Liu, Jian Liang, Chen Qin, Wei Li

To tackle this issue, we propose Semantic Concentration for Domain Adaptation (SCDA), which encourages the model to concentrate on the most principal features via the pair-wise adversarial alignment of prediction distributions.

Domain Adaptation Transfer Learning

Multi defect detection and analysis of electron microscopy images with deep learning

no code implementations19 Aug 2021 Mingren Shen, Guanzhao Li, Dongxia Wu, YuHan Liu, Jacob Greaves, Wei Hao, Nathaniel J. Krakauer, Leah Krudy, Jacob Perez, Varun Sreenivasan, Bryan Sanchez, Oigimer Torres, Wei Li, Kevin Field, Dane Morgan

Electron microscopy is widely used to explore defects in crystal structures, but human detecting of defects is often time-consuming, error-prone, and unreliable, and is not scalable to large numbers of images or real-time analysis.

Defect Detection

ASAT: Adaptively Scaled Adversarial Training in Time Series

no code implementations20 Aug 2021 Zhiyuan Zhang, Wei Li, Ruihan Bao, Keiko Harimoto, Yunfang Wu, Xu sun

Besides the security concerns of potential adversarial examples, adversarial training can also improve the generalization ability of neural networks, train robust neural networks, and provide interpretability for neural networks.

Adversarial Robustness Time Series +1

Musical Tempo Estimation Using a Multi-scale Network

no code implementations3 Sep 2021 Xiaoheng Sun, Qiqi He, Yongwei Gao, Wei Li

Recently, some single-step systems without onset detection have shown their effectiveness in automatic musical tempo estimation.

Tied & Reduced RNN-T Decoder

no code implementations15 Sep 2021 Rami Botros, Tara N. Sainath, Robert David, Emmanuel Guzman, Wei Li, Yanzhang He

Previous works on the Recurrent Neural Network-Transducer (RNN-T) models have shown that, under some conditions, it is possible to simplify its prediction network with little or no loss in recognition accuracy (arXiv:2003. 07705 [eess. AS], [2], arXiv:2012. 06749 [cs. CL]).

Language Modelling

CENN: Conservative energy method based on neural networks with subdomains for solving variational problems involving heterogeneous and complex geometries

1 code implementation25 Sep 2021 Yizheng Wang, Jia Sun, Wei Li, Zaiyuan Lu, Yinghua Liu

The advantage of the proposed method is higher efficiency, more accurate, and less hyperparameters than the strong form PINN with subdomains.

A General Gaussian Heatmap Label Assignment for Arbitrary-Oriented Object Detection

1 code implementation27 Sep 2021 Zhanchao Huang, Wei Li, Xiang-Gen Xia, Ran Tao

Specifically, an anchor-free object-adaptation label assignment (OLA) strategy is presented to define the positive candidates based on two-dimensional (2-D) oriented Gaussian heatmaps, which reflect the shape and direction features of arbitrary-oriented objects.

Ranked #31 on Object Detection In Aerial Images on DOTA (using extra training data)

object-detection Object Detection In Aerial Images +1

Referring Self-supervised Learning on 3D Point Cloud

no code implementations29 Sep 2021 Runnan Chen, Xinge Zhu, Nenglun Chen, Dawei Wang, Wei Li, Yuexin Ma, Ruigang Yang, Wenping Wang

In this paper, we study a new problem named Referring Self-supervised Learning (RSL) on 3D scene understanding: Given the 3D synthetic models with labels and the unlabeled 3D real scene scans, our goal is to distinguish the identical semantic objects on an unseen scene according to the referring synthetic 3D models.

Scene Understanding Self-Supervised Learning

MC-LCR: Multi-modal contrastive classification by locally correlated representations for effective face forgery detection

no code implementations7 Oct 2021 Gaojian Wang, Qian Jiang, Xin Jin, Wei Li, Xiaohui Cui

Moreover, we make a key observation that subtle forgery artifacts can be further exposed in the patch-wise phase and amplitude spectrum and exhibit different clues.

Deep Learning for UAV-based Object Detection and Tracking: A Survey

no code implementations25 Oct 2021 Xin Wu, Wei Li, Danfeng Hong, Ran Tao, Qian Du

Owing to effective and flexible data acquisition, unmanned aerial vehicle (UAV) has recently become a hotspot across the fields of computer vision (CV) and remote sensing (RS).

Management Object +3

SgSum: Transforming Multi-document Summarization into Sub-graph Selection

1 code implementation25 Oct 2021 Moye Chen, Wei Li, Jiachen Liu, Xinyan Xiao, Hua Wu, Haifeng Wang

Comparing with traditional methods, our method has two main advantages: (1) the relations between sentences are captured by modeling both the graph structure of the whole document set and the candidate sub-graphs; (2) directly outputs an integrate summary in the form of sub-graph which is more informative and coherent.

Document Summarization Multi-Document Summarization +1

FastFlow: Unsupervised Anomaly Detection and Localization via 2D Normalizing Flows

5 code implementations15 Nov 2021 Jiawei Yu, Ye Zheng, Xiang Wang, Wei Li, Yushuang Wu, Rui Zhao, Liwei Wu

However, current methods can not effectively map image features to a tractable base distribution and ignore the relationship between local and global features which are important to identify anomalies.

Unsupervised Anomaly Detection Weakly Supervised Defect Detection

Variational Autoencoder with CCA for Audio-Visual Cross-Modal Retrieval

no code implementations5 Dec 2021 Jiwei Zhang, Yi Yu, Suhua Tang, Jianming Wu, Wei Li

On the one hand, audio encoder and visual encoder separately encode audio data and visual data into two different latent spaces.

Cross-Modal Retrieval Information Retrieval +1

Transfer learning of phase transitions in percolation and directed percolation

no code implementations31 Dec 2021 Jianmin Shen, Feiyi Liu, Shiyang Chen, Dian Xu, Xiangna Chen, Shengfeng Deng, Wei Li, Gabor Papp, Chunbin Yang

With the DANN, only a small fraction of input configurations (2d images) needs to be labeled, which is automatically chosen, in order to capture the critical point.

Transfer Learning

PPDL: Predicate Probability Distribution Based Loss for Unbiased Scene Graph Generation

no code implementations CVPR 2022 Wei Li, Haiwei Zhang, Qijie Bai, Guoqing Zhao, Ning Jiang, Xiaojie Yuan

However, the application value of SG on downstream tasks is severely limited by the predicate classification bias, which is caused by long-tailed data and presented as semantic bias of predicted relation predicates.

Graph Generation Predicate Classification +1

Large-Scale Video Panoptic Segmentation in the Wild: A Benchmark

1 code implementation CVPR 2022 Jiaxu Miao, Xiaohan Wang, Yu Wu, Wei Li, Xu Zhang, Yunchao Wei, Yi Yang

In contrast, our large-scale VIdeo Panoptic Segmentation in the Wild (VIPSeg) dataset provides 3, 536 videos and 84, 750 frames with pixel-level panoptic annotations, covering a wide range of real-world scenarios and categories.

Segmentation Video Panoptic Segmentation

Evolutionary Action Selection for Gradient-based Policy Learning

no code implementations12 Jan 2022 Yan Ma, Tianxing Liu, Bingsheng Wei, Yi Liu, Kang Xu, Wei Li

Evolutionary Algorithms (EAs) and Deep Reinforcement Learning (DRL) have recently been integrated to take the advantage of the both methods for better exploration and exploitation. The evolutionary part in these hybrid methods maintains a population of policy networks. However, existing methods focus on optimizing the parameters of policy network, which is usually high-dimensional and tricky for EA. In this paper, we shift the target of evolution from high-dimensional parameter space to low-dimensional action space. We propose Evolutionary Action Selection-Twin Delayed Deep Deterministic Policy Gradient (EAS-TD3), a novel hybrid method of EA and DRL. In EAS, we focus on optimizing the action chosen by the policy network and attempt to obtain high-quality actions to promote policy learning through an evolutionary algorithm.

Continuous Control Evolutionary Algorithms

DDU-Net: Dual-Decoder-U-Net for Road Extraction Using High-Resolution Remote Sensing Images

no code implementations18 Jan 2022 Ying Wang, Yuexing Peng, Xinran Liu, Wei Li, George C. Alexandropoulos, Junchuan Yu, Daqing Ge, Wei Xiang

Extracting roads from high-resolution remote sensing images (HRSIs) is vital in a wide variety of applications, such as autonomous driving, path planning, and road navigation.

Autonomous Driving

A Joint Morphological Profiles and Patch Tensor Change Detection for Hyperspectral Imagery

no code implementations20 Jan 2022 Zengfu Hou, Wei Li

Multi-temporal hyperspectral images can be used to detect changed information, which has gradually attracted researchers' attention.

Change Detection Image Reconstruction

TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

1 code implementation2 Feb 2022 Ke Chen, Shuai Yu, Cheng-i Wang, Wei Li, Taylor Berg-Kirkpatrick, Shlomo Dubnov

In this paper, we propose TONet, a plug-and-play model that improves both tone and octave perceptions by leveraging a novel input representation and a novel network architecture.

Information Retrieval Melody Extraction +2

DEEPCHORUS: A Hybrid Model of Multi-scale Convolution and Self-attention for Chorus Detection

1 code implementation13 Feb 2022 Qiqi He, Xiaoheng Sun, Yi Yu, Wei Li

Chorus detection is a challenging problem in musical signal processing as the chorus often repeats more than once in popular songs, usually with rich instruments and complex rhythm forms.

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

1 code implementation CVPR 2022 Yuchao Wang, Haochen Wang, Yujun Shen, Jingjing Fei, Wei Li, Guoqiang Jin, Liwei Wu, Rui Zhao, Xinyi Le

A common practice is to select the highly confident predictions as the pseudo ground-truth, but it leads to a problem that most pixels may be left unused due to their unreliability.

Semi-Supervised Semantic Segmentation

A Lightweight and Detector-free 3D Single Object Tracker on Point Clouds

1 code implementation8 Mar 2022 Yan Xia, Qiangqiang Wu, Wei Li, Antoni B. Chan, Uwe Stilla

Recent works on 3D single object tracking treat the task as a target-specific 3D detection task, where an off-the-shelf 3D detector is commonly employed for the tracking.

3D Single Object Tracking motion prediction +1

Faithfulness in Natural Language Generation: A Systematic Survey of Analysis, Evaluation and Optimization Methods

no code implementations10 Mar 2022 Wei Li, Wenhao Wu, Moye Chen, Jiachen Liu, Xinyan Xiao, Hua Wu

In this survey, we provide a systematic overview of the research progress on the faithfulness problem of NLG, including problem analysis, evaluation metrics and optimization methods.

Abstractive Text Summarization Data-to-Text Generation +2

Efficient universal shuffle attack for visual object tracking

no code implementations14 Mar 2022 Siao Liu, Zhaoyu Chen, Wei Li, Jiwei Zhu, Jiafeng Wang, Wenqiang Zhang, Zhongxue Gan

Recently, adversarial attacks have been applied in visual object tracking to deceive deep trackers by injecting imperceptible perturbations into video frames.

Adversarial Attack Computational Efficiency +2

UniVIP: A Unified Framework for Self-Supervised Visual Pre-training

no code implementations CVPR 2022 Zhaowen Li, Yousong Zhu, Fan Yang, Wei Li, Chaoyang Zhao, Yingying Chen, Zhiyang Chen, Jiahao Xie, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang

Furthermore, our method can also exploit single-centric-object dataset such as ImageNet and outperforms BYOL by 2. 5% with the same pre-training epochs in linear probing, and surpass current self-supervised object detection methods on COCO dataset, demonstrating its universality and potential.

Image Classification Object +4

Complex Evolutional Pattern Learning for Temporal Knowledge Graph Reasoning

1 code implementation ACL 2022 Zixuan Li, Saiping Guan, Xiaolong Jin, Weihua Peng, Yajuan Lyu, Yong Zhu, Long Bai, Wei Li, Jiafeng Guo, Xueqi Cheng

Furthermore, these models are all trained offline, which cannot well adapt to the changes of evolutional patterns from then on.

UNIMO-2: End-to-End Unified Vision-Language Grounded Learning

1 code implementation Findings (ACL) 2022 Wei Li, Can Gao, guocheng niu, Xinyan Xiao, Hao liu, Jiachen Liu, Hua Wu, Haifeng Wang

In particular, we propose to conduct grounded learning on both images and texts via a sharing grounded space, which helps bridge unaligned images and texts, and align the visual and textual semantic spaces on different types of corpora.

Open-Vocabulary DETR with Conditional Matching

1 code implementation22 Mar 2022 Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy

To this end, we propose a novel open-vocabulary detector based on DETR -- hence the name OV-DETR -- which, once trained, can detect any object given its class name or an exemplar image.

Language Modelling object-detection +1

Cannot find the paper you are looking for? You can Submit a new open access paper.