Search Results for author: Wei Liu

Found 368 papers, 140 papers with code

Lexicon-Based Graph Convolutional Network for Chinese Word Segmentation

no code implementations Findings (EMNLP) 2021 Kaiyu Huang, Hao Yu, Junpeng Liu, Wei Liu, Jingxiang Cao, Degen Huang

Experimental results on five benchmarks and four cross-domain datasets show the lexicon-based graph convolutional network successfully captures the information of candidate words and helps to improve performance on the benchmarks (Bakeoff-2005 and CTB6) and the cross-domain datasets (SIGHAN-2010).

Chinese Word Segmentation

PointPWC-Net: Cost Volume on Point Clouds for (Self-)Supervised Scene Flow Estimation

1 code implementation ECCV 2020 Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, Li Fuxin

We propose a novel end-to-end deep scene flow model, called PointPWC-Net, that directly processes 3D point cloud scenes with large motions in a coarse-to-fine fashion.

Scene Flow Estimation

QuickGraph: A Rapid Annotation Tool for Knowledge Graph Extraction from Technical Text

1 code implementation ACL 2022 Tyler Bikaun, Michael Stewart, Wei Liu

Acquiring high-quality annotated corpora for complex multi-task information extraction (MT-IE) is an arduous and costly process for human-annotators.

LexiClean: An annotation tool for rapid multi-task lexical normalisation

1 code implementation EMNLP (ACL) 2021 Tyler Bikaun, Tim French, Melinda Hodkiewicz, Michael Stewart, Wei Liu

LexiClean’s main contribution is support for simultaneous in situ token-level modification and annotation that can be rapidly applied corpus wide.

CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention

1 code implementation 13 Mar 2023 Wenxiao Wang, Wei Chen, Qibo Qiu, Long Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wei Liu

On the one hand, CEL blends each token with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Image Classification Instance Segmentation +3

Frauds Bargain Attack: Generating Adversarial Text Samples via Word Manipulation Process

1 code implementation 1 Mar 2023 Mingze Ni, Zhensu Sun, Wei Liu

Recent studies on adversarial examples expose vulnerabilities of natural language processing (NLP) models.

Adversarial Text

Q-Cogni: An Integrated Causal Reinforcement Learning Framework

no code implementations 26 Feb 2023 Cris Cunha, Wei Liu, Tim French, Ajmal Mian

We present Q-Cogni, an algorithmically integrated causal reinforcement learning framework that redesigns Q-Learning with an autonomous causal structure discovery method to improve the learning process with causal inference.

Causal Inference Decision Making +3

ChatAug: Leveraging ChatGPT for Text Data Augmentation

no code implementations 25 Feb 2023 Haixing Dai, Zhengliang Liu, Wenxiong Liao, Xiaoke Huang, Zihao Wu, Lin Zhao, Wei Liu, Ninghao Liu, Sheng Li, Dajiang Zhu, Hongmin Cai, Quanzheng Li, Dinggang Shen, Tianming Liu, Xiang Li

A natural and widely-used strategy to mitigate such challenges is to perform data augmentation on the training data to better capture the data invariance and increase the sample size.

Data Augmentation Few-Shot Learning +2

Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring

no code implementations 21 Feb 2023 Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee

Recent studies on pronunciation scoring have explored the effect of introducing phone embeddings as reference pronunciation, but mostly in an implicit manner, i.e., addition or concatenation of reference phone embedding and actual pronunciation of the target phone as the phone-level pronunciation quality representation.

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

no code implementations 20 Feb 2023 Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee

A typical fluency scoring system generally relies on an automatic speech recognition (ASR) system to obtain time stamps in input speech for either the subsequent calculation of fluency-related features or directly modeling speech fluency with an end-to-end approach.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

1 code implementation 5 Feb 2023 Zichuan Lin, Xiapeng Wu, Mingfei Sun, Deheng Ye, Qiang Fu, Wei Yang, Wei Liu

Recent success in Deep Reinforcement Learning (DRL) methods has shown that policy optimization with respect to an off-policy distribution via importance sampling is effective for sample reuse.

CFFT-GAN: Cross-domain Feature Fusion Transformer for Exemplar-based Image Translation

no code implementations 3 Feb 2023 Tianxiang Ma, Bingchuan Li, Wei Liu, Miao Hua, Jing Dong, Tieniu Tan

In this paper, we propose a more general learning approach by considering two domain features as a whole and learning both inter-domain correspondence and intra-domain potential information interactions.


HDFormer: High-order Directed Transformer for 3D Human Pose Estimation

1 code implementation 3 Feb 2023 Hanyuan Chen, Jun-Yan He, Wangmeng Xiang, Wei Liu, Zhi-Qi Cheng, Hanbing Liu, Bin Luo, Yifeng Geng, Xuansong Xie

Unfortunately, this causes 3D pose estimation to fail in difficult cases such as joints overlapping and pose fast-changing, as pair-wise relations cannot exploit fine-grained human body priors in pose estimation.

3D Human Pose Estimation 3D Pose Estimation

ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing

no code implementations 31 Jan 2023 Bingchuan Li, Tianxiang Ma, Peng Zhang, Miao Hua, Wei Liu, Qian He, Zili Yi

Specifically, in Phase I, a W-space-oriented StyleGAN inversion network is trained and used to perform image inversion and editing, which assures the editability but sacrifices reconstruction quality.

Image Generation

Planning and Tracking Control of Full Drive-by-Wire Electric Vehicles in Unstructured Scenario

no code implementations 7 Jan 2023 Guoying Chen, Min Hua, Wei Liu, Jinhai Wang, Shunhui Song, Changsheng Liu

Full drive-by-wire electric vehicles (FDWEV) with X-by-wire technology can achieve independent driving, braking, and steering of each wheel, providing a good application platform for autonomous driving technology.

Autonomous Driving

Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation

no code implementations 9 Dec 2022 Jie Jiang, Zhimin Li, Jiangfeng Xiong, Rongwei Quan, Qinglin Lu, Wei Liu

Therefore, TAVS is distinguished from previous temporal segmentation datasets due to its multi-modal information, holistic view of categories, and hierarchical granularities.

Multi-Label Classification Scene Segmentation +2

Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning

no code implementations 24 Nov 2022 Yatai Ji, RongCheng Tu, Jie Jiang, Weijie Kong, Chengfei Cai, Wenzhe Zhao, Hongfa Wang, Yujiu Yang, Wei Liu

Cross-modal alignment is essential for vision-language pre-training (VLP) models to learn the correct corresponding information across different modalities.

Language Modelling Masked Language Modeling +5

PointCA: Evaluating the Robustness of 3D Point Cloud Completion Models Against Adversarial Examples

no code implementations 22 Nov 2022 Shengshan Hu, Junwei Zhang, Wei Liu, Junhui Hou, Minghui Li, Leo Yu Zhang, Hai Jin, Lichao Sun

In addition, existing attack approaches towards point cloud classifiers cannot be applied to the completion models due to different output forms and attack purposes.

Adversarial Attack Point Cloud Classification +2

Curriculum-based Asymmetric Multi-task Reinforcement Learning

1 code implementation 7 Nov 2022 Hanchi Huang, Deheng Ye, Li Shen, Wei Liu

To mitigate the negative influence of customizing the one-off training order in curriculum-based AMTL, CAMRL switches its training mode between parallel single-task RL and asymmetric multi-task RL (MTRL), according to an indicator regarding the training time, the overall performance, and the performance gap among tasks.

Multi-Task Learning reinforcement-learning +1

Model Compression for DNN-Based Text-Independent Speaker Verification Using Weight Quantization

no code implementations 31 Oct 2022 Jingyu Li, Wei Liu, Zhaoyang Zhang, Jiong Wang, Tan Lee

The experiments on VoxCeleb indicate that quantization is effective for compressing SV models, where the model size can be reduced by multiple times with no noticeable performance decline.

Model Compression Quantization +1

Online LiDAR-Camera Extrinsic Parameters Self-checking

1 code implementation 19 Oct 2022 Pengjin Wei, Guohang Yan, Yikang Li, Kun Fang, Jie Yang, Wei Liu

This calibration task is multi-modal, where the rich color and texture information captured by the camera and the accurate three-dimensional spatial information from the LiDAR are incredibly significant for downstream tasks.

Autonomous Driving

Neural Extended Kalman Filters for Learning and Predicting Dynamics of Structural Systems

no code implementations 9 Oct 2022 Wei Liu, Zhilu Lai, Kiran Bacsa, Eleni Chatzi

Typically, conventional variational inference models are parameterized by neural networks independent of the latent dynamics models.

Variational Inference

FR: Folded Rationalization with a Unified Encoder

1 code implementation 17 Sep 2022 Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Chao Yue, Yuankai Zhang

Conventional works generally employ a two-phase model in which a generator selects the most important pieces, followed by a predictor that makes predictions based on the selected pieces.

DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection

no code implementations 21 Aug 2022 Jingyu Lin, Jie Jiang, Yan Yan, Chunchao Guo, Hongfa Wang, Wei Liu, Hanzi Wang

We further propose a parallel design that integrates the convolutional network with a powerful self-attention mechanism to provide complementary clues between the attention path and convolutional path.

Scene Text Detection

CircuitNet: An Open-Source Dataset for Machine Learning Applications in Electronic Design Automation (EDA)

no code implementations 1 Aug 2022 Zhuomin Chai, Yuxiang Zhao, Yibo Lin, Wei Liu, Runsheng Wang, Ru Huang

The electronic design automation (EDA) community has been actively exploring machine learning (ML) for very large-scale integrated computer-aided design (VLSI CAD).

BIG-bench Machine Learning

Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips

1 code implementation 27 Jul 2022 Jiawang Bai, Kuofeng Gao, Dihong Gong, Shu-Tao Xia, Zhifeng Li, Wei Liu

The security of deep neural networks (DNNs) has attracted increasing attention due to their widespread use in various applications.

Towards Efficient Adversarial Training on Vision Transformers

no code implementations 21 Jul 2022 Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu

Vision Transformer (ViT), as a powerful alternative to Convolutional Neural Network (CNN), has received much attention.

Neural modal ordinary differential equations: Integrating physics-based modeling with neural ordinary differential equations for modeling high-dimensional monitored structures

1 code implementation 16 Jul 2022 Zhilu Lai, Wei Liu, Xudong Jian, Kiran Bacsa, Limin Sun, Eleni Chatzi

In the scope of physics-informed machine learning, this paper proposes a framework -- termed Neural Modal ODEs -- to integrate physics-based modeling with deep learning for modeling the dynamics of monitored and high-dimensional engineered systems.

Physics-informed machine learning

Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization

no code implementations 15 Jul 2022 Mengyin Liu, Chao Zhu, Hongyu Gao, Weibo Gu, Hongfa Wang, Wei Liu, Xu-Cheng Yin

2) Secondly, a text-guided information range minimization method is proposed to adaptively encode descriptive parts of each modality into an identical space with a powerful pretrained linguistic model.

Attribute Value Extraction

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022

1 code implementation 4 Jul 2022 Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, RongCheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

In this report, we propose a video-language pretraining (VLP) based solution \cite{kevin2022egovlp} for four Ego4D challenge tasks, including Natural Language Query (NLQ), Moment Query (MQ), Object State Change Classification (OSCC), and PNR Localization (PNR).

Language Modelling Object State Change Classification

Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization

no code implementations 21 Jun 2022 Wei Liu, Rui Wang, Tao Zhang, Kaiwen Li, Wenhua Li, Hisao Ishibuchi

Multi-objective orienteering problems (MO-OPs) are classical multi-objective routing problems and have received a lot of attention in the past decades.

Problem Decomposition reinforcement-learning +1

Towards Generalizable Person Re-identification with a Bi-stream Generative Model

no code implementations 19 Jun 2022 Xin Xu, Wei Liu, Zheng Wang, Ruiming Hu, Qi Tian

Guided by original pedestrian images, one stream is employed to learn a camera-invariant global feature for the CC problem via filtering cross-camera interference factors.

Domain Generalization Generalizable Person Re-identification

EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification

no code implementations 15 Jun 2022 Jingyu Li, Wei Liu, Tan Lee

This paper proposes a domain transfer network, named EDITnet, to alleviate the language-mismatch problem on speaker embeddings without requiring speaker labels.

Self-Supervised Learning Speaker Verification +1

Unsupervised Knowledge Adaptation for Passenger Demand Forecasting

no code implementations 8 Jun 2022 Can Li, Lei Bai, Wei Liu, Lina Yao, S Travis Waller

These multimodal forecasting models can improve accuracy but be less practical when different parts of multimodal datasets are owned by different institutions who cannot directly share data among them.

Egocentric Video-Language Pretraining

1 code implementation 3 Jun 2022 Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, RongCheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

Video-Language Pretraining (VLP), which aims to learn transferable representation to advance a wide range of video-text downstream tasks, has recently received increasing attention.

Action Recognition Contrastive Learning +9

Efficient-Adam: Communication-Efficient Distributed Adam with Complexity Analysis

no code implementations 28 May 2022 Congliang Chen, Li Shen, Wei Liu, Zhi-Quan Luo

Distributed adaptive stochastic gradient methods have been widely used for large-scale nonconvex optimization, such as training deep learning models.


An Investigation on Applying Acoustic Feature Conversion to ASR of Adult and Child Speech

no code implementations 25 May 2022 Wei Liu, Jingyu Li, Tan Lee

The performance of child speech recognition is generally less satisfactory than that of adult speech recognition due to the limited amount of training data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Demand Response Method Considering Multiple Types of Flexible Loads in Industrial Parks

no code implementations 24 May 2022 Jia Cui, Mingze Gao, Xiaoming Zhou, Yang Li, Wei Liu, Jiazheng Tian, XiMing Zhang

With the rapid development of the energy internet, the proportion of flexible loads in smart grid is getting much higher than before.

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning

1 code implementation CVPR 2022 Li Yang, Yan Xu, Chunfeng Yuan, Wei Liu, Bing Li, Weiming Hu

They base the visual grounding on the features from pre-generated proposals or anchors, and fuse these features with the text embeddings to locate the target mentioned by the text.

object-detection Object Detection +1

Deep Reinforcement Learning for Orienteering Problems Based on Decomposition

no code implementations 25 Apr 2022 Wei Liu, Tao Zhang, Rui Wang, Kaiwen Li, Wenhua Li, Kang Yang

A dynamic pointer network (DYPN) is introduced as the TSP solver, which takes city locations as inputs and immediately outputs a permutation of nodes.

reinforcement-learning reinforcement Learning +1

ChildPredictor: A Child Face Prediction Framework with Disentangled Learning

1 code implementation 21 Apr 2022 Yuzhi Zhao, Lai-Man Po, Xuehui Wang, Qiong Yan, Wei Shen, Yujia Zhang, Wei Liu, Chun-Kit Wong, Chiu-Sing Pang, Weifeng Ou, Wing-Yin Yu, Buhua Liu

On this basis, we formulate predictions as a mapping from parents' genetic factors to children's genetic factors, and disentangle them from external and variety factors.

Age-Invariant Face Recognition Image-to-Image Translation +1

XMP-Font: Self-Supervised Cross-Modality Pre-training for Few-Shot Font Generation

no code implementations CVPR 2022 Wei Liu, Fangyue Liu, Fei Ding, Qian He, Zili Yi

The cross-modality encoder is pre-trained in a self-supervised manner to allow effective capture of cross- and intra-modality correlations, which facilitates the content-style disentanglement and modeling style representations of all scales (stroke-level, component-level and character-level).

Disentanglement Font Generation

Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with Multi-Level Representations

no code implementations 7 Apr 2022 Jie Jiang, Shaobo Min, Weijie Kong, Dihong Gong, Hongfa Wang, Zhifeng Li, Wei Liu

With multi-level representations for video and text, hierarchical contrastive learning is designed to explore fine-grained cross-modal relationships, i.e., frame-word, clip-phrase, and video-sentence, which enables HCMI to achieve a comprehensive semantic comparison between video and text modalities.

 Ranked #1 on Video Retrieval on MSVD (using extra training data)

Contrastive Learning Denoising +3

Improving Vision Transformers by Revisiting High-frequency Components

1 code implementation 3 Apr 2022 Jiawang Bai, Li Yuan, Shu-Tao Xia, Shuicheng Yan, Zhifeng Li, Wei Liu

Inspired by this finding, we first investigate the effects of existing techniques for improving ViT models from a new frequency perspective, and find that the success of some techniques (e.g., RandAugment) can be attributed to the better usage of the high-frequency components.

Domain Generalization Image Classification

Masked Autoencoders for Point Cloud Self-supervised Learning

1 code implementation 13 Mar 2022 Yatian Pang, Wenxiao Wang, Francis E. H. Tay, Wei Liu, Yonghong Tian, Li Yuan

Then, a standard Transformer based autoencoder, with an asymmetric design and a shifting mask tokens operation, learns high-level latent features from unmasked point patches, aiming to reconstruct the masked point patches.

3D Part Segmentation Few-Shot 3D Point Cloud Classification +2

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

2 code implementations 1 Mar 2022 ZiHao Wang, Wei Liu, Qian He, Xinglong Wu, Zili Yi

Once trained, the transformer can generate coherent image tokens based on the text embedding extracted from the text encoder of CLIP upon an input text.

Text-to-Image Generation

Holistic Attention-Fusion Adversarial Network for Single Image Defogging

no code implementations 19 Feb 2022 Wei Liu, Cheng Chen, Rui Jiang, Tao Lu, Zixiang Xiong

To address these issues, we develop a novel generative adversarial network, called holistic attention-fusion adversarial network (HAAN), for single image defogging.

An Unsupervised Attentive-Adversarial Learning Framework for Single Image Deraining

no code implementations 19 Feb 2022 Wei Liu, Rui Jiang, Cheng Chen, Tao Lu, Zixiang Xiong

Moreover, to improve the transformation ability of C2R, we design a rain-fog feature decoupling and reorganization network (RFDR) by embedding a rainy image degradation model and a mixed discriminator to preserve richer texture details.

Single Image Deraining

Exploring Structural Sparsity in Neural Image Compression

no code implementations 9 Feb 2022 Shanzhi Yin, Chao Li, Wen Tan, Youneng Bao, Yongsheng Liang, Wei Liu

Neural image compression has matched or outperformed traditional methods (such as JPEG, BPG, WebP).

Image Compression

Constrained Variational Policy Optimization for Safe Reinforcement Learning

1 code implementation 28 Jan 2022 Zuxin Liu, Zhepeng Cen, Vladislav Isenbaev, Wei Liu, Zhiwei Steven Wu, Bo Li, Ding Zhao

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications.

reinforcement-learning reinforcement Learning +1

DynaMixer: A Vision MLP Architecture with Dynamic Mixing

2 code implementations 28 Jan 2022 Ziyu Wang, Wenhao Jiang, Yiming Zhu, Li Yuan, Yibing Song, Wei Liu

In contrast with vision transformers and CNNs, the success of MLP-like models shows that simple information fusion operations among tokens and channels can yield a good representation power for deep recognition models.

Image Classification

Spatio-Temporal Graph Representation Learning for Fraudster Group Detection

no code implementations 7 Jan 2022 Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

Then we use an RNN on the spatial relations to predict the spatio-temporal relations of reviewers in the group.

Graph Representation Learning

RFNet: Unsupervised Network for Mutually Reinforcing Multi-Modal Image Registration and Fusion

no code implementations CVPR 2022 Han Xu, Jiayi Ma, Jiteng Yuan, Zhuliang Le, Wei Liu

Specifically, for image registration, we solve the bottlenecks of defining registration metrics applicable for multi-modal images and facilitating the network convergence.

Image Registration

Coherent Point Drift Revisited for Non-Rigid Shape Matching and Registration

no code implementations CVPR 2022 Aoxiang Fan, Jiayi Ma, Xin Tian, Xiaoguang Mei, Wei Liu

In this paper, we explore a new type of extrinsic method to directly align two geometric shapes with point-to-point correspondences in ambient space by recovering a deformation, which allows more continuous and smooth maps to be obtained.

DGL-GAN: Discriminator Guided Learning for GAN Compression

no code implementations 13 Dec 2021 Yuesong Tian, Li Shen, DaCheng Tao, Zhifeng Li, Wei Liu

Generative Adversarial Networks (GANs) with high computation costs, e.g., BigGAN and StyleGAN2, have achieved remarkable results in synthesizing high resolution and diverse images with high fidelity from random noises.

Triangle Attack: A Query-efficient Decision-based Adversarial Attack

1 code implementation 13 Dec 2021 Xiaosen Wang, Zeliang Zhang, Kangheng Tong, Dihong Gong, Kun He, Zhifeng Li, Wei Liu

Decision-based attack poses a severe threat to real-world applications since it regards the target model as a black box and only accesses the hard prediction label.

Adversarial Attack Dimensionality Reduction

CO2Sum: Contrastive Learning for Factual-Consistent Abstractive Summarization

no code implementations 2 Dec 2021 Wei Liu, Huanqin Wu, Wenjing Mu, Zhen Li, Tao Chen, Dan Nie

We propose CO2Sum (Contrastive for Consistency), a contrastive learning scheme that can be easily applied on sequence-to-sequence models for factual-consistent abstractive summarization, proving that the model can be fact-aware without modifying the architecture.

Abstractive Text Summarization Contrastive Learning

MC-Blur: A Comprehensive Benchmark for Image Deblurring

2 code implementations 1 Dec 2021 Kaihao Zhang, Tao Wang, Wenhan Luo, Boheng Chen, Wenqi Ren, Bjorn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang

Blur artifacts can seriously degrade the visual quality of images, and numerous deblurring methods have been proposed for specific scenarios.

Benchmarking Deblurring +1

Neural Routing by Memory

no code implementations NeurIPS 2021 Kaipeng Zhang, Zhenqiang Li, Zhifeng Li, Wei Liu, Yoichi Sato

However, they use the same procedure sequence for all inputs, regardless of the intermediate features. This paper proffers a simple yet effective idea of constructing parallel procedures and assigning similar intermediate features to the same specialized procedures in a divide-and-conquer fashion.

Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement

1 code implementation NeurIPS 2021 Aming Wu, Suqi Zhao, Cheng Deng, Wei Liu

To alleviate the impact of few samples, enhancing the generalization and discrimination abilities of detectors on new objects plays an important role.

Dictionary Learning Few-Shot Object Detection +1

PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction

1 code implementation 30 Nov 2021 Qingyu Wang, Baojian Ma, Wei Liu, Mingzhao Lou, Mingchuan Zhou, Huanyu Jiang, Yibin Ying

In this paper, we aim to address the issue between datasets and models and propose a large scale stereo dataset with high accuracy disparity ground truth named PlantStereo.

Camera Calibration Image Registration +1

Social Fraud Detection Review: Methods, Challenges and Analysis

no code implementations 10 Nov 2021 Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

Many studies proposed approaches based on user behaviors and review text to address the challenges of fraud detection.

Decision Making Fraud Detection

Meter-Range Wireless Motor Drive for Pipeline Transportation

no code implementations 26 Oct 2021 Wei Liu, K. T. Chau, Hui Wang, Tengbo Yang

This paper proposes and implements a meter-range wireless motor drive (WMD) system for promising applications of underground pipeline transportations or in-pipe robots.

Physics-guided Deep Markov Models for Learning Nonlinear Dynamical Systems with Uncertainty

1 code implementation 16 Oct 2021 Wei Liu, Zhilu Lai, Kiran Bacsa, Eleni Chatzi

To address this, we bridge physics-based state space models with Deep Markov Models, thus delivering a hybrid modeling framework for unsupervised learning and identification of nonlinear dynamical systems.

Variational Inference

Rethinking the Spatial Route Prior in Vision-and-Language Navigation

no code implementations 12 Oct 2021 Xinzhe Zhou, Wei Liu, Yadong Mu

In the most information-rich case of knowing environment maps and admitting shortest-path prior, we observe that given an origin-destination node pair, the internal route can be uniquely determined.

Navigate Vision and Language Navigation

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

1 code implementation 22 Sep 2021 Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi

To address these limitations, we design a Dynamic Style Manipulation Network (DyStyle) whose structure and parameters vary by input samples, to perform nonlinear and adaptive manipulation of latent codes for flexible and precise attribute control.

Contrastive Learning

Utterance-level neural confidence measure for end-to-end children speech recognition

no code implementations 16 Sep 2021 Wei Liu, Tan Lee

The investigation is focused on evaluating and comparing the efficacies of predictor features that are derived from different internal and external modules of the E2E system.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Semantic-Preserving Adversarial Text Attacks

2 code implementations 23 Aug 2021 Xinghao Yang, Weifeng Liu, James Bailey, DaCheng Tao, Wei Liu

In this paper, we propose a Bigram and Unigram based adaptive Semantic Preservation Optimization (BU-SPO) method to examine the vulnerability of deep models.

Adversarial Text Semantic Similarity +3

End2End Occluded Face Recognition by Masking Corrupted Features

1 code implementation 21 Aug 2021 Haibo Qiu, Dihong Gong, Zhifeng Li, Wei Liu, DaCheng Tao

However, the state-of-the-art general face recognition models do not generalize well to occluded face images, which are exactly the common cases in real-world scenarios.

Face Recognition

SynFace: Face Recognition with Synthetic Data

1 code implementation ICCV 2021 Haibo Qiu, Baosheng Yu, Dihong Gong, Zhifeng Li, Wei Liu, DaCheng Tao

We then analyze the underlying causes behind the performance gap, e.g., the poor intra-class variations and the domain gap between synthetic and real face images.

Face Generation Face Recognition

Structure-Aware Feature Generation for Zero-Shot Learning

no code implementations 16 Aug 2021 Lianbo Zhang, Shaoli Huang, Xinchao Wang, Wei Liu, DaCheng Tao

In this paper, we introduce a novel structure-aware feature generation scheme, termed as SA-GAN, to explicitly account for the topological structure in learning both the latent space and the generative networks.

Zero-Shot Learning

CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention

2 code implementations ICLR 2022 Wenxiao Wang, Lu Yao, Long Chen, Binbin Lin, Deng Cai, Xiaofei He, Wei Liu

On the one hand, CEL blends each embedding with multiple patches of different scales, providing the self-attention module itself with cross-scale features.

Image Classification Instance Segmentation +3

Decentralized Federated Learning: Balancing Communication and Computing Costs

no code implementations 26 Jul 2021 Wei Liu, Li Chen, Wenyi Zhang

The performance of decentralized SGD is jointly influenced by inter-node communications and local updates.

Federated Learning

A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing

1 code implementation 15 Jul 2021 Wei Liu, Pingping Zhang, Yinjie Lei, Xiaolin Huang, Jie Yang, Michael Ng

The effectiveness and superior performance of our approach are validated through comprehensive experiments in a range of applications.

image smoothing

Controlled Caption Generation for Images Through Adversarial Attacks

no code implementations 7 Jul 2021 Nayyer Aafaq, Naveed Akhtar, Wei Liu, Mubarak Shah, Ajmal Mian

In contrast, we propose a GAN-based algorithm for crafting adversarial examples for neural image captioning that mimics the internal representation of the CNN such that the resulting deep features of the input image enable a controlled incorrect caption generation through the recurrent network.

Image Captioning Language Modelling

Robust Pose Transfer with Dynamic Details using Neural Video Rendering

no code implementations 27 Jun 2021 Yang-tian Sun, Hao-Zhi Huang, Xuan Wang, Yu-Kun Lai, Wei Liu, Lin Gao

Moreover, we introduce a concise temporal loss in the training stage to suppress the detail flickering that is made more visible due to high-quality dynamic details generated by our method.

Neural Rendering Pose Transfer +1

Stock Market Analysis with Text Data: A Review

no code implementations 23 Jun 2021 Kamaladdin Fataliyev, Aneesh Chivukula, Mukesh Prasad, Wei Liu

Then, we cover the analysis techniques and create a taxonomy of the main stock market forecast models.

Simple Distillation Baselines for Improving Small Self-supervised Models

1 code implementation 21 Jun 2021 Jindong Gu, Wei Liu, Yonglong Tian

While large self-supervised models have rivalled the performance of their supervised counterparts, small models still struggle.

Subjective Bias in Abstractive Summarization

1 code implementation 18 Jun 2021 Lei LI, Wei Liu, Marina Litvak, Natalia Vanetik, Jiacheng Pei, Yinan Liu, Siya Qi

Due to the subjectivity of the summarization, it is a good practice to have more than one gold summary for each training document.

Abstractive Text Summarization

Structure-Regularized Attention for Deformable Object Representation

1 code implementation 12 Jun 2021 Shenao Zhang, Li Shen, Zhifeng Li, Wei Liu

Capturing contextual dependencies has proven useful to improve the representational power of deep neural networks.

UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction

1 code implementation Findings (ACL) 2021 Huanqin Wu, Wei Liu, Lei LI, Dan Nie, Tao Chen, Feng Zhang, Di Wang

The Keyphrase Prediction (KP) task aims to predict several keyphrases that summarize the main idea of a given document.

Attacking Adversarial Attacks as A Defense

no code implementations9 Jun 2021 Boxi Wu, Heng Pan, Li Shen, Jindong Gu, Shuai Zhao, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu

In this work, we find that adversarial attacks can themselves be vulnerable to small perturbations.

A Generative Node-attribute Network Model for Detecting Generalized Structure

no code implementations5 Jun 2021 Wei Liu, Zhenhai Chang, Caiyan Jia, Yimei Zheng

Exploring meaningful structural regularities embedded in networks is a key to understanding and analyzing the structure and function of a network.

Image-to-Video Generation via 3D Facial Dynamics

no code implementations31 May 2021 Xiaoguang Tu, Yingtian Zou, Jian Zhao, Wenjie Ai, Jian Dong, Yuan YAO, Zhikang Wang, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng

Video generation from a single face image is an interesting problem and is usually tackled by using Generative Adversarial Networks (GANs) to integrate information from the input face image and a sequence of sparse facial landmarks.

Image to Video Generation Video Prediction

Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter

1 code implementation ACL 2021 Wei Liu, Xiyan Fu, Yue Zhang, Wenming Xiao

Lexicon information and pre-trained models, such as BERT, have been combined to explore Chinese sequence labelling tasks due to their respective strengths.

Named Entity Recognition +2

Joint Face Image Restoration and Frontalization for Recognition

no code implementations12 May 2021 Xiaoguang Tu, Jian Zhao, Qiankun Liu, Wenjie Ai, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng

First, MDFR is a well-designed encoder-decoder architecture which extracts feature representation from an input face image with arbitrary low-quality factors and restores it to a high-quality counterpart.

Face Recognition Image Restoration

Poisoning MorphNet for Clean-Label Backdoor Attack to Point Clouds

no code implementations11 May 2021 Guiyu Tian, Wenhao Jiang, Wei Liu, Yadong Mu

To this end, MorphNet jointly optimizes two objectives for sample-adaptive poisoning: a reconstruction loss that preserves the visual similarity between benign and poisoned point clouds, and a classification loss that pushes a modern point cloud recognition model to misclassify the poisoned sample as a pre-specified target category.

Backdoor Attack Denoising

Beyond Monocular Deraining: Parallel Stereo Deraining Network Via Semantic Prior

no code implementations9 May 2021 Kaihao Zhang, Wenhan Luo, Yanjiang Yu, Wenqi Ren, Fang Zhao, Changsheng Li, Lin Ma, Wei Liu, Hongdong Li

We first use a coarse deraining network to reduce the rain streaks on the input images, and then adopt a pre-trained semantic segmentation network to extract semantic features from the coarse derained image.

Benchmarking Rain Removal +1

Differentiable Neural Architecture Search for Extremely Lightweight Image Super-Resolution

1 code implementation9 May 2021 Han Huang, Li Shen, Chaoyang He, Weisheng Dong, Wei Liu

Specifically, the cell-level search space is designed based on an information distillation mechanism, focusing on the combinations of lightweight operations and aiming to build a more lightweight and accurate SR structure.

Image Super-Resolution Neural Architecture Search +1

Causal factors discovering from Chinese construction accident cases

no code implementations4 May 2021 Zi-jian Ni, Wei Liu

In China, most of the cases are from accident investigation reports.

Physical world assistive signals for deep neural network classifiers -- neither defense nor attack

no code implementations3 May 2021 Camilo Pestana, Wei Liu, David Glance, Robyn Owens, Ajmal Mian

We discuss how we can exploit these insights to re-think, or avoid, some patterns that might contribute to, or degrade, the detectability of objects in the real-world.

Improved Matrix Gaussian Mechanism for Differential Privacy

no code implementations30 Apr 2021 Jungang Yang, Liyao Xiang, Weiting Li, Wei Liu, Xinbing Wang

The wide deployment of machine learning in recent years has created great demand for large-scale and high-dimensional data, for which privacy raises serious concerns.

ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows

1 code implementation CVPR 2021 Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo

The forward inference projects input images into deep features, while the backward inference remaps deep features back to input images in a lossless and unbiased way.

Style Transfer

COSINE: A Web Server for Clonal and Subclonal Structure Inference and Evolution in Cancer Genomics

no code implementations28 Mar 2021 Xiguo Yuan, Yuan Zhao, Yang Guo, Linmei Ge, Wei Liu, Shiyu Wen, Qi Li, Zhangbo Wan, Peina Zheng, Tao Guo, Zhida Li, Martin Peifer, Yupeng Cun

In the past decade, a variety of methods have been developed for subclonal reconstruction using bulk tumor sequencing data.

Enhanced Spatio-Temporal Interaction Learning for Video Deraining: A Faster and Better Framework

1 code implementation23 Mar 2021 Kaihao Zhang, Dongxu Li, Wenhan Luo, Wenqi Ren, Wei Liu

Video deraining is an important task in computer vision as the unwanted rain hampers the visibility of videos and deteriorates the robustness of most outdoor vision systems.

Rain Removal

DeFLOCNet: Deep Image Editing via Flexible Low-level Controls

1 code implementation CVPR 2021 Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bing Jiang, Wei Liu

While existing methods combine an input image and these low-level controls for CNN inputs, the corresponding feature representations are not sufficient to convey user intentions, leading to unfaithfully generated content.

Texture Synthesis

Generalizing Face Forgery Detection with High-frequency Features

no code implementations CVPR 2021 Yuchen Luo, Yong Zhang, Junchi Yan, Wei Liu

The second is the residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.

Human-like Controllable Image Captioning with Verb-specific Semantic Roles

1 code implementation CVPR 2021 Long Chen, Zhihong Jiang, Jun Xiao, Wei Liu

However, we argue that almost all existing objective control signals have overlooked two indispensable characteristics of an ideal control signal: 1) Event-compatible: all visual contents referred to in a single sentence should be compatible with the described activity.

Image Captioning Semantic Role Labeling

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On

1 code implementation CVPR 2021 Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, Ping Luo

To this end, DCTON can be naturally trained in a self-supervised manner following cycle consistency learning.

Virtual Try-on

LARNet: Lie Algebra Residual Network for Face Recognition

1 code implementation15 Mar 2021 Xiaolong Yang, Xiaohong Jia, Dihong Gong, Dong-Ming Yan, Zhifeng Li, Wei Liu

We prove that face rotation in the image space is equivalent to an additive residual component in the feature space of CNNs, which is determined solely by the rotation.

Face Recognition Robust Face Recognition

VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples

1 code implementation CVPR 2021 Tian Pan, Yibing Song, Tianyu Yang, Wenhao Jiang, Wei Liu

By empowering the temporal robustness of the encoder and modeling the temporal decay of the keys, our VideoMoCo improves MoCo temporally based on contrastive learning.

Action Recognition Contrastive Learning +1

Parser-Free Virtual Try-on via Distilling Appearance Flows

1 code implementation CVPR 2021 Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo

A recent pioneering work employed knowledge distillation to reduce the dependency on human parsing, where the try-on images produced by a parser-based method are used as supervision to train a "student" network without relying on segmentation, making the student mimic the try-on ability of the parser-based model.

Human Parsing Knowledge Distillation +1

Learning Discriminative Features using Multi-label Dual Space

no code implementations25 Feb 2021 Ali Braytee, Wei Liu

We show that the learned projection matrix identifies a subset of discriminative features across multiple semantic labels.

Multi-Label Learning

Analogue cosmological particle creation in an ultracold quantum fluid of light

no code implementations16 Feb 2021 Jeff Steinhauer, Murad Abuzarli, Tangui Aladjidi, Tom Bienaimé, Clara Piekarski, Wei Liu, Elisabeth Giacobino, Alberto Bramati, Quentin Glorieux

In inflationary cosmology, the rapid expansion of the early universe resulted in the spontaneous production of cosmological particles from vacuum fluctuations, observable today in the cosmic microwave background anisotropies.

Quantum Gases Optics Quantum Physics

PSA-Net: Deep Learning based Physician Style-Aware Segmentation Network for Post-Operative Prostate Cancer Clinical Target Volume

no code implementations15 Feb 2021 Anjali Balagopal, Howard Morgan, Michael Dohopoloski, Ramsey Timmerman, Jie Shan, Daniel F. Heitjan, Wei Liu, Dan Nguyen, Raquibul Hannan, Aurelie Garant, Neil Desai, Steve Jiang

A classifier is trained to identify which physician has contoured the CTV from just the contour and corresponding CT scan, to determine if physician styles are consistent and learnable.

Rescattering mechanism of weak decays of double-charm baryons

no code implementations28 Jan 2021 Jia-Jie Han, Hua-Yu Jiang, Wei Liu, Zhen-Jun Xiao, Fu-Sheng Yu

The doubly charmed baryon $\Xi_{cc}^{++}$ was recently observed by LHCb via the decay processes of $\Xi_{cc}^{++}\to \Lambda_c^+ K^-\pi^+\pi^+$ and $\Xi_c^+\pi^+$.

High Energy Physics - Phenomenology High Energy Physics - Experiment

CPTR: Full Transformer Network for Image Captioning

no code implementations26 Jan 2021 Wei Liu, Sihan Chen, Longteng Guo, Xinxin Zhu, Jing Liu

Besides, we provide detailed visualizations of the self-attention between patches in the encoder and the "words-to-patches" attention in the decoder thanks to the full Transformer architecture.

Image Captioning

Phonon Scattering in the Complex Strain Field of a Dislocation

no code implementations26 Jan 2021 Yandong Sun, Yanguang Zhou, Ramya Gurunathan, Jin-Yu Zhang, Ming Hu, Wei Liu, Ben Xu, G. Jeffrey Snyder

Strain engineering is critical to the performance enhancement of electronic and thermoelectric devices because of its influence on the material thermal conductivity.

Materials Science

Global-Local Propagation Network for RGB-D Semantic Segmentation

no code implementations26 Jan 2021 Sihan Chen, Xinxin Zhu, Wei Liu, Xingjian He, Jing Liu

Depth information matters in the RGB-D semantic segmentation task because it provides additional geometric information to color images.

Scene Segmentation

Bayesian Optimization Assisted Meal Bolus Decision Based on Gaussian Processes Learning and Risk-Sensitive Control

no code implementations20 Jan 2021 Deheng Cai, Wei Liu, Linong Ji, Dawei Shi

For the case of announced meals, the proposed method achieves satisfactory and similar performance in terms of mean glucose and percentage time in [70, 180] mg/dL without increasing the risk of hypoglycemia.

Gaussian Processes Management

Towards Practical Adam: Non-Convexity, Convergence Theory, and Mini-Batch Acceleration

no code implementations14 Jan 2021 Congliang Chen, Li Shen, Fangyu Zou, Wei Liu

Adam is one of the most influential adaptive stochastic algorithms for training deep neural networks, yet a few simple counterexamples have shown that it can diverge even in the simple convex setting.

Stochastic Optimization

Far-Field Super-Resolution Imaging By Nonlinear Excited Evanescent Waves

no code implementations14 Jan 2021 ZhiHao Zhou, Wei Liu, Jiajing He, Lei Chen, Xin Luo, Dongyi Shen, Jianjun Cao, Yaping Dan, Xianfeng Chen, Wenjie Wan

Abbe's resolution limit, one of the best-known physical limitations, poses a great challenge for any wave systems in imaging, wave transport, and dynamics.

Super-Resolution Optics

Extremize Optical Chiralities through Polarization Singularities

no code implementations11 Jan 2021 Weijin Chen, Qingdong Yang, Yuntian Chen, Wei Liu

Chiral optical effects are generally quantified along some specific incident directions of exciting waves (especially for extrinsic chiralities of achiral structures) or defined as direction-independent properties by averaging the responses among all structure orientations.


Graph Deformer Network

no code implementations1 Jan 2021 Wenting Zhao, Yuan Fang, Zhen Cui, Tong Zhang, Jian Yang, Wei Liu

In this paper, we propose a simple yet effective graph deformer network (GDN) to fulfill anisotropic convolution filtering on graphs, analogous to the standard convolution operation on images.

Adversarial Attack on Deep Cross-Modal Hamming Retrieval

no code implementations ICCV 2021 Chao Li, Shangqian Gao, Cheng Deng, Wei Liu, Heng Huang

Specifically, given a target model, we first construct its substitute model to exploit cross-modal correlations within hamming space, with which we create adversarial examples by limitedly querying from a target model.

Adversarial Attack Cross-Modal Retrieval +2

Benchmarking Ultra-High-Definition Image Super-Resolution

no code implementations ICCV 2021 Kaihao Zhang, Dongxu Li, Wenhan Luo, Wenqi Ren, Bjorn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang

Increasingly, modern mobile devices allow capturing images at Ultra-High-Definition (UHD) resolution, which includes 4K and 8K images.

Benchmarking Image Super-Resolution

Deep-Learning-Enabled Inverse Engineering of Multi-Wavelength Invisibility-to-Superscattering Switching with Phase-Change Materials

no code implementations25 Dec 2020 Jie Luo, Xun Li, Xinyuan Zhang, Jiajie Guo, Wei Liu, Yun Lai, Yaohui Zhan, Min Huang

Inverse design of nanoparticles for desired scattering spectra and dynamic switching between the two opposite scattering anomalies, i.e., superscattering and invisibility, is important in realizing cloaking, sensing and functional devices.


SubICap: Towards Subword-informed Image Captioning

no code implementations24 Dec 2020 Naeha Sharif, Mohammed Bennamoun, Wei Liu, Syed Afaq Ali Shah

In this work we address this common limitation of IC systems in dealing with rare words in the corpora.

Image Captioning Language Modelling

WEmbSim: A Simple yet Effective Metric for Image Captioning

no code implementations24 Dec 2020 Naeha Sharif, Lyndon White, Mohammed Bennamoun, Wei Liu, Syed Afaq Ali Shah

The area of automatic image caption evaluation is still undergoing intensive research to address the needs of generating captions which can meet adequacy and fluency requirements.

Image Captioning Word Embeddings

LCEval: Learned Composite Metric for Caption Evaluation

1 code implementation24 Dec 2020 Naeha Sharif, Lyndon White, Mohammed Bennamoun, Wei Liu, Syed Afaq Ali Shah

Automatic evaluation metrics hold a fundamental importance in the development and fine-grained analysis of captioning systems.

Traffic Assignment Problem for Footpath Networks with Bidirectional Links

no code implementations6 Dec 2020 Tanapon Lilasathapornkit, David Rey, Wei Liu, Meead Saberi

The estimation of pedestrian traffic in urban areas is often performed with computationally intensive microscopic models that usually suffer from scalability issues in large-scale footpath networks.

Adversarial Learning for Robust Deep Clustering

1 code implementation NeurIPS 2020 Xu Yang, Cheng Deng, Kun Wei, Junchi Yan, Wei Liu

Meanwhile, we devise an adversarial attack strategy to explore samples that easily fool the clustering layers but do not impact the performance of the deep embedding.

Adversarial Attack Deep Clustering

Towards Playing Full MOBA Games with Deep Reinforcement Learning

no code implementations NeurIPS 2020 Deheng Ye, Guibin Chen, Wen Zhang, Sheng Chen, Bo Yuan, Bo Liu, Jia Chen, Zhao Liu, Fuhao Qiu, Hongsheng Yu, Yinyuting Yin, Bei Shi, Liang Wang, Tengfei Shi, Qiang Fu, Wei Yang, Lanxiao Huang, Wei Liu

However, existing work falls short in handling the raw game complexity caused by the explosion of agent combinations, i.e., lineups, when expanding the hero pool; for instance, OpenAI's Dota AI limits play to a pool of only 17 heroes.

Dota 2 League of Legends +2

Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty

1 code implementation5 Nov 2020 Camilo Pestana, Wei Liu, David Glance, Ajmal Mian

We propose three metrics to determine the proportion of robust images in a dataset and provide scoring to determine the dataset bias.

Adversarial Attack Benchmarking

Towards Dark Jargon Interpretation in Underground Forums

no code implementations5 Nov 2020 Dominic Seyler, Wei Liu, XiaoFeng Wang, ChengXiang Zhai

Dark jargons are benign-looking words that have hidden, sinister meanings and are used by participants of underground forums for illicit behavior.

GAIN: Graph Attention & Interaction Network for Inductive Semi-Supervised Learning over Large-scale Graphs

no code implementations3 Nov 2020 Yunpeng Weng, Xu Chen, Liang Chen, Wei Liu

Most existing GNN models exploit a single type of aggregator (e.g., mean-pooling) to aggregate neighboring nodes' information, and then add or concatenate the output of the aggregator to the current representation vector of the center node.

Graph Attention Link Prediction +1

Fewer is More: A Deep Graph Metric Learning Perspective Using Fewer Proxies

1 code implementation NeurIPS 2020 Yuehua Zhu, Muli Yang, Cheng Deng, Wei Liu

In this paper, we propose a novel Proxy-based deep Graph Metric Learning (ProxyGML) approach from the perspective of graph classification, which uses fewer proxies yet achieves better comprehensive performance.

General Classification Graph Classification +1

Face Hallucination via Split-Attention in Split-Attention Network

1 code implementation22 Oct 2020 Tao Lu, Yuanzhi Wang, Yanduo Zhang, Yu Wang, Wei Liu, Zhongyuan Wang, Junjun Jiang

However, most of them fail to take into account the overall facial profile and fine texture details simultaneously, resulting in reduced naturalness and fidelity of the reconstructed face, and further impairing the performance of downstream tasks (e.g., face detection, facial recognition).

Face Detection Face Hallucination +3

Probing the Phonon Mean Free Paths in Dislocation Core by Molecular Dynamics Simulation

no code implementations18 Oct 2020 Yandong Sun, Yanguang Zhou, Ming Hu, G. Jeffrey Snyder, Ben Xu, Wei Liu

In this study, the 1D McKelvey-Shockley phonon BTE method was extended to model inhomogeneous materials, where the effect of defects on the phonon MFPs is explicitly obtained.

Materials Science Computational Physics 80A05 I.6.0

Deep-HOSeq: Deep Higher Order Sequence Fusion for Multimodal Sentiment Analysis

1 code implementation16 Oct 2020 Sunny Verma, Jiwei Wang, Zhefeng Ge, Rujia Shen, Fan Jin, Yang Wang, Fang Chen, Wei Liu

In this research, we first propose a common network to discover both intra-modal and inter-modal dynamics by utilizing basic LSTMs and tensor based convolution networks.

Multimodal Sentiment Analysis Sentiment Classification

Attn-HybridNet: Improving Discriminability of Hybrid Features with Attention Fusion

2 code implementations13 Oct 2020 Sunny Verma, Chen Wang, Liming Zhu, Wei Liu

The principal component analysis network (PCANet) is an unsupervised parsimonious deep network, utilizing principal components as filters in its convolution layers.

Accelerate CNNs from Three Dimensions: A Comprehensive Pruning Framework

no code implementations10 Oct 2020 Wenxiao Wang, Minghao Chen, Shuai Zhao, Long Chen, Jinming Hu, Haifeng Liu, Deng Cai, Xiaofei He, Wei Liu

Specifically, it first casts the relationships between a certain model's accuracy and depth/width/resolution into a polynomial regression and then maximizes the polynomial to acquire the optimal values for the three dimensions.

Network Pruning Neural Architecture Search +1

Targeted Physical-World Attention Attack on Deep Learning Models in Road Sign Recognition

2 code implementations9 Oct 2020 Xinghao Yang, Weifeng Liu, Shengli Zhang, Wei Liu, DaCheng Tao

To alleviate these problems, this paper proposes the targeted attention attack (TAA) method for real world road sign attack.

Traffic Sign Recognition

Knowledge Adaption for Demand Prediction based on Multi-task Memory Neural Network

no code implementations12 Sep 2020 Can Li, Lei Bai, Wei Liu, Lina Yao, S Travis Waller

Accurate demand forecasting of different public transport modes (e.g., buses and light rails) is essential for public service operation. However, the development level of various modes often varies significantly, which makes it hard to predict the demand of the modes with insufficient knowledge and sparse station distribution (i.e., station-sparse modes).

Multi-Task Learning

Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics

2 code implementations31 Aug 2020 Jiangliu Wang, Jianbo Jiao, Linchao Bao, Shengfeng He, Wei Liu, Yun-hui Liu

Specifically, given an unlabeled video clip, we compute a series of spatio-temporal statistical summaries, such as the spatial location and dominant direction of the largest motion, the spatial location and dominant color of the largest color diversity along the temporal axis, etc.

Action Recognition Representation Learning +3

Unravelling the Architecture of Membrane Proteins with Conditional Random Fields

no code implementations6 Aug 2020 Lior Lukov, Sanjay Chawla, Wei Liu, Brett Church, Gaurav Pandey

In this paper, we show that a recently introduced graphical model, Conditional Random Fields (CRF), provides a template to integrate micro-level information about biological entities into a mathematical model to understand their macro-level behavior.

Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos

no code implementations ECCV 2020 Shaoxiang Chen, Wenhao Jiang, Wei Liu, Yu-Gang Jiang

Inspired by the fact that there exist cross-modal interactions in the human brain, we propose a novel method for learning pairwise modality interactions in order to better exploit complementary information for each pair of modalities in videos and thus improve performances on both tasks.

Face Super-Resolution Guided by 3D Facial Priors

1 code implementation ECCV 2020 Xiaobin Hu, Wenqi Ren, John LaMaster, Xiaochun Cao, Xiaoming Li, Zechao Li, Bjoern Menze, Wei Liu

State-of-the-art face super-resolution methods employ deep convolutional neural networks to learn a mapping between low- and high-resolution facial patterns by exploring local appearance knowledge.


Attention-based Residual Speech Portrait Model for Speech to Face Generation

no code implementations9 Jul 2020 Jianrong Wang, Xiaosheng Hu, Li Liu, Wei Liu, Mei Yu, Tianyi Xu

Given a speaker's speech, it is interesting to see if it is possible to generate this speaker's face.