Speech-T: Transducer for Text to Speech and Beyond

no code implementations NeurIPS 2021 Jiawei Chen, Xu Tan, Yichong Leng, Jin Xu, Guihua Wen, Tao Qin, Tie-Yan Liu

Experiments on the LJSpeech dataset demonstrate that Speech-T 1) is more robust than the attention-based autoregressive TTS model due to its inherent monotonic alignments between text and speech; 2) naturally supports streaming TTS with good voice quality; and 3) enjoys the benefit of jointly modeling TTS and ASR in a single network.

Automatic Speech Recognition Speech Recognition

Popularity Bias Is Not Always Evil: Disentangling Benign and Harmful Bias for Recommendation

no code implementations 16 Sep 2021 Zihao Zhao, Jiawei Chen, Sheng Zhou, Xiangnan He, Xuezhi Cao, Fuzheng Zhang, Wei Wu

To sufficiently exploit such important information for recommendation, it is essential to disentangle the benign popularity bias caused by item quality from the harmful popularity bias caused by conformity.

Recommendation Systems

Honey or Poison? Solving the Trigger Curse in Few-shot Event Detection via Causal Intervention

1 code implementation EMNLP 2021 Jiawei Chen, Hongyu Lin, Xianpei Han, Le Sun

In this paper, we identify and solve the trigger curse problem in few-shot event detection (FSED) from a causal view.

Event Detection

InDuDoNet: An Interpretable Dual Domain Network for CT Metal Artifact Reduction

1 code implementation 11 Sep 2021 Hong Wang, Yuexiang Li, Haimiao Zhang, Jiawei Chen, Kai Ma, Deyu Meng, Yefeng Zheng

For the task of metal artifact reduction (MAR), although deep learning (DL)-based methods have achieved promising performance, most of them suffer from two problems: 1) the CT imaging geometry constraint is not fully embedded into the network during training, leaving room for further performance improvement; 2) model interpretability has not received sufficient consideration.

Metal Artifact Reduction

DisenKGAT: Knowledge Graph Embedding with Disentangled Graph Attention Network

2 code implementations 22 Aug 2021 Junkang Wu, Wentao Shi, Xuezhi Cao, Jiawei Chen, Wenqiang Lei, Fuzheng Zhang, Wei Wu, Xiangnan He

Knowledge graph completion (KGC) has become a focus of attention across the deep learning community owing to its excellent contribution to numerous downstream tasks.

Graph Attention Knowledge Graph Completion +1

MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition

no code implementations 20 Aug 2021 Jiawei Chen, Chiu Man Ho

This paper presents a pure transformer-based approach, dubbed the Multi-Modal Video Transformer (MM-ViT), for video action recognition.

Action Recognition Optical Flow Estimation

Distilling Holistic Knowledge with Graph Neural Networks

1 code implementation ICCV 2021 Sheng Zhou, Yucheng Wang, Defang Chen, Jiawei Chen, Xin Wang, Can Wang, Jiajun Bu

The holistic knowledge is represented as a unified graph-based embedding by aggregating individual knowledge from relational neighborhood samples with graph neural networks; the student network is then learned by distilling the holistic knowledge in a contrastive manner.

Knowledge Distillation

Time-aware Path Reasoning on Knowledge Graph for Recommendation

no code implementations 5 Aug 2021 Yuyue Zhao, Xiang Wang, Jiawei Chen, Wei Tang, Yashen Wang, Xiangnan He, Haiyong Xie

In this work, we propose a novel Time-aware Path reasoning for Recommendation (TPRec for short) method, which leverages the potential of temporal information to offer better recommendation with plausible explanations.

Relation Extraction

Single-shot structured illumination microscopy

no code implementations 13 Jul 2021 Qinnan Zhang, En Bo, Jiawei Chen, Jiaosheng Li, Heming Jiang, Xiaoxu Lu, Liyun Zhong, Jindong Tian

In this paper, we report a novel technique termed single-shot SIM, to overcome these limitations.


Mutual-GAN: Towards Unsupervised Cross-Weather Adaptation with Mutual Information Constraint

no code implementations 30 Jun 2021 Jiawei Chen, Yuexiang Li, Kai Ma, Yefeng Zheng

In practical applications, outdoor weather and illumination are changeable, e.g., cloudy and nighttime conditions, which results in a significant drop in the semantic segmentation accuracy of CNNs trained only with daytime data.

Autonomous Driving Semantic Segmentation +2

CausCF: Causal Collaborative Filtering for Recommendation Effect Estimation

no code implementations 28 May 2021 Xu Xie, Zhaoyang Liu, Shiwen Wu, Fei Sun, Cihang Liu, Jiawei Chen, Jinyang Gao, Bin Cui, Bolin Ding

It is based on the idea that similar users not only have similar tastes in items, but also have similar treatment effects under recommendations.

Collaborative Filtering Recommendation Systems

AutoDebias: Learning to Debias for Recommendation

1 code implementation 10 May 2021 Jiawei Chen, Hande Dong, Yang Qiu, Xiangnan He, Xin Xin, Liang Chen, Guli Lin, Keping Yang

This provides a valuable opportunity to develop a universal solution for debiasing, e.g., by learning the debiasing parameters from data.

Imputation Meta-Learning +1

A General Framework for Learning Prosodic-Enhanced Representation of Rap Lyrics

no code implementations 23 Mar 2021 Hongru Liang, Haozheng Wang, Qian Li, Jun Wang, Guandong Xu, Jiawei Chen, Jin-Mao Wei, Zhenglu Yang

Learning and analyzing rap lyrics is a significant basis for many web applications, such as music recommendation, automatic music categorization, and music information retrieval, due to the abundant source of digital music on the World Wide Web.

Information Retrieval Music Information Retrieval +1

GCF-Net: Gated Clip Fusion Network for Video Action Recognition

no code implementations 2 Feb 2021 Jenhao Hsiao, Jiawei Chen, Chiuman Ho

These models are trained by applying a deep CNN on a single clip of fixed temporal length.

Action Recognition

Time Series Domain Adaptation via Sparse Associative Structure Alignment

no code implementations 22 Dec 2020 Ruichu Cai, Jiawei Chen, Zijian Li, Wei Chen, Keli Zhang, Junjian Ye, Zhuozhang Li, Xiaoyan Yang, Zhenjie Zhang

To reduce the difficulty in the discovery of causal structure, we relax it to the sparse associative structure and propose a novel sparse associative structure alignment model for domain adaptation.

Domain Adaptation Time Series

SamWalker++: recommendation with informative sampling strategy

1 code implementation 16 Nov 2020 Can Wang, Jiawei Chen, Sheng Zhou, Qihao Shi, Yan Feng, Chun Chen

However, social network information may not be available in many recommender systems, which hinders the application of SamWalker.

Recommendation Systems

CoSam: An Efficient Collaborative Adaptive Sampler for Recommendation

no code implementations 16 Nov 2020 Jiawei Chen, Chengquan Jiang, Can Wang, Sheng Zhou, Yan Feng, Chun Chen, Martin Ester, Xiangnan He

To deal with these problems, we propose an efficient and effective collaborative sampling method CoSam, which consists of: (1) a collaborative sampler model that explicitly leverages user-item interaction information in sampling probability and exhibits good properties of normalization, adaption, interaction information awareness, and sampling efficiency; and (2) an integrated sampler-recommender framework, leveraging the sampler model in prediction to offset the bias caused by uneven sampling.

Recommendation Systems

Model-Agnostic Counterfactual Reasoning for Eliminating Popularity Bias in Recommender System

1 code implementation 29 Oct 2020 Tianxin Wei, Fuli Feng, Jiawei Chen, Ziwei Wu, JinFeng Yi, Xiangnan He

Existing work addresses this issue with Inverse Propensity Weighting (IPW), which decreases the impact of popular items on training and increases the impact of long-tail items.

Counterfactual Inference Multi-Task Learning +1
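
The IPW idea mentioned in the snippet above can be illustrated with a minimal sketch. It assumes, purely for illustration, that an item's propensity is estimated as its share of all logged interactions; the function name `ipw_weights` and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter


def ipw_weights(interactions):
    """Return one training weight per (user, item) interaction.

    Assumption: propensity(item) = count(item) / total interactions,
    so the weight is its inverse, total / count(item).
    """
    counts = Counter(item for _, item in interactions)
    total = len(interactions)
    # Popular items get small weights, long-tail items get large ones.
    return [total / counts[item] for _, item in interactions]


# Toy log: item "a" is popular (3 interactions), item "b" is long-tail (1).
log = [("u1", "a"), ("u2", "a"), ("u3", "a"), ("u1", "b")]
weights = ipw_weights(log)
```

In this toy log the long-tail interaction receives a larger weight than each popular one, which is the rebalancing effect the snippet describes. Practical systems typically clip or smooth such weights to control variance.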

On the Equivalence of Decoupled Graph Convolution Network and Label Propagation

1 code implementation 23 Oct 2020 Hande Dong, Jiawei Chen, Fuli Feng, Xiangnan He, Shuxian Bi, Zhaolin Ding, Peng Cui

The original design of Graph Convolution Network (GCN) couples feature transformation and neighborhood aggregation for node representation learning.

Node Classification Representation Learning
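
To make the coupling described above concrete, the following is a minimal pure-Python sketch of one standard GCN layer, where neighborhood aggregation (multiplication by a normalized adjacency matrix) and feature transformation (multiplication by a weight matrix) happen in the same step. The helper names and the toy graph are illustrative assumptions; this is not the decoupled model studied in the paper.

```python
def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def normalize_adj(A):
    """Add self-loops, then apply symmetric normalization D^-1/2 (A + I) D^-1/2."""
    n = len(A)
    A_hat = [[A[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in A_hat]
    return [[A_hat[i][j] / ((deg[i] ** 0.5) * (deg[j] ** 0.5)) for j in range(n)]
            for i in range(n)]


def gcn_layer(A_norm, H, W):
    """One coupled layer: aggregate neighbors (A_norm @ H), transform (@ W), apply ReLU."""
    Z = matmul(matmul(A_norm, H), W)
    return [[max(0.0, z) for z in row] for row in Z]


# Toy graph: two nodes joined by one edge, one feature per node.
A = [[0.0, 1.0], [1.0, 0.0]]
H = [[1.0], [3.0]]
W = [[1.0]]  # identity transformation, chosen only to keep the example readable
out = gcn_layer(normalize_adj(A), H, W)
```

With the identity weight, each node's output is the average of its own feature and its neighbor's, which isolates the aggregation half of the layer.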

HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis

no code implementations 3 Sep 2020 Jiawei Chen, Xu Tan, Jian Luan, Tao Qin, Tie-Yan Liu

To tackle the difficulty of singing modeling caused by high sampling rate (wider frequency band and longer waveform), we introduce multi-scale adversarial training in both the acoustic model and vocoder to improve singing modeling.

Singing Voice Synthesis

Residual Frames with Efficient Pseudo-3D CNN for Human Action Recognition

no code implementations 3 Aug 2020 Jiawei Chen, Jenson Hsiao, Chiu Man Ho

Empirical results confirm the efficiency and effectiveness of residual frames as well as the proposed pseudo-3D convolution module.

Action Recognition Optical Flow Estimation +1

Generative Adversarial Networks for Video-to-Video Domain Adaptation

no code implementations 17 Apr 2020 Jiawei Chen, Yuexiang Li, Kai Ma, Yefeng Zheng

Two colonoscopic datasets from different centres, i.e., CVC-Clinic and ETIS-Larib, are adopted to evaluate the performance of domain adaptation of our VideoGAN.

Domain Adaptation Translation

Fast Adaptively Weighted Matrix Factorization for Recommendation with Implicit Feedback

no code implementations 4 Mar 2020 Jiawei Chen, Can Wang, Sheng Zhou, Qihao Shi, Jingbang Chen, Yan Feng, Chun Chen

A popular and effective approach for implicit recommendation is to treat unobserved data as negative but downweight their confidence.
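
The general approach described above (treating unobserved entries as negatives with reduced confidence) can be sketched as a weighted squared-error loss. The single constant weight `w_neg` is a simplifying assumption made here for illustration; it is not the adaptive weighting scheme the paper's title refers to, and the function name is hypothetical.

```python
def weighted_mf_loss(R, P, Q, w_neg=0.1):
    """Weighted squared error over all user-item cells.

    R: binary feedback matrix (1 = observed, 0 = unobserved).
    P: user factors, Q: item factors (lists of vectors).
    w_neg: assumed constant confidence for unobserved (treated-as-negative) cells.
    """
    loss = 0.0
    for u, row in enumerate(R):
        for i, r in enumerate(row):
            pred = sum(pu * qi for pu, qi in zip(P[u], Q[i]))
            weight = 1.0 if r == 1 else w_neg  # downweight unobserved cells
            loss += weight * (r - pred) ** 2
    return loss


# One user, two items: the first observed, the second unobserved.
R = [[1, 0]]
P = [[1.0]]
Q = [[0.5], [0.5]]
loss = weighted_mf_loss(R, P, Q)
```

Both cells have the same prediction error here, but the unobserved one contributes only a tenth as much to the loss.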

A Cyclically-Trained Adversarial Network for Invariant Representation Learning

no code implementations 21 Jun 2019 Jiawei Chen, Janusz Konrad, Prakash Ishwar

Specifically, we propose a cyclically-trained adversarial network to learn a mapping from image space to latent representation space and back such that the latent representation is invariant to a specified factor of variation (e.g., identity).

Representation Learning

Semi-Coupled Two-Stream Fusion ConvNets for Action Recognition at Extremely Low Resolutions

no code implementations 12 Oct 2016 Jiawei Chen, Jonathan Wu, Janusz Konrad, Prakash Ishwar

Deep convolutional neural networks (ConvNets) have been recently shown to attain state-of-the-art performance for action recognition on standard-resolution videos.

Action Recognition

Building A Large Concept Bank for Representing Events in Video

no code implementations 29 Mar 2014 Yin Cui, Dong Liu, Jiawei Chen, Shih-Fu Chang

In this paper, we propose to build Concept Bank, the largest concept library consisting of 4,876 concepts specifically designed to cover 631 real-world events.

Event Detection

