Search Results for author: Xin Wang

Found 478 papers, 164 papers with code

Deep Mixture of Experts via Shallow Embedding

no code implementations5 Jun 2018 Xin Wang, Fisher Yu, Lisa Dunlap, Yi-An Ma, Ruth Wang, Azalia Mirhoseini, Trevor Darrell, Joseph E. Gonzalez

Larger networks generally have greater representational power at the cost of increased computational complexity.

Few-Shot Learning Zero-Shot Learning

Fast Weight Long Short-Term Memory

no code implementations18 Apr 2018 T. Anderson Keller, Sharath Nittur Sridhar, Xin Wang

Associative memory using fast weights is a short-term memory mechanism that substantially improves the memory capacity and time scale of recurrent neural networks (RNNs).

Retrieval

A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis

no code implementations7 Apr 2018 Xin Wang, Jaime Lorenzo-Trueba, Shinji Takaki, Lauri Juvela, Junichi Yamagishi

Recent advances in speech synthesis suggest that limitations such as the lossy nature of the amplitude spectrum with minimum phase approximation and the over-smoothing effect in acoustic modeling can be overcome by using advanced machine learning approaches.

Speech Synthesis

Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data

no code implementations2 Mar 2018 Jaime Lorenzo-Trueba, Fuming Fang, Xin Wang, Isao Echizen, Junichi Yamagishi, Tomi Kinnunen

Thanks to the growing availability of spoofing databases and rapid advances in using them, systems for detecting voice spoofing attacks are becoming more and more capable, and error rates close to zero are being reached for the ASVspoof2015 database.

Generative Adversarial Network Speech Enhancement +2

Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks

no code implementations NeurIPS 2017 Urs Köster, Tristan J. Webb, Xin Wang, Marcel Nassar, Arjun K. Bansal, William H. Constable, Oğuz H. Elibol, Scott Gray, Stewart Hall, Luke Hornof, Amir Khosrowshahi, Carey Kloss, Ruby J. Pai, Naveen Rao

Here we present the Flexpoint data format, aiming at a complete replacement of 32-bit floating point format training and inference, designed to support modern deep network topologies without modifications.

Generative Adversarial Network

IDK Cascades: Fast Deep Learning by Learning not to Overthink

no code implementations3 Jun 2017 Xin Wang, Yujia Luo, Daniel Crankshaw, Alexey Tumanov, Fisher Yu, Joseph E. Gonzalez

Advances in deep learning have led to substantial increases in prediction accuracy but have been accompanied by increases in the cost of rendering predictions.

Dialogue Generation

Deep Reinforcement Learning for Visual Object Tracking in Videos

no code implementations31 Jan 2017 Da Zhang, Hamid Maei, Xin Wang, Yuan-Fang Wang

In this paper we introduce a fully end-to-end approach for visual tracking in videos that learns to predict the bounding box locations of a target object at every frame.

Decision Making Object +4

Stochastic Averaging for Constrained Optimization with Application to Online Resource Allocation

no code implementations7 Oct 2016 Tianyi Chen, Aryan Mokhtari, Xin Wang, Alejandro Ribeiro, Georgios B. Giannakis

Existing approaches to resource allocation for nowadays stochastic networks are challenged to meet fast convergence and tolerable delay requirements.

Classification of Neurological Gait Disorders Using Multi-task Feature Learning

no code implementations8 Dec 2016 Ioannis Papavasileiou, Wenlong Zhang, Xin Wang, Jinbo Bi, Li Zhang, Song Han

An advanced machine learning method, multi-task feature learning (MTFL), is used to jointly train classification models of a subject's gait in three classes, post-stroke, PD and healthy gait.

Classification General Classification

Robust Learning with Kernel Mean p-Power Error Loss

no code implementations21 Dec 2016 Badong Chen, Lei Xing, Xin Wang, Jing Qin, Nanning Zheng

Correntropy is a second order statistical measure in kernel space, which has been successfully applied in robust learning and signal processing.

On Multiplicative Multitask Feature Learning

no code implementations NeurIPS 2014 Xin Wang, Jinbo Bi, Shipeng Yu, Jiangwen Sun

We prove that this framework is mathematically equivalent to the widely used multitask feature learning methods that are based on a joint regularization of all model parameters, but with a more general form of regularizers.

A multi-task learning model for malware classification with useful file access pattern from API call sequence

no code implementations19 Oct 2016 Xin Wang, Siu Ming Yiu

Based on API call sequences, semantic-aware and machine learning (ML) based malware classifiers can be built for malware detection or classification.

Classification Document Classification +6

Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis

no code implementations30 Jul 2018 Gustav Eje Henter, Jaime Lorenzo-Trueba, Xin Wang, Junichi Yamagishi

Generating versatile and appropriate synthetic speech requires control over the output expression separate from the spoken text.

Acoustic Modelling Decoder +2

Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects

no code implementations2 Aug 2018 Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa

We investigated the impact of noisy linguistic features on the performance of a Japanese speech synthesis system based on neural network that uses WaveNet vocoder.

Denoising Speech Synthesis

Non-asymptotic entanglement distillation

1 code implementation19 Jun 2017 Kun Fang, Xin Wang, Marco Tomamichel, Runyao Duan

For isotropic states, it can be further simplified to a linear program.

Quantum Physics

Neural source-filter-based waveform model for statistical parametric speech synthesis

no code implementations29 Oct 2018 Xin Wang, Shinji Takaki, Junichi Yamagishi

Neural waveform models such as the WaveNet are used in many recent text-to-speech systems, but the original WaveNet is quite slow in waveform generation because of its autoregressive (AR) structure.

Speech Synthesis

Audiovisual speaker conversion: jointly and simultaneously transforming facial expression and acoustic characteristics

no code implementations29 Oct 2018 Fuming Fang, Xin Wang, Junichi Yamagishi, Isao Echizen

Transforming the facial and acoustic features together makes it possible for the converted voice and facial expressions to be highly correlated and for the generated target speaker to appear and sound natural.

Image Reconstruction

Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning

no code implementations7 Nov 2018 Xin Wang, Jiawei Wu, Da Zhang, Yu Su, William Yang Wang

Although promising results have been achieved in video captioning, existing models are limited to the fixed inventory of activities in the training corpus, and do not generalize to open vocabulary scenarios.

Video Captioning

Guided Feature Selection for Deep Visual Odometry

no code implementations25 Nov 2018 Fei Xue, Qiuyuan Wang, Xin Wang, Wei Dong, Junqiu Wang, Hongbin Zha

We present a novel end-to-end visual odometry architecture with guided feature selection based on deep convolutional recurrent neural networks.

feature selection Monocular Visual Odometry +1

MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment

no code implementations CVPR 2019 Da Zhang, Xiyang Dai, Xin Wang, Yuan-Fang Wang, Larry S. Davis

In this paper, we present Moment Alignment Network (MAN), a novel framework that unifies the candidate moment encoding and temporal structural reasoning in a single-shot feed-forward network.

Moment Retrieval Natural Language Moment Retrieval +1

Explanatory Graphs for CNNs

no code implementations18 Dec 2018 Quanshi Zhang, Xin Wang, Ruiming Cao, Ying Nian Wu, Feng Shi, Song-Chun Zhu

This paper introduces a graphical model, namely an explanatory graph, which reveals the knowledge hierarchy hidden inside conv-layers of a pre-trained CNN.

Object

Group Linguistic Bias Aware Neural Response Generation

no code implementations WS 2017 Jianan Wang, Xin Wang, Fang Li, Zhen Xu, Zhuoran Wang, Baoxun Wang

For practical chatbots, one of the essential factor for improving user experience is the capability of customizing the talking style of the agents, that is, to make chatbots provide responses meeting users{'} preference on language styles, topics, etc.

Decoder Response Generation

On Algorithms for Sparse Multi-factor NMF

no code implementations NeurIPS 2013 Siwei Lyu, Xin Wang

Nonnegative matrix factorization (NMF) is a popular data analysis method, the objective of which is to decompose a matrix with all nonnegative components into the product of two other nonnegative matrices.

Interpretable CNNs for Object Classification

no code implementations8 Jan 2019 Quanshi Zhang, Xin Wang, Ying Nian Wu, Huilin Zhou, Song-Chun Zhu

This paper proposes a generic method to learn interpretable convolutional filters in a deep convolutional neural network (CNN) for object classification, where each interpretable filter encodes features of a specific object part.

Classification General Classification +1

Residual Attention based Network for Hand Bone Age Assessment

no code implementations21 Dec 2018 Eric Wu, Bin Kong, Xin Wang, Junjie Bai, Yi Lu, Feng Gao, Shaoting Zhang, Kunlin Cao, Qi Song, Siwei Lyu, Youbing Yin

The hierarchical attention components of the residual attention subnet force our network to focus on the key components of the X-ray images and generate the final predictions as well as the associated visual supports, which is similar to the assessment procedure of clinicians.

Hand Segmentation

Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models

no code implementations ACL 2019 Dinghan Shen, Asli Celikyilmaz, Yizhe Zhang, Liqun Chen, Xin Wang, Jianfeng Gao, Lawrence Carin

Variational autoencoders (VAEs) have received much attention recently as an end-to-end architecture for text generation with latent variables.

Decoder Sentence +1

Parameter Efficient Training of Deep Convolutional Neural Networks by Dynamic Sparse Reparameterization

no code implementations15 Feb 2019 Hesham Mostafa, Xin Wang

We evaluate the performance of dynamic reallocation methods in training deep convolutional networks and show that our method outperforms previous static and dynamic reparameterization methods, yielding the best accuracy for a fixed parameter budget, on par with accuracies obtained by iteratively pruning a pre-trained dense model.

Attention-driven Tree-structured Convolutional LSTM for High Dimensional Data Understanding

no code implementations29 Jan 2019 Bin Kong, Xin Wang, Junjie Bai, Yi Lu, Feng Gao, Kunlin Cao, Qi Song, Shaoting Zhang, Siwei Lyu, Youbing Yin

In order to address these limitations, we present tree-structured ConvLSTM models for tree-structured image analysis tasks which can be trained end-to-end.

Vocal Bursts Intensity Prediction

DeepCenterline: a Multi-task Fully Convolutional Network for Centerline Extraction

no code implementations25 Mar 2019 Zhihui Guo, Junjie Bai, Yi Lu, Xin Wang, Kunlin Cao, Qi Song, Milan Sonka, Youbing Yin

The proposed method generates well-positioned centerlines, exhibiting lower number of missing branches and is more robust in the presence of minor imperfections of the object segmentation mask.

Object Semantic Segmentation

Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet

no code implementations29 Mar 2019 Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi

We propose using an extended model architecture of Tacotron, that is a multi-source sequence-to-sequence model with a dual attention mechanism as the shared model for both the TTS and VC tasks.

Decoder Speech Synthesis +1

Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora

no code implementations1 Apr 2019 Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa

When the available data of a target speaker is insufficient to train a high quality speaker-dependent neural text-to-speech (TTS) system, we can combine data from multiple speakers and train a multi-speaker TTS model instead.

Extract and Edit: An Alternative to Back-Translation for Unsupervised Neural Machine Translation

no code implementations NAACL 2019 Jiawei Wu, Xin Wang, William Yang Wang

The overreliance on large parallel corpora significantly limits the applicability of machine translation systems to the majority of language pairs.

Sentence Translation +1

ACE: Adapting to Changing Environments for Semantic Segmentation

no code implementations ICCV 2019 Zuxuan Wu, Xin Wang, Joseph E. Gonzalez, Tom Goldstein, Larry S. Davis

However, neural classifiers are often extremely brittle when confronted with domain shift---changes in the input distribution that occur over time.

Meta-Learning Semantic Segmentation

Maximum Correntropy Criterion with Variable Center

no code implementations13 Apr 2019 Badong Chen, Xin Wang, Yingsong Li, Jose C. Principe

The kernel function in correntropy is usually restricted to the Gaussian function with center located at zero.

Position

Logician: A Unified End-to-End Neural Approach for Open-Domain Information Extraction

no code implementations29 Apr 2019 Mingming Sun, Xu Li, Xin Wang, Miao Fan, Yue Feng, Ping Li

In this paper, we consider the problem of open information extraction (OIE) for extracting entity and relation level intermediate structures from sentences in open-domain.

Attribute Open Information Extraction +3

Neural source-filter waveform models for statistical parametric speech synthesis

no code implementations27 Apr 2019 Xin Wang, Shinji Takaki, Junichi Yamagishi

Other models such as Parallel WaveNet and ClariNet bring together the benefits of AR and IAF-based models and train an IAF model by transferring the knowledge from a pre-trained AR teacher to an IAF student without any sequential transformation.

Speech Synthesis

Multi-Kernel Correntropy for Robust Learning

no code implementations24 May 2019 Badong Chen, Yuqing Xie, Xin Wang, Zejian yuan, Pengju Ren, Jing Qin

In a recent work, the concept of mixture correntropy (MC) was proposed to improve the learning performance, where the kernel function is a mixture Gaussian kernel, namely a linear combination of several zero-mean Gaussian kernels with different widths.

One-shot entanglement distillation beyond LOCC

no code implementations4 Jun 2019 Bartosz Regula, Kun Fang, Xin Wang, Mile Gu

We show in particular that the $\varepsilon$-error one-shot distillable entanglement of any pure state is the same under all sets of operations ranging from one-way LOCC to separability-preserving operations or operations preserving the set of states with positive partial transpose, and can be computed exactly as a quadratically constrained linear program.

Quantum Physics Mathematical Physics Mathematical Physics

Recognizing License Plates in Real-Time

no code implementations11 Jun 2019 Xuewen Yang, Xin Wang

To enable real-time and accurate license plate recognition, in this work, we propose a set of techniques: 1) a contour reconstruction method along with edge-detection to quickly detect the candidate plates; 2) a simple zero-one-alternation scheme to effectively remove the fake top and bottom borders around plates to facilitate more accurate segmentation of characters on plates; 3) a set of techniques to augment the training data, incorporate SIFT features into the CNN network, and exploit transfer learning to obtain the initial parameters for more effective training; and 4) a two-phase verification procedure to determine the correct plate at low cost, a statistical filtering in the plate detection stage to quickly remove unwanted candidates, and the accurate CR results after the CR process to perform further plate verification without additional processing.

Edge Detection License Plate Detection +2

Self-Supervised Dialogue Learning

no code implementations ACL 2019 Jiawei Wu, Xin Wang, William Yang Wang

The sequential order of utterances is often meaningful in coherent dialogues, and the order changes of utterances could lead to low-quality and incoherent conversations.

Self-Supervised Learning

Modeling the Uncertainty in Electronic Health Records: a Bayesian Deep Learning Approach

no code implementations14 Jul 2019 Riyi Qiu, Yugang Jia, Mirsad Hadzikadic, Michael Dulin, Xi Niu, Xin Wang

Deep learning models have exhibited superior performance in predictive tasks with the explosively increasing Electronic Health Records (EHR).

Decision Making

Privacy-preserving Distributed Machine Learning via Local Randomization and ADMM Perturbation

no code implementations30 Jul 2019 Xin Wang, Hideaki Ishii, Linkang Du, Peng Cheng, Jiming Chen

With the proliferation of training data, distributed machine learning (DML) is becoming more competent for large-scale learning tasks.

BIG-bench Machine Learning Privacy Preserving

Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry

no code implementations ICCV 2019 Shunkai Li, Fei Xue, Xin Wang, Zike Yan, Hongbin Zha

As single-view depth estimation is an ill-posed problem, and photometric loss is incapable of discriminating distortion artifacts of warped images, the estimated depth is vague and pose is inaccurate.

Depth Estimation Image Generation +2

Latent Part-of-Speech Sequences for Neural Machine Translation

no code implementations IJCNLP 2019 Xuewen Yang, Yingru Liu, Dongliang Xie, Xin Wang, Niranjan Balasubramanian

In this work, we introduce a new latent variable model, LaSyn, that captures the co-dependence between syntax and semantics, while allowing for effective and efficient inference over the latent space.

Decoder Machine Translation +2

Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments

no code implementations30 Aug 2019 Yusuke Yasuda, Xin Wang, Junichi Yamagishi

The advantages of our approach are that we can simplify many modules for the soft attention and that we can train the end-to-end TTS model using a single likelihood function.

Decoder

Detecting Deep Neural Network Defects with Data Flow Analysis

no code implementations5 Sep 2019 Jiazhen Gu, Huanlin Xu, Yangfan Zhou, Xin Wang, Hui Xu, Michael Lyu

Deep neural networks (DNNs) are shown to be promising solutions in many challenging artificial intelligence tasks.

Object Recognition

Data Sanity Check for Deep Learning Systems via Learnt Assertions

no code implementations6 Sep 2019 Haochuan Lu, Huanlin Xu, Nana Liu, Yangfan Zhou, Xin Wang

But the statistical nature of DL makes it quite vulnerable to invalid inputs, i. e., those cases that are not considered in the training phase of a DL model.

Multi-modal Deep Analysis for Multimedia

no code implementations11 Oct 2019 Wenwu Zhu, Xin Wang, Hongzhi Li

To address the two scientific problems, we investigate them from the following aspects: 1) multi-modal correlational representation: multi-modal fusion of data across different modalities, and 2) multi-modal data and knowledge fusion: multi-modal fusion of data with domain knowledge.

Question Answering Transfer Learning +2

Transferring neural speech waveform synthesizers to musical instrument sounds generation

no code implementations27 Oct 2019 Yi Zhao, Xin Wang, Lauri Juvela, Junichi Yamagishi

Recent neural waveform synthesizers such as WaveNet, WaveGlow, and the neural-source-filter (NSF) model have shown good performance in speech synthesis despite their different methods of waveform generation.

Audio Generation Audio Synthesis +2

Latent Suicide Risk Detection on Microblog via Suicide-Oriented Word Embeddings and Layered Attention

no code implementations IJCNLP 2019 Lei Cao, Huijun Zhang, Ling Feng, Zihan Wei, Xin Wang, Ningyun Li, Xiaohao He

Despite detection of suicidal ideation on social media has made great progress in recent years, people's implicitly and anti-real contrarily expressed posts still remain as an obstacle, constraining the detectors to acquire higher satisfactory performance.

Word Embeddings

Cross-Channel Intragroup Sparsity Neural Network

no code implementations26 Oct 2019 Zhilin Yu, Chao Wang, Xin Wang, Qing Wu, Yong Zhao, Xundong Wu

Modern deep neural networks rely on overparameterization to achieve state-of-the-art generalization.

Model Compression Network Pruning

Disparity-preserved Deep Cross-platform Association for Cross-platform Video Recommendation

no code implementations1 Jan 2019 Shengze Yu, Xin Wang, Wenwu Zhu, Peng Cui, Jingdong Wang

However, there remain two unsolved challenges: i) there exist inconsistencies in cross-platform association due to platform-specific disparity, and ii) data from distinct platforms may have different semantic granularities.

Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation

no code implementations CVPR 2020 Juncheng Li, Xin Wang, Siliang Tang, Haizhou Shi, Fei Wu, Yueting Zhuang, William Yang Wang

Visual navigation is a task of training an embodied agent by intelligently navigating to a target object (e. g., television) using only visual observations.

Object reinforcement-learning +3

Adaptive Activation Network and Functional Regularization for Efficient and Flexible Deep Multi-Task Learning

no code implementations19 Nov 2019 Yingru Liu, Xuewen Yang, Dongliang Xie, Xin Wang, Li Shen, Hao-Zhi Huang, Niranjan Balasubramanian

In this paper, we propose a novel deep learning model called Task Adaptive Activation Network (TAAN) that can automatically learn the optimal network architecture for MTL.

Multi-Task Learning

Domain-Aware Dynamic Networks

no code implementations26 Nov 2019 Tianyuan Zhang, Bichen Wu, Xin Wang, Joseph Gonzalez, Kurt Keutzer

In this work, we propose a method to improve the model capacity without increasing inference-time complexity.

object-detection Object Detection

"How do urban incidents affect traffic speed?" A Deep Graph Convolutional Network for Incident-driven Traffic Speed Prediction

no code implementations3 Dec 2019 Qinge Xie, Tiancheng Guo, Yang Chen, Yu Xiao, Xin Wang, Ben Y. Zhao

Combining above methods, we propose a Deep Incident-Aware Graph Convolutional Network (DIGC-Net) to effectively incorporate urban traffic incident, spatio-temporal, periodic and context features for traffic speed prediction.

Theme-Matters: Fashion Compatibility Learning via Theme Attention

no code implementations12 Dec 2019 Jui-Hsin Lai, Bo Wu, Xin Wang, Dan Zeng, Tao Mei, Jingen Liu

This model associates themes with the pairwise compatibility with attention, and thus compute the outfit-wise compatibility.

Fashion Compatibility Learning

Fully Convolutional Graph Neural Networks using Bipartite Graph Convolutions

no code implementations ICLR 2020 Marcel Nassar, Xin Wang, Evren Tumer

Graph neural networks have been adopted in numerous applications ranging from learning relational representations to modeling data on irregular domains such as point clouds, social graphs, and molecular structures.

Deep Learning for Learning Graph Representations

no code implementations2 Jan 2020 Wenwu Zhu, Xin Wang, Peng Cui

Mining graph data has become a popular research topic in computer science and has been widely studied in both academia and industry given the increasing amount of network data in the recent years.

Network Embedding

Reject Illegal Inputs with Generative Classifier Derived from Any Discriminative Classifier

no code implementations2 Jan 2020 Xin Wang

Experiments on illegal inputs, including adversarial examples, samples with common corruptions, and out-of-distribution~(OOD) samples show that allowed to reject a portion of test samples, SDIM-\emph{logit} significantly improves the performance on the left test sets.

Automated Pavement Crack Segmentation Using U-Net-based Convolutional Neural Network

no code implementations7 Jan 2020 Stephen L. H. Lau, Edwin K. P. Chong, Xu Yang, Xin Wang

In this paper, we propose a deep learning technique based on a convolutional neural network to perform segmentation tasks on pavement crack images.

Crack Segmentation Feature Engineering +2

Domain Embedded Multi-model Generative Adversarial Networks for Image-based Face Inpainting

no code implementations5 Feb 2020 Xian Zhang, Xin Wang, Bin Kong, Youbing Yin, Qi Song, Siwei Lyu, Jiancheng Lv, Canghong Shi, Xiaojie Li

We firstly represent only face regions using the latent variable as the domain knowledge and combine it with the non-face parts textures to generate high-quality face images with plausible contents.

Facial Inpainting

Improving Sampling Accuracy of Stochastic Gradient MCMC Methods via Non-uniform Subsampling of Gradients

no code implementations20 Feb 2020 Ruilin Li, Xin Wang, Hongyuan Zha, Molei Tao

In our practical implementation of EWSG, the non-uniform subsampling is performed efficiently via a Metropolis-Hastings chain on the data index, which is coupled to the MCMC algorithm.

Computational Efficiency

Category-wise Attack: Transferable Adversarial Examples for Anchor Free Object Detection

no code implementations10 Feb 2020 Quanyu Liao, Xin Wang, Bin Kong, Siwei Lyu, Youbing Yin, Qi Song, Xi Wu

Deep neural networks have been demonstrated to be vulnerable to adversarial attacks: subtle perturbations can completely change the classification results.

Object object-detection +1

Synergic Adversarial Label Learning for Grading Retinal Diseases via Knowledge Distillation and Multi-task Learning

no code implementations24 Mar 2020 Lie Ju, Xin Wang, Xin Zhao, Huimin Lu, Dwarikanath Mahapatra, Paul Bonnington, ZongYuan Ge

In addition, we conduct additional experiments to show the effectiveness of SALL from the aspects of reliability and interpretability in the context of medical imaging application.

Classification General Classification +3

An adaptive neuro-fuzzy model for attitude estimation and 2 control a 3 DOF system

no code implementations21 Apr 2020 Xin Wang, SeyedMehdi Abtahi, Mahmood Chahari, Tianyu Zhao

To evaluate the performance of the AN-FIS controller in closed-loop simulation, an ANFIS observer is used to estimate the attitude and angular velocities of the satellite using magnetometer, sun sensor and data gyro data.

How fine can fine-tuning be? Learning efficient language models

no code implementations24 Apr 2020 Evani Radiya-Dixit, Xin Wang

Given a language model pre-trained on massive unlabeled text corpora, only very light supervised fine-tuning is needed to learn a task: the number of fine-tuning steps is typically five orders of magnitude lower than the total parameter count.

Language Modelling

Self-Supervised Deep Visual Odometry with Online Adaptation

no code implementations CVPR 2020 Shunkai Li, Xin Wang, Yingdian Cao, Fei Xue, Zike Yan, Hongbin Zha

In this paper, we propose an online meta-learning algorithm to enable VO networks to continuously adapt to new environments in a self-supervised manner.

Meta-Learning Visual Odometry

Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis

no code implementations20 May 2020 Yusuke Yasuda, Xin Wang, Junichi Yamagishi

Our experiments suggest that a) a neural sequence-to-sequence TTS system should have a sufficient number of model parameters to produce high quality speech, b) it should also use a powerful encoder when it takes characters as inputs, and c) the encoder still has a room for improvement and needs to have an improved architecture to learn supra-segmental features more appropriately.

Speech Synthesis Text-To-Speech Synthesis

More Practical and Adaptive Algorithms for Online Quantum State Learning

no code implementations1 Jun 2020 Yifang Chen, Xin Wang

This regret bound depends only on the maximum rank $M$ of measurements rather than the number of qubits, which takes advantage of low-rank measurements.

A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions

no code implementations1 Jun 2020 Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang

Neural Architecture Search (NAS) is just such a revolutionary algorithm, and the related research work is complicated and rich.

Neural Architecture Search

Learning Continuous-Time Dynamics by Stochastic Differential Networks

no code implementations11 Jun 2020 Yingru Liu, Yucheng Xing, Xuewen Yang, Xin Wang, Jing Shi, Di Jin, Zhaoyue Chen

Learning continuous-time stochastic dynamics is a fundamental and essential problem in modeling sporadic time series, whose observations are irregular and sparse in both time and dimension.

Time Series Time Series Analysis

The curious case of developmental BERTology: On sparsity, transfer learning, generalization and the brain

no code implementations7 Jul 2020 Xin Wang

In this essay, we explore a point of intersection between deep learning and neuroscience, through the lens of large language models, transfer learning and network compression.

Transfer Learning

Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals

no code implementations12 Jul 2020 Tomi Kinnunen, Héctor Delgado, Nicholas Evans, Kong Aik Lee, Ville Vestman, Andreas Nautsch, Massimiliano Todisco, Xin Wang, Md Sahidullah, Junichi Yamagishi, Douglas A. Reynolds

Recent years have seen growing efforts to develop spoofing countermeasures (CMs) to protect automatic speaker verification (ASV) systems from being deceived by manipulated or artificial inputs.

Speaker Verification

A SLAM Map Restoration Algorithm Based on Submaps and an Undirected Connected Graph

no code implementations29 Jul 2020 Zongqian Zhan, Wenjie Jian, Yi-Hui Li, Xin Wang, Yang Yue

To solve the missing map problem, which is an issue in many applications , after the tracking is lost, based on monocular visual SLAM, we present a method of reconstructing a complete global map of UAV datasets by sequentially merging the submaps via the corresponding undirected connected graph.

Simultaneous Localization and Mapping

Learning Tuple Compatibility for Conditional OutfitRecommendation

no code implementations18 Aug 2020 Xuewen Yang, Dongliang Xie, Xin Wang, Jiangbo Yuan, Wanying Ding, Pengyun Yan

Our contributions include: 1) Designing a Mixed Category Attention Net (MCAN) which integrates both fine-grained and coarse category information into recommendation and learns the compatibility among fashion tuples.

Cultural Vocal Bursts Intensity Prediction Recommendation Systems

Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation

no code implementations27 Aug 2020 Xuewen Yang, Dongliang Xie, Xin Wang

In this work, we propose a general framework for unsupervised image-to-image translation across multiple domains, which can translate images from domain X to any a domain without requiring direct training between the two domains involved in image translation.

Translation Unsupervised Image-To-Image Translation

Elliptic Blowup Equations for 6d SCFTs. IV: Matters

1 code implementation4 Jun 2020 Jie Gu, Babak Haghighat, Albrecht Klemm, Kaiwen Sun, Xin Wang

Given the recent geometrical classification of 6d $(1, 0)$ SCFTs, a major question is how to compute for this large class their elliptic genera.

High Energy Physics - Theory Mathematical Physics Mathematical Physics

Intragroup sparsity for efficient inference

no code implementations1 Jan 2021 Zilin Yu, Chao Wang, Xin Wang, Yong Zhao, Xundong Wu

This work studies intragroup sparsity, a fine-grained structural constraint on network weight parameters.

Towards A Unified Understanding and Improving of Adversarial Transferability

no code implementations ICLR 2021 Xin Wang, Jie Ren, Shuyun Lin, Xiangming Zhu, Yisen Wang, Quanshi Zhang

We discover and prove the negative correlation between the adversarial transferability and the interaction inside adversarial perturbations.

Holistic Combination of Structural and Textual Code Information for Context based API Recommendation

no code implementations15 Oct 2020 Chi Chen, Xin Peng, Zhenchang Xing, Jun Sun, Xin Wang, Yifan Zhao, Wenyun Zhao

APIRec-CST is a deep learning model that combines the API usage with the text information in the source code based on an API Context Graph Network and a Code Token Network that simultaneously learn structural and textual features for API recommendation.

End-to-End Text-to-Speech using Latent Duration based on VQ-VAE

no code implementations19 Oct 2020 Yusuke Yasuda, Xin Wang, Junichi Yamagishi

Explicit duration modeling is a key to achieving robust and efficient alignment in text-to-speech synthesis (TTS).

Speech Synthesis Text-To-Speech Synthesis

SWIPENET: Object detection in noisy underwater images

no code implementations19 Oct 2020 Long Chen, Feixiang Zhou, Shengke Wang, Junyu Dong, Ning li, Haiping Ma, Xin Wang, Huiyu Zhou

Moreover, inspired by the human education process that drives the learning from easy to hard concepts, we here propose the CMA training paradigm that first trains a clean detector which is free from the influence of noisy data.

Object object-detection +1

A Survey on Curriculum Learning

no code implementations25 Oct 2020 Xin Wang, Yudong Chen, Wenwu Zhu

We discuss works on curriculum learning within a general CL framework, elaborating on how to design a manually predefined curriculum or an automatic curriculum.

Active Learning BIG-bench Machine Learning +3

Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis

no code implementations10 Nov 2020 Erica Cooper, Xin Wang, Yi Zhao, Yusuke Yasuda, Junichi Yamagishi

We explore pretraining strategies including choice of base corpus with the aim of choosing the best strategy for zero-shot multi-speaker end-to-end synthesis.

Speech Synthesis

A Predicate-Function-Argument Annotation of Natural Language for Open-Domain Information eXpression

no code implementations EMNLP 2020 Mingming Sun, Wenyue Hua, Zoey Liu, Xin Wang, Kangjie Zheng, Ping Li

Based on the same platform of OIX, the OIE strategies are reusable, and people can select a set of strategies to assemble their algorithm for a specific task so that the adaptability may be significantly increased.

Open Information Extraction Sentence

Shortcuts to Adiabaticity for the Quantum Rabi Model: Efficient Generation of Giant Entangled Cat States via Parametric Amplification

no code implementations10 Aug 2020 Ye-Hong Chen, Wei Qin, Xin Wang, Adam Miranowicz, Franco Nori

We propose a method for the fast generation of nonclassical ground states of the Rabi model in the ultrastrong and deep-strong coupling regimes via the shortcuts-to-adiabatic (STA) dynamics.

Quantum Physics

Leveraging Regular Fundus Images for Training UWF Fundus Diagnosis Models via Adversarial Learning and Pseudo-Labeling

no code implementations27 Nov 2020 Lie Ju, Xin Wang, Xin Zhao, Paul Bonnington, Tom Drummond, ZongYuan Ge

We propose the use of a modified cycle generative adversarial network (CycleGAN) model to bridge the gap between regular and UWF fundus and generate additional UWF fundus images for training.

Generative Adversarial Network Lesion Detection

Cooling-Aware Resource Allocation and Load Management for Mobile Edge Computing Systems

no code implementations19 Jun 2020 Xiaojing Chen, Zhouyu Lu, Wei Ni, Xin Wang, Feng Wang, Shunqing Zhang, Shugong Xu

Driven by explosive computation demands of Internet of Things (IoT), mobile edge computing (MEC) provides a promising technique to enhance the computation capability for mobile users.

Edge-computing Management +1

Partially Connected Automated Vehicle Cooperative Control Strategy with a Deep Reinforcement Learning Approach

no code implementations3 Dec 2020 Haotian Shi, Yang Zhou, Keshu Wu, Xin Wang, Yangxin Lin, Bin Ran

This paper proposes a cooperative strategy of connected and automated vehicles (CAVs) longitudinal control for partially connected and automated traffic environment based on deep reinforcement learning (DRL) algorithm, which enhances the string stability of mixed traffic, car following efficiency, and energy efficiency.

reinforcement-learning Reinforcement Learning (RL)

Spatial Heterogeneity Automatic Detection and Estimation

no code implementations5 Jun 2019 Xin Wang, Zhengyuan Zhu, Hao Helen Zhang

Spatial regression is widely used for modeling the relationship between a dependent variable and explanatory covariates.

Methodology

Joint Optimization of Trajectory, Propulsion and Thrust Powers for Covert UAV-on-UAV Video Tracking and Surveillance

no code implementations22 Dec 2020 Shuyan Hu, Wei Ni, Xin Wang, Abbas Jamalipour, Dean Ta

Autonomous tracking of suspicious unmanned aerial vehicles (UAVs) by legitimate monitoring UAVs (or monitors) can be crucial to public safety and security.

Generalizable control for multiparameter quantum metrology

no code implementations24 Dec 2020 Han Xu, Lingna Wang, Haidong Yuan, Xin Wang

Here we study the generalizability of optimal control, namely, optimal controls that can be systematically updated across a range of parameters with minimal cost.

Quantum Physics

The dynamic energy balance in earthquakes expressed by fault surface morphology

no code implementations18 Jan 2021 Xin Wang, Juan Liu, Feng Gao, Zhizhen Zhang

The fault surface morphology is the direct result of the microscopic processes near the crack tip or on the frictional interface.

Geophysics

A Marching Cube Algorithm Based on Edge Growth

no code implementations3 Jan 2021 Xin Wang, Su Gao, Monan Wang, Zhenghua Duan

When only the main contour of the 3D model needs to be organized, the algorithm performs well.

3D Reconstruction Graphics

A Closer Look at Temporal Sentence Grounding in Videos: Dataset and Metric

no code implementations22 Jan 2021 Yitian Yuan, Xiaohan Lan, Xin Wang, Long Chen, Zhi Wang, Wenwu Zhu

All the results demonstrate that the re-organized dataset splits and new metric can better monitor the progress in TSGV.

Benchmarking Sentence +1

Explicit Perturbations to the Stabilizer $τ= {\rm i}$ of Modular $A^\prime_5$ Symmetry and Leptonic CP Violation

no code implementations8 Feb 2021 Xin Wang, Shun Zhou

In a class of neutrino mass models with modular flavor symmetries, it has been observed that CP symmetry is preserved at the fixed point (or stabilizer) of the modulus parameter $\tau = {\rm i}$, whereas significant CP violation emerges within the neighbourhood of this stabilizer.

High Energy Physics - Phenomenology

Improving Medical Image Classification with Label Noise Using Dual-uncertainty Estimation

no code implementations28 Feb 2021 Lie Ju, Xin Wang, Lin Wang, Dwarikanath Mahapatra, Xin Zhao, Mehrtash Harandi, Tom Drummond, Tongliang Liu, ZongYuan Ge

In this paper, we systematically discuss and define the two common types of label noise in medical images - disagreement label noise from inconsistency expert opinions and single-target label noise from wrong diagnosis record.

Benchmarking General Classification +3

A Hybrid Quantum-Classical Hamiltonian Learning Algorithm

no code implementations1 Mar 2021 Youle Wang, Guangxi Li, Xin Wang

Hamiltonian learning is crucial to the certification of quantum devices and quantum simulators.

A Comparative Study on Recent Neural Spoofing Countermeasures for Synthetic Speech Detection

no code implementations21 Mar 2021 Xin Wang, Junich Yamagishi

A great deal of recent research effort on speech spoofing countermeasures has been invested into back-end neural networks and training criteria.

Synthetic Speech Detection

Online Learning of a Probabilistic and Adaptive Scene Representation

no code implementations CVPR 2021 Zike Yan, Xin Wang, Hongbin Zha

Constructing and maintaining a consistent scene model on-the-fly is the core task for online spatial perception, interpretation, and action.

An Initial Investigation for Detecting Partially Spoofed Audio

no code implementations6 Apr 2021 Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi, Jose Patino, Nicholas Evans

By definition, partially-spoofed utterances contain a mix of both spoofed and bona fide segments, which will likely degrade the performance of countermeasures trained with entirely spoofed utterances.

Voice Anti-spoofing

Joint User Identification, Channel Estimation, and Signal Detection for Grant-Free NOMA

no code implementations12 Jan 2020 Shuchao Jiang, Xiaojun Yuan, Xin Wang, Chongbin Xu, Wei Yu

To address the problem that the exact calculation of the messages exchanged within CSCE and between the two modules is complicated due to phase ambiguity issues, this paper proposes a rotationally invariant Gaussian mixture (RIGM) model, and develops an efficient JUICESD-RIGM algorithm.

Relational Subsets Knowledge Distillation for Long-tailed Retinal Diseases Recognition

no code implementations22 Apr 2021 Lie Ju, Xin Wang, Lin Wang, Tongliang Liu, Xin Zhao, Tom Drummond, Dwarikanath Mahapatra, ZongYuan Ge

For example, there are estimated more than 40 different kinds of retinal diseases with variable morbidity, however with more than 30+ conditions are very rare from the global patient cohorts, which results in a typical long-tailed learning problem for deep learning-based screening models.

Knowledge Distillation

Adversarial Attack Framework on Graph Embedding Models with Limited Knowledge

no code implementations26 May 2021 Heng Chang, Yu Rong, Tingyang Xu, Wenbing Huang, Honglei Zhang, Peng Cui, Xin Wang, Wenwu Zhu, Junzhou Huang

We investigate the theoretical connections between graph signal processing and graph embedding models and formulate the graph embedding model as a general graph signal process with a corresponding graph filter.

Adversarial Attack Graph Embedding +1

Imperceptible Adversarial Examples for Fake Image Detection

no code implementations3 Jun 2021 Quanyu Liao, Yuezun Li, Xin Wang, Bin Kong, Bin Zhu, Siwei Lyu, Youbing Yin, Qi Song, Xi Wu

Fooling people with highly realistic fake images generated with Deepfake or GANs brings a great social disturbance to our society.

Face Swapping Fake Image Detection

Transferable Adversarial Examples for Anchor Free Object Detection

no code implementations3 Jun 2021 Quanyu Liao, Xin Wang, Bin Kong, Siwei Lyu, Bin Zhu, Youbing Yin, Qi Song, Xi Wu

Deep neural networks have been demonstrated to be vulnerable to adversarial attacks: subtle perturbation can completely change prediction result.

Adversarial Attack Object +2

Visual Question Rewriting for Increasing Response Rate

no code implementations4 Jun 2021 Jiayi Wei, Xilian Li, Yi Zhang, Xin Wang

Offline experiments and mechanical Turk based evaluations show that it is possible to rewrite bland questions in a more detailed and attractive way to increase the response rate, and images can be helpful.

4k Question Rewriting

Mobile Optical Communications Using Second Harmonic of Intra-Cavity Laser

no code implementations21 Jun 2021 Mingliang Xiong, Qingwen Liu, Xin Wang, Shengli Zhou, Bin Zhou, Zhiyong Bu

In this paper, we propose an intra-cavity SHG RBCom system to simplify the system design and improve the energy conversion efficiency.

Over-the-Air Federated Multi-Task Learning

no code implementations27 Jun 2021 Haoming Ma, Xiaojun Yuan, Dian Fan, Zhi Ding, Xin Wang, Jun Fang

In this letter, we introduce over-the-air computation into the communication design of federated multi-task learning (FMTL), and propose an over-the-air federated multi-task learning (OA-FMTL) framework, where multiple learning tasks deployed on edge devices share a non-orthogonal fading channel under the coordination of an edge server (ES).

Federated Learning Multi-Task Learning

ReFormer: The Relational Transformer for Image Captioning

no code implementations29 Jul 2021 Xuewen Yang, Yingru Liu, Xin Wang

To improve the quality of image captioning, we propose a novel architecture ReFormer -- a RElational transFORMER to generate features with relation information embedded and to explicitly express the pair-wise relationships between objects in the image.

Graph Generation Image Captioning +3

Unsupervised Domain Adaptation for Retinal Vessel Segmentation with Adversarial Learning and Transfer Normalization

no code implementations4 Aug 2021 Wei Feng, Lie Ju, Lin Wang, Kaimin Song, Xin Wang, Xin Zhao, Qingyi Tao, ZongYuan Ge

In this work, we explore unsupervised domain adaptation in retinal vessel segmentation by using entropy-based adversarial learning and transfer normalization layer to train a segmentation network, which generalizes well across domains and requires no annotation of the target domain.

Retinal Vessel Segmentation Segmentation +1

A Hypothesis for the Aesthetic Appreciation in Neural Networks

no code implementations31 Jul 2021 Xu Cheng, Xin Wang, Haotian Xue, Zhengyang Liang, Quanshi Zhang

This paper proposes a hypothesis for the aesthetic appreciation that aesthetic images make a neural network strengthen salient concepts and discard inessential concepts.

SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation

no code implementations10 Aug 2021 Xin Wang, Yasheng Wang, Fei Mi, Pingyi Zhou, Yao Wan, Xiao Liu, Li Li, Hao Wu, Jin Liu, Xin Jiang

Code representation learning, which aims to encode the semantics of source code into distributed vectors, plays an important role in recent deep-learning-based models for code intelligence.

Clone Detection Code Search +5

Interpreting Attributions and Interactions of Adversarial Attacks

no code implementations ICCV 2021 Xin Wang, Shuyun Lin, Hao Zhang, Yufei Zhu, Quanshi Zhang

This paper aims to explain adversarial attacks in terms of how adversarial perturbations contribute to the attacking task.

Eyes Tell All: Irregular Pupil Shapes Reveal GAN-generated Faces

no code implementations1 Sep 2021 Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu

Generative adversary network (GAN) generated high-realistic human faces have been used as profile images for fake social media accounts and are visually challenging to discern from real ones.

ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection

no code implementations1 Sep 2021 Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas Evans, Héctor Delgado

In addition to a continued focus upon logical and physical access tasks in which there are a number of advances compared to previous editions, ASVspoof 2021 introduces a new task involving deepfake speech detection.

Face Swapping Speaker Verification

Tolerating Adversarial Attacks and Byzantine Faults in Distributed Machine Learning

no code implementations5 Sep 2021 Yusen Wu, Hao Chen, Xin Wang, Chao Liu, Phuong Nguyen, Yelena Yesha

In addition, Byzantine faults including software, hardware, network issues occur in distributed systems which also lead to a negative impact on the prediction outcome.

BIG-bench Machine Learning

Robust Attentive Deep Neural Network for Exposing GAN-generated Faces

no code implementations5 Sep 2021 Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu

However, images from existing public datasets do not represent real-world scenarios well enough in terms of view variations and data distributions (where real faces largely outnumber synthetic faces).

Face Detection

Sentence Matching with Syntax- and Semantics-Aware BERT

no code implementations COLING 2020 Tao Liu, Xin Wang, Chengguo Lv, Ranran Zhen, Guohong Fu

Sentence matching aims to identify the special relationship between two sentences, and plays a key role in many natural language processing tasks.

Sentence

A Survey on Temporal Sentence Grounding in Videos

no code implementations16 Sep 2021 Xiaohan Lan, Yitian Yuan, Xin Wang, Zhi Wang, Wenwu Zhu

In this survey, we give a comprehensive overview for TSGV, which i) summarizes the taxonomy of existing methods, ii) provides a detailed description of the evaluation protocols(i. e., datasets and metrics) to be used in TSGV, and iii) in-depth discusses potential problems of current benchmarking designs and research directions for further investigations.

Benchmarking Sentence +2

Provable hierarchical lifelong learning with a sketch-based modular architecture

no code implementations29 Sep 2021 Rina Panigrahy, Brendan Juba, Zihao Deng, Xin Wang, Zee Fryer

We propose a modular architecture for lifelong learning of hierarchically structured tasks.

A HYPOTHESIS FOR THE COGNITIVE DIFFICULTY OF IMAGES

no code implementations29 Sep 2021 Xu Cheng, Xin Wang, Haotian Xue, Zhengyang Liang, Xin Jin, Quanshi Zhang

This paper proposes a hypothesis to analyze the underlying reason for the cognitive difficulty of an image from two perspectives, i. e. a cognitive image usually makes a DNN strongly activated by cognitive concepts; discarding massive non-cognitive concepts may also help the DNN focus on cognitive concepts.

Evaluation of Various Open-Set Medical Imaging Tasks with Deep Neural Networks

no code implementations21 Oct 2021 ZongYuan Ge, Xin Wang

The current generation of deep neural networks has achieved close-to-human results on "closed-set" image recognition; that is, the classes being evaluated overlap with the training classes.

Decision Making Open Set Learning

Data-driven Control of Dynamic Event-triggered Systems with Delays

no code implementations25 Oct 2021 Xin Wang, Jian Sun, Julian Berberich, Gang Wang, Frank Allgöwer, Jie Chen

Data-based representations for time-invariant linear systems with known or unknown system input matrices are first developed, along with a novel class of dynamic triggering schemes for sampled-data systems with time delays.

CAP: Co-Adversarial Perturbation on Weights and Features for Improving Generalization of Graph Neural Networks

no code implementations28 Oct 2021 Haotian Xue, Kaixiong Zhou, Tianlong Chen, Kai Guo, Xia Hu, Yi Chang, Xin Wang

In this paper, we investigate GNNs from the lens of weight and feature loss landscapes, i. e., the loss changes with respect to model weights and node features, respectively.

Hierarchical Knowledge Guided Learning for Real-world Retinal Diseases Recognition

no code implementations17 Nov 2021 Lie Ju, Zhen Yu, Lin Wang, Xin Zhao, Xin Wang, Paul Bonnington, ZongYuan Ge

From a modeling perspective, most deep learning models trained on these datasets may lack the ability to generalize to rare diseases where only a few available samples are presented for training.

Knowledge Distillation

Graph Differentiable Architecture Search with Structure Learning

no code implementations NeurIPS 2021 Yijian Qin, Xin Wang, Zeyang Zhang, Wenwu Zhu

Extensive experiments on real-world graph datasets demonstrate that our proposed GASSO model is able to achieve state-of-the-art performance compared with existing baselines.

Denoising Graph structure learning +1

Disentangled Contrastive Learning on Graphs

no code implementations NeurIPS 2021 Haoyang Li, Xin Wang, Ziwei Zhang, Zehuan Yuan, Hang Li, Wenwu Zhu

Then we propose a novel factor-wise discrimination objective in a contrastive learning manner, which can force the factorized representations to independently reflect the expressive information from different latent factors.

Contrastive Learning Self-Supervised Learning

HybridGazeNet: Geometric model guided Convolutional Neural Networks for gaze estimation

no code implementations23 Nov 2021 Shaobo Guo, Xiao Jiang, Zhizhong Su, Rui Wu, Xin Wang

As a critical cue for understanding human intention, human gaze provides a key signal for Human-Computer Interaction(HCI) applications.

Gaze Estimation

Hierarchically-Structured Variational Autoencoders for Long Text Generation

no code implementations27 Sep 2018 Dinghan Shen, Asli Celikyilmaz, Yizhe Zhang, Liqun Chen, Xin Wang, Lawrence Carin

Variational autoencoders (VAEs) have received much attention recently as an end-to-end architecture for text generation.

Decoder Sentence +1

Generalized Natural Language Grounded Navigation via Environment-agnostic Multitask Learning

no code implementations25 Sep 2019 Xin Wang, Vihan Jain, Eugene Ie, William Wang, Zornitsa Kozareva, Sujith Ravi

Recent research efforts enable study for natural language grounded navigation in photo-realistic environments, e. g., following natural language instructions or dialog.

Vision-Language Navigation

Exploring the Correlation between Likelihood of Flow-based Generative Models and Image Semantics

no code implementations25 Sep 2019 Xin Wang, SiuMing Yiu

In this paper, we explore the correlation between flows' likelihood and image semantics.

Reject Illegal Inputs: Scaling Generative Classifiers with Supervised Deep Infomax

no code implementations25 Sep 2019 Xin Wang, SiuMing Yiu

The supervised probabilistic constraints are equivalent to a generative classifier on high-level data representations, where class conditional log-likelihoods of samples can be evaluated.

Representation Learning

OOD-GNN: Out-of-Distribution Generalized Graph Neural Network

no code implementations7 Dec 2021 Haoyang Li, Xin Wang, Ziwei Zhang, Wenwu Zhu

Our proposed OOD-GNN employs a novel nonlinear graph representation decorrelation method utilizing random Fourier features, which encourages the model to eliminate the statistical dependence between relevant and irrelevant graph representations through iteratively optimizing the sample graph weights and graph encoder.

Out-of-Distribution Generalization

You Can Wash Better: Daily Handwashing Assessment with Smartwatches

no code implementations9 Dec 2021 Fei Wang, Xilei Wu, Xin Wang, Jianlei Chi, Jingang Shi, Dong Huang

We propose UWash, an intelligent solution upon smartwatches, to assess handwashing for the purpose of raising users' awareness and cultivating habits in high-quality handwashing.

Gesture Recognition Semantic Segmentation

Decentralized Stochastic Proximal Gradient Descent with Variance Reduction over Time-varying Networks

no code implementations20 Dec 2021 Xuanjie Li, Yuedong Xu, Jessie Hui Wang, Xin Wang, John C. S. Lui

By transforming our decentralized algorithm into a centralized inexact proximal gradient algorithm with variance reduction, and controlling the bounds of error sequences, we prove that DPSVRG converges at the rate of $O(1/T)$ for general convex objectives plus a non-smooth term with $T$ as the number of iterations, while DSPG converges at the rate $O(\frac{1}{\sqrt{T}})$.

A Scalable Deep Reinforcement Learning Model for Online Scheduling Coflows of Multi-Stage Jobs for High Performance Computing

no code implementations21 Dec 2021 Xin Wang, Hong Shen

Coflow is a recently proposed networking abstraction to help improve the communication performance of data-parallel computing jobs.

Reinforcement Learning (RL) Scheduling

Revisiting Transformation Invariant Geometric Deep Learning: Are Initial Representations All You Need?

no code implementations23 Dec 2021 Ziwei Zhang, Xin Wang, Zeyang Zhang, Peng Cui, Wenwu Zhu

Based on the experimental results, we advocate that TinvNN should be considered a new starting point and an essential baseline for further studies of transformation-invariant geometric deep learning.

Combinatorial Optimization Inductive Bias

Self-directed Machine Learning

no code implementations4 Jan 2022 Wenwu Zhu, Xin Wang, Pengtao Xie

Inspired by the concept of self-directed human learning, we introduce the principal concept of Self-directed Machine Learning (SDML) and propose a framework for SDML.

BIG-bench Machine Learning Model Selection

Adaptive Worker Grouping For Communication-Efficient and Straggler-Tolerant Distributed SGD

no code implementations12 Jan 2022 Feng Zhu, Jingjing Zhang, Osvaldo Simeone, Xin Wang

Wall-clock convergence time and communication load are key performance metrics for the distributed implementation of stochastic gradient descent (SGD) in parameter server settings.

Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation

no code implementations COLING 2022 Yihe Wang, Yitong Li, Yasheng Wang, Fei Mi, Pingyi Zhou, Xin Wang, Jin Liu, Xin Jiang, Qun Liu

Experiments over publicly available datasets demonstrate that our method can help models generate better responses, even such training data are usually impressed as low-quality data.

Dialogue Generation Retrieval

GAN-generated Faces Detection: A Survey and New Perspectives

no code implementations15 Feb 2022 Xin Wang, Hui Guo, Shu Hu, Ming-Ching Chang, Siwei Lyu

Generative Adversarial Networks (GAN) have led to the generation of very realistic face images, which have been used in fake social media accounts and other disinformation matters that can generate profound impacts.

Face Detection

Model-Based and Data-Driven Control of Event- and Self-Triggered Discrete-Time LTI Systems

no code implementations16 Feb 2022 Xin Wang, Julian Berberich, Jian Sun, Gang Wang, Frank Allgöwer, Jie Chen

To this end, we begin by presenting a dynamic event-triggering scheme (ETS) based on periodic sampling, and a discrete-time looped-functional approach, through which a model-based stability condition is derived.

STS

Curriculum Meta-Learning for Next POI Recommendation

no code implementations KDD 2021 Yudong Chen, Xin Wang, Miao Fan, Jizhou Huang, Shengwen Yang, and Wenwu Zhu.

Next point-of-interest (POI) recommendation is a hot research field where a recent emerging scenario, next POI to search recommendation, has been deployed in many online map services such as Baidu Maps.

Meta-Learning

Synergistic Network Learning and Label Correction for Noise-robust Image Classification

no code implementations27 Feb 2022 Chen Gong, Kong Bin, Eric J. Seibel, Xin Wang, Youbing Yin, Qi Song

Taking the expertise of DNNs to learn meaningful patterns before fitting noise, our framework first trains two networks over the current dataset with small loss selection.

Image Classification

Optimal quantum dataset for learning a unitary transformation

no code implementations1 Mar 2022 Zhan Yu, Xuanqiang Zhao, Benchi Zhao, Xin Wang

In this work, we solve the problem on the minimum size of sufficient quantum datasets for learning a unitary transformation exactly, which reveals the power and limitation of quantum data.

Quantum Machine Learning

Compilable Neural Code Generation with Compiler Feedback

no code implementations Findings (ACL) 2022 Xin Wang, Yasheng Wang, Yao Wan, Fei Mi, Yitong Li, Pingyi Zhou, Jin Liu, Hao Wu, Xin Jiang, Qun Liu

Automatically generating compilable programs with (or without) natural language descriptions has always been a touchstone problem for computational linguistics and automated software engineering.

Code Completion Code Generation +4

Energy-Efficient UAV-Mounted RIS Assisted Mobile Edge Computing

no code implementations24 Mar 2022 Zhiyuan Zhai, Xinhong Dai, Bin Duo, Xin Wang, Xiaojun Yuan

Unmanned aerial vehicle (UAV) and reconfigurable intelligent surface (RIS) have been recently applied in the field of mobile edge computing (MEC) to improve the data exchange environment by proactively changing the wireless channels through maneuverable location deployment and intelligent signals reflection, respectively.

Edge-computing

Flexible Sampling for Long-tailed Skin Lesion Classification

no code implementations7 Apr 2022 Lie Ju, Yicheng Wu, Lin Wang, Zhen Yu, Xin Zhao, Xin Wang, Paul Bonnington, ZongYuan Ge

To address this, in this paper, we propose a curriculum learning-based framework called Flexible Sampling for the long-tailed skin lesion classification task.

Classification Lesion Classification +1

The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance

no code implementations11 Apr 2022 Lin Zhang, Xin Wang, Erica Cooper, Nicholas Evans, Junichi Yamagishi

Since the short spoofed speech segments to be embedded by attackers are of variable length, six different temporal resolutions are considered, ranging from as short as 20 ms to as large as 640 ms. Third, we propose a new CM that enables the simultaneous use of the segment-level labels at different temporal resolutions as well as utterance-level labels to execute utterance- and segment-level detection at the same time.

Speaker Verification Speech Synthesis +2

Receiver Design for MIMO Unsourced Random Access with SKP Coding

no code implementations30 Apr 2022 Zeyu Han, Xiaojun Yuan, Chongbin Xu, Xin Wang

In this letter, we extend the sparse Kronecker-product (SKP) coding scheme, originally designed for the additive white Gaussian noise (AWGN) channel, to multiple input multiple output (MIMO) unsourced random access (URA).

CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training

no code implementations Findings (NAACL) 2022 Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu

Specifically, we first extract multiple code views using compiler tools, and learn the complementary information among them under a contrastive learning framework.

Contrastive Learning Defect Detection +2

Fundamental limitations on optimization in variational quantum algorithms

no code implementations10 May 2022 Hao-Kai Zhang, Chengkai Zhu, Geng Liu, Xin Wang

Exploring quantum applications of near-term quantum devices is a rapidly growing field of quantum information science with both theoretical and practical interests.

An Edge-Cloud Integrated Framework for Flexible and Dynamic Stream Analytics

no code implementations10 May 2022 Xin Wang, Azim Khan, Jianwu Wang, Aryya Gangopadhyay, Carl E. Busart, Jade Freeman

In this paper, we study how to best leverage edge and cloud resources to achieve better accuracy and latency for stream analytics using a type of RNN model called long short-term memory (LSTM).

Cloud Computing Edge-computing +3

OIE@OIA: an Adaptable and Efficient Open Information Extraction Framework

no code implementations ACL 2022 Xin Wang, Minlong Peng, Mingming Sun, Ping Li

OIE@OIA follows the methodology of Open Information eXpression (OIX): parsing a sentence to an Open Information Annotation (OIA) Graph and then adapting the OIA graph to different OIE tasks with simple rules.

Open Information Extraction Sentence

Open-Eye: An Open Platform to Study Human Performance on Identifying AI-Synthesized Faces

no code implementations13 May 2022 Hui Guo, Shu Hu, Xin Wang, Ming-Ching Chang, Siwei Lyu

In this work, we develop an online platform called Open-eye to study the human performance of AI-synthesized face detection.

Face Detection

Power and limitations of single-qubit native quantum neural networks

no code implementations16 May 2022 Zhan Yu, Hongshun Yao, Mujin Li, Xin Wang

Quantum neural networks (QNNs) have emerged as a leading strategy to establish applications in machine learning, chemistry, and optimization.

Spatial Attention-based Implicit Neural Representation for Arbitrary Reduction of MRI Slice Spacing

no code implementations23 May 2022 Xin Wang, Sheng Wang, Honglin Xiong, Kai Xuan, Zixu Zhuang, Mengjun Liu, Zhenrong Shen, Xiangyu Zhao, Lichi Zhang, Qian Wang

Magnetic resonance (MR) images collected in 2D clinical protocols typically have large inter-slice spacing, resulting in high in-plane resolution and reduced through-plane resolution.

Computational Efficiency Super-Resolution

Mitigating barren plateaus of variational quantum eigensolvers

no code implementations26 May 2022 Xia Liu, Geng Liu, Jiaxin Huang, Hao-Kai Zhang, Xin Wang

Variational quantum algorithms (VQAs) are expected to establish valuable applications on near-term quantum computers.

Deep Learning-based Massive MIMO CSI Acquisition for 5G Evolution and 6G

no code implementations10 Jun 2022 Xin Wang, Xiaolin Hou, Lan Chen, Yoshihisa Kishiyama, Takahiro Asai

Considering its large impact on air-interface design, it will be a candidate technology for 6th generation (6G) networks, in which an air interface designed by artificial intelligence can be used.

Concentration of Data Encoding in Parameterized Quantum Circuits

no code implementations16 Jun 2022 Guangxi Li, Ruilin Ye, Xuanqiang Zhao, Xin Wang

This result in particular implies that the average encoded state will concentrate on the maximally mixed state at an exponential speed on depth.

Combinatorial Optimization

Dependency Position Encoding for Relation Extraction

no code implementations Findings (NAACL) 2022 Qiushi Guo, Xin Wang, Dehong Gao

Leveraging the dependency tree of the input sentence is able to improve the model performance for relation extraction.

Position Relation +2

Enhanced brain structure-function tethering in transmodal cortex revealed by high-frequency eigenmodes

no code implementations7 Jul 2022 Yaqian Yang, Zhiming Zheng, Longzhao Liu, Hongwei Zheng, Yi Zhen, Yi Zheng, Xin Wang, Shaoting Tang

Specifically, low-frequency eigenmodes, which are considered sufficient to capture the essence of the functional network, contribute little to functional connectivity reconstruction in transmodal regions, resulting in structure-function decoupling along the unimodal-transmodal gradient.

Rank-based Decomposable Losses in Machine Learning: A Survey

no code implementations18 Jul 2022 Shu Hu, Xin Wang, Siwei Lyu

Following these categories, we review the literature on rank-based aggregate losses and rank-based individual losses.

BIG-bench Machine Learning

Scene Recognition with Objectness, Attribute and Category Learning

no code implementations20 Jul 2022 Ji Zhang, Jean-Paul Ainam, Li-hui Zhao, Wenai Song, Xin Wang

Based on the complementarity of attribute and category labels, we propose a Multi-task Attribute-Scene Recognition (MASR) network which learns a category embedding and at the same time predicts scene attributes.

Attribute Scene Classification +1

Proving Common Mechanisms Shared by Twelve Methods of Boosting Adversarial Transferability

no code implementations24 Jul 2022 Quanshi Zhang, Xin Wang, Jie Ren, Xu Cheng, Shuyun Lin, Yisen Wang, Xiangming Zhu

This paper summarizes the common mechanism shared by twelve previous transferability-boosting methods in a unified view, i. e., these methods all reduce game-theoretic interactions between regional adversarial perturbations.

Trajectory Planning of Cellular-Connected UAV for Communication-assisted Radar Sensing

no code implementations27 Jul 2022 Shuyan Hu, Xin Yuan, Wei Ni, Xin Wang

Being a key technology for beyond fifth-generation wireless systems, joint communication and radar sensing (JCAS) utilizes the reflections of communication signals to detect foreign objects and deliver situational awareness.

Trajectory Planning

Event-triggered Consensus Control of Heterogeneous Multi-agent Systems: Model- and Data-based Analysis

no code implementations1 Aug 2022 Xin Wang, Jian Sun, Gang Wang, Jie Chen

This article deals with model- and data-based consensus control of heterogenous leader-following multi-agent systems (MASs) under an event-triggering transmission scheme.

A Theoretical View on Sparsely Activated Networks

no code implementations8 Aug 2022 Cenk Baykal, Nishanth Dikkala, Rina Panigrahy, Cyrus Rashtchian, Xin Wang

After representing LSH-based sparse networks with our model, we prove that sparse networks can match the approximation power of dense networks on Lipschitz functions.

Cannot find the paper you are looking for? You can Submit a new open access paper.