Search Results for author: Wei zhang

Found 524 papers, 160 papers with code

Modeling 4D fMRI Data via Spatio-Temporal Convolutional Neural Networks (ST-CNN)

no code implementations31 May 2018 Yu Zhao, Xiang Li, Wei zhang, Shijie Zhao, Milad Makkie, Mo Zhang, Quanzheng Li, Tianming Liu

Simultaneous modeling of the spatio-temporal variation patterns of brain functional network from 4D fMRI data has been an important yet challenging problem for the field of cognitive neuroscience and medical image analysis.

Brain Decoding

SEE: Syntax-aware Entity Embedding for Neural Relation Extraction

no code implementations11 Jan 2018 Zhengqiu He, Wenliang Chen, Zhenghua Li, Meishan Zhang, Wei zhang, Min Zhang

First, we encode the context of entities on a dependency tree as sentence-level entity embedding based on tree-GRU.

Relation Relation Classification +3

AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training

no code implementations7 Dec 2017 Chia-Yu Chen, Jungwook Choi, Daniel Brand, Ankur Agrawal, Wei zhang, Kailash Gopalakrishnan

Highly distributed training of Deep Neural Networks (DNNs) on future compute platforms (offering 100 of TeraOps/s of computational capacity) is expected to be severely communication constrained.

Quantization

DeepSkeleton: Skeleton Map for 3D Human Pose Regression

no code implementations29 Nov 2017 Qingfu Wan, Wei zhang, xiangyang xue

For the first time, we show that training regression network from skeleton map alone is capable of meeting the performance of state-of-theart 3D human pose estimation works.

2D Human Pose Estimation 3D Human Pose Estimation +1

GaDei: On Scale-up Training As A Service For Deep Learning

no code implementations18 Nov 2016 Wei Zhang, Minwei Feng, Yunhui Zheng, Yufei Ren, Yandong Wang, Ji Liu, Peng Liu, Bing Xiang, Li Zhang, Bo-Wen Zhou, Fei Wang

By evaluating the NLC workloads, we show that only the conservative hyper-parameter setup (e. g., small mini-batch size and small learning rate) can guarantee acceptable model accuracy for a wide range of customers.

Embedding Visual Hierarchy with Deep Networks for Large-Scale Visual Recognition

no code implementations8 Jul 2017 Tianyi Zhao, Baopeng Zhang, Wei zhang, Ning Zhou, Jun Yu, Jianping Fan

Our LMM model can provide an end-to-end approach for jointly learning: (a) the deep networks to extract more discriminative deep features for image and object class representation; (b) the tree classifier for recognizing large numbers of object classes hierarchically; and (c) the visual hierarchy adaptation for achieving more accurate indexing of large numbers of object classes hierarchically.

Object Object Recognition

Deep Mixture of Diverse Experts for Large-Scale Visual Recognition

no code implementations24 Jun 2017 Tianyi Zhao, Jun Yu, Zhenzhong Kuang, Wei zhang, Jianping Fan

In this paper, a deep mixture of diverse experts algorithm is developed for seamlessly combining a set of base deep CNNs (convolutional neural networks) with diverse outputs (task spaces), e. g., such base deep CNNs are trained to recognize different subsets of tens of thousands of atomic object classes.

Multi-Task Learning Object +1

Application of Multi-channel 3D-cube Successive Convolution Network for Convective Storm Nowcasting

no code implementations15 Feb 2017 Wei Zhang, Lei Han, Juanzhen Sun, Hanyang Guo, Jie Dai

This paper describes the first attempt to nowcast storm initiation, growth, and advection simultaneously under a deep learning framework using multi-source meteorological data.

Feature Engineering

Low-rank Label Propagation for Semi-supervised Learning with 100 Millions Samples

no code implementations28 Feb 2017 Raphael Petegrosso, Wei zhang, Zhuliu Li, Yousef Saad, Rui Kuang

The success of semi-supervised learning crucially relies on the scalability to a huge amount of unlabelled data that are needed to capture the underlying manifold structure for better classification.

Learning Compact Appearance Representation for Video-based Person Re-Identification

no code implementations21 Feb 2017 Wei Zhang, Shengnan Hu, Kan Liu, Zheng-Jun Zha

This paper presents a novel approach for video-based person re-identification using multiple Convolutional Neural Networks (CNNs).

Video-Based Person Re-Identification

PIGMIL: Positive Instance Detection via Graph Updating for Multiple Instance Learning

no code implementations12 Dec 2016 Dongkuan Xu, Jia Wu, Wei zhang, Yingjie Tian

To the end, we propose a positive instance detection via graph updating for multiple instance learning, called PIGMIL, to detect TPI accurately.

Multiple Instance Learning

Integrating Topic Models and Latent Factors for Recommendation

no code implementations28 Oct 2016 Danis J. Wilson, Wei zhang

In this work, we consider the problem of hotel recommendation for travel planning services by integrating the location information and the user's preference for recommendation.

Collaborative Filtering Recommendation Systems +1

End-to-End Answer Chunk Extraction and Ranking for Reading Comprehension

no code implementations31 Oct 2016 Yang Yu, Wei zhang, Kazi Hasan, Mo Yu, Bing Xiang, Bo-Wen Zhou

This paper proposes dynamic chunk reader (DCR), an end-to-end neural reading comprehension (RC) model that is able to extract and rank a set of answer candidates from a given document to answer questions.

Question Answering Reading Comprehension

Deep Kinematic Pose Regression

no code implementations17 Sep 2016 Xingyi Zhou, Xiao Sun, Wei zhang, Shuang Liang, Yichen Wei

In this work, we propose to directly embed a kinematic object model into the deep neutral network learning for general articulated object pose estimation.

3D Human Pose Estimation Object +2

Empirical Study on Deep Learning Models for Question Answering

no code implementations26 Oct 2015 Yang Yu, Wei zhang, Chung-Wei Hang, Bing Xiang, Bo-Wen Zhou

In this paper we explore deep learning models with memory component or attention mechanism for question answering task.

Machine Translation Question Answering +1

Structured Memory for Neural Turing Machines

no code implementations14 Oct 2015 Wei Zhang, Yang Yu, Bo-Wen Zhou

Neural Turing Machines (NTM) contain memory component that simulates "working memory" in the brain to store and retrieve information to ease simple algorithms learning.

Network-based Isoform Quantification with RNA-Seq Data for Cancer Transcriptome Analysis

no code implementations20 Mar 2014 Wei Zhang, Jae-Woong Chang, Lilong Lin, Kay Minn, Baolin Wu, Jeremy Chien, Jeongsik Yong, Hui Zheng, Rui Kuang

Based on our observation that the abundances of the neighboring isoforms by domain-domain interactions in the network are positively correlated, Net-RSTQ models the expression of the neighboring transcripts as Dirichlet priors on the likelihood of the observed read alignments against the transcripts in one gene.

Recognizing Extended Spatiotemporal Expressions by Actively Trained Average Perceptron Ensembles

no code implementations19 Aug 2015 Wei Zhang, Yang Yu, Osho Gupta, Judith Gelernter

We collected and annotated data set by querying commercial web searches API with such spatiotemporal expressions as were missed by state-of-the- art parsers.

Active Learning Ensemble Learning +1

Exploring Metaphorical Senses and Word Representations for Identifying Metonyms

no code implementations19 Aug 2015 Wei Zhang, Judith Gelernter

A metonym is a word with a figurative meaning, similar to a metaphor.

Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation

no code implementations4 Jul 2018 Lu Wang, Wei zhang, Xiaofeng He, Hongyuan Zha

Prior relevant studies recommend treatments either use supervised learning (e. g. matching the indicator signal which denotes doctor prescriptions), or reinforcement learning (e. g. maximizing evaluation signal which indicates cumulative reward from survival rates).

Recommendation Systems reinforcement-learning +1

Non-locally Enhanced Encoder-Decoder Network for Single Image De-raining

no code implementations4 Aug 2018 Guanbin Li, Xiang He, Wei zhang, Huiyou Chang, Le Dong, Liang Lin

Single image rain streaks removal has recently witnessed substantial progress due to the development of deep convolutional neural networks.

Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos

no code implementations15 Aug 2018 Zhaoyang Zhang, Zhanghui Kuang, Ping Luo, Litong Feng, Wei zhang

Secondly, TSD significantly reduces the computations to run video action recognition with compressed frames on the cloud, while maintaining high recognition accuracies.

Action Recognition In Videos Temporal Action Localization

Solving Pictorial Jigsaw Puzzle by Stigmergy-inspired Internet-based Human Collective Intelligence

no code implementations28 Nov 2018 Bo Shen, Wei zhang, Haiyan Zhao, Zhi Jin, Yanhong Wu

And through feedback, each player is provided with personalized feedback information based on the current COG and the player's exploration result, in order to accelerate his/her puzzle-solving process.

Label-Free Distant Supervision for Relation Extraction via Knowledge Graph Embedding

no code implementations EMNLP 2018 Guanying Wang, Wen Zhang, Ruoxu Wang, Yalin Zhou, Xi Chen, Wei zhang, Hai Zhu, Huajun Chen

This paper proposes a label-free distant supervision method, which makes no use of the relation labels under this inadequate assumption, but only uses the prior knowledge derived from the KG to supervise the learning of the classifier directly and softly.

Knowledge Graph Embedding Relation +3

Learning to Decompose Compound Questions with Reinforcement Learning

no code implementations ICLR 2019 Haihong Yang, Han Wang, Shuang Guo, Wei zhang, Huajun Chen

Our model consists of two parts: (i) a novel learning-to-decompose agent that learns a policy to decompose a compound question into simple questions and (ii) three independent simple-question answerers that classify the corresponding relations for each simple question.

Question Answering reinforcement-learning +1

Deep Boosting of Diverse Experts

no code implementations ICLR 2018 Wei Zhang, Qiuyu Chen, Jun Yu, Jianping Fan

In this paper, a deep boosting algorithm is developed to learn more discriminative ensemble classifier by seamlessly combining a set of base deep CNNs (base experts) with diverse capabilities, e. g., these base deep CNNs are sequentially trained to recognize a set of object classes in an easy-to-hard way according to their learning complexities.

Object Recognition

Edge-Semantic Learning Strategy for Layout Estimation in Indoor Environment

no code implementations3 Jan 2019 Weidong Zhang, Wei zhang, Jason Gu

More specifically, we present an encoder-decoder network with shared encoder and two separate decoders, which are composed of multiple deconvolution (transposed convolution) layers, to jointly learn the edge maps and semantic labels of a room image.

MSR: Multi-Scale Shape Regression for Scene Text Detection

no code implementations9 Jan 2019 Chuhui Xue, Shijian Lu, Wei zhang

State-of-the-art scene text detection techniques predict quadrilateral boxes that are prone to localization errors while dealing with straight or curved text lines of different orientations and lengths in scenes.

regression Scene Text Detection +1

Binarized Mode Seeking for Scalable Visual Pattern Discovery

no code implementations CVPR 2017 Wei Zhang, Xiaochun Cao, Rui Wang, Yuanfang Guo, Zhineng Chen

Second, we further extend bMS to a more general form, namely contrastive binary mean shift (cbMS), which maximizes the contrastive density in binary space, for finding informative patterns that are both frequent and discriminative for the dataset.

Multiple Granularity Descriptors for Fine-Grained Categorization

no code implementations ICCV 2015 Dequan Wang, Zhiqiang Shen, Jie Shao, Wei zhang, xiangyang xue, Zheng Zhang

Fine-grained categorization, which aims to distinguish subordinate-level categories such as bird species or dog breeds, is an extremely challenging task.

A Spatio-Temporal Appearance Representation for Viceo-Based Pedestrian Re-Identification

no code implementations ICCV 2015 Kan Liu, Bingpeng Ma, Wei zhang, Rui Huang

Pedestrian re-identification is a difficult problem due to the large variations in a person's appearance caused by different poses and viewpoints, illumination changes, and occlusions.

VrR-VG: Refocusing Visually-Relevant Relationships

no code implementations ICCV 2019 Yuanzhi Liang, Yalong Bai, Wei zhang, Xueming Qian, Li Zhu, Tao Mei

Relationships encode the interactions among individual instances, and play a critical role in deep visual scene understanding.

Image Captioning Question Answering +3

Hierarchical Photo-Scene Encoder for Album Storytelling

no code implementations2 Feb 2019 Bairui Wang, Lin Ma, Wei zhang, Wenhao Jiang, Feng Zhang

In this paper, we propose a novel model with a hierarchical photo-scene encoder and a reconstructor for the task of album storytelling.

Image-guided Story Ending Generation

Long-tail Relation Extraction via Knowledge Graph Embeddings and Graph Convolution Networks

no code implementations NAACL 2019 Ningyu Zhang, Shumin Deng, Zhanlin Sun, Guanying Wang, Xi Chen, Wei zhang, Huajun Chen

Here, the challenge is to learn accurate "few-shot" models for classes existing at the tail of the class distribution, for which little data is available.

Knowledge Graph Embeddings Relation +1

Interaction Embeddings for Prediction and Explanation in Knowledge Graphs

no code implementations12 Mar 2019 Wen Zhang, Bibek Paudel, Wei zhang, Abraham Bernstein, Huajun Chen

Knowledge graph embedding aims to learn distributed representations for entities and relations, and is proven to be effective in many applications.

Knowledge Graph Embedding Knowledge Graphs +1

Iteratively Learning Embeddings and Rules for Knowledge Graph Reasoning

no code implementations21 Mar 2019 Wen Zhang, Bibek Paudel, Liang Wang, Jiaoyan Chen, Hai Zhu, Wei zhang, Abraham Bernstein, Huajun Chen

We also evaluate the efficiency of rule learning and quality of rules from IterE compared with AMIE+, showing that IterE is capable of generating high quality rules more efficiently.

Entity Embeddings Knowledge Graphs +1

Distributed Deep Learning Strategies For Automatic Speech Recognition

no code implementations10 Apr 2019 Wei Zhang, Xiaodong Cui, Ulrich Finkler, Brian Kingsbury, George Saon, David Kung, Michael Picheny

We show that we can train the LSTM model using ADPSGD in 14 hours with 16 NVIDIA P100 GPUs to reach a 7. 6% WER on the Hub5- 2000 Switchboard (SWB) test set and a 13. 1% WER on the CallHome (CH) test set.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Anti-Confusing: Region-Aware Network for Human Pose Estimation

no code implementations3 May 2019 Xuan Cao, Yanhao Ge, Ying Tai, Wei zhang, Jian Li, Chengjie Wang, Jilin Li, Feiyue Huang

In this work, we propose a novel framework named Region-Aware Network (RANet), which learns the ability of anti-confusing in case of heavy occlusion, nearby person and symmetric appearance, for human pose estimation.

Data Augmentation Pose Estimation

Machine Learning Based Routing Congestion Prediction in FPGA High-Level Synthesis

no code implementations6 May 2019 Jieru Zhao, Tingyuan Liang, Sharad Sinha, Wei zhang

Early and accurate congestion estimation is of great benefit to guide the optimization in HLS and improve the efficiency of implementation.

BIG-bench Machine Learning Face Detection +1

Predictive Ensemble Learning with Application to Scene Text Detection

no code implementations12 May 2019 Danlu Chen, Xu-Yao Zhang, Wei zhang, Yao Lu, Xiuli Li, Tao Mei

Taking scene text detection as the application, where no suitable ensemble learning strategy exists, PEL can significantly improve the performance, compared to either individual state-of-the-art models, or the fusion of multiple models by non-maximum suppression.

Classification Ensemble Learning +5

Correlative Multi-Label Multi-Instance Image Annotation

no code implementations IEEE International Conference on Computer Vision 2011 Xiangyang Xue, Wei zhang, Jie Zhang, Bin Wu, Jianping Fan, Yao Lu

The cross-level label coherence en-codes the consistency between the labels at the image leveland the labels at the region level.

Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning

no code implementations3 Jun 2019 Wei Zhang, Bairui Wang, Lin Ma, Wei Liu

Unlike previous video captioning work mainly exploiting the cues of video contents to make a language description, we propose a reconstruction network (RecNet) in a novel encoder-decoder-reconstructor architecture, which leverages both forward (video to sentence) and backward (sentence to video) flows for video captioning.

reinforcement-learning Reinforcement Learning (RL) +2

Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition

no code implementations10 Jul 2019 Khoi-Nguyen C. Mac, Xiaodong Cui, Wei zhang, Michael Picheny

In automatic speech recognition (ASR), wideband (WB) and narrowband (NB) speech signals with different sampling rates typically use separate acoustic models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition

no code implementations10 Jul 2019 Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny

On commonly used public SWB-300 and SWB-2000 ASR datasets, ADPSGD can converge with a batch size 3X as large as the one used in SSGD, thus enable training at a much larger scale.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Hard-Aware Fashion Attribute Classification

no code implementations25 Jul 2019 Yun Ye, Yixin Li, Bo Wu, Wei zhang, Ling-Yu Duan, Tao Mei

For "hard" attributes with insufficient training data, Deact brings more stable synthetic samples for training and further improve the performance.

Attribute Classification +1

Road Context-aware Intrusion Detection System for Autonomous Cars

no code implementations2 Aug 2019 Jingxuan Jiang, Chundong Wang, Sudipta Chattopadhyay, Wei zhang

With such ongoing road context, RAIDS validates corresponding frames observed on the in-vehicle network.

Intrusion Detection

Transfer Learning for Relation Extraction via Relation-Gated Adversarial Learning

no code implementations22 Aug 2019 Ningyu Zhang, Shumin Deng, Zhanlin Sun, Jiaoyan Chen, Wei zhang, Huajun Chen

However, the human annotation is expensive, while human-crafted patterns suffer from semantic drift and distant supervision samples are usually noisy.

Partial Domain Adaptation Relation +2

Fake Colorized Image Detection

no code implementations9 Jan 2018 Yuanfang Guo, Xiaochun Cao, Wei zhang, Rui Wang

Based on our observations, i. e., potential traces in the hue, saturation, dark and bright channels, we propose two simple yet effective detection methods for fake colorized images: Histogram based Fake Colorized Image Detection (FCID-HIST) and Feature Encoding based Fake Colorized Image Detection (FCID-FE).

Multimedia

Relationship-Aware Spatial Perception Fusion for Realistic Scene Layout Generation

no code implementations2 Sep 2019 Hongdong Zheng, Yalong Bai, Wei zhang, Tao Mei

In our framework, a spatial constraint module is designed to fit reasonable scaling and spatial layout of object pairs with considering relationship between them.

Image Generation Object

Learning Actions from Human Demonstration Video for Robotic Manipulation

no code implementations10 Sep 2019 Shuo Yang, Wei zhang, Weizhi Lu, Hesheng Wang, Yibin Li

However, the general video captioning methods focus more on the understanding of the full frame, lacking of consideration on the specific object of interests in robotic manipulations.

Video Captioning

A Three-dimensional Convolutional-Recurrent Network for Convective Storm Nowcasting

no code implementations1 Oct 2019 Wei Zhang, Wei Li, Lei Han

Very short-term convective storm forecasting, termed nowcasting, has long been an important issue and has attracted substantial interest.

Feature Engineering

Learning Robust Representations with Graph Denoising Policy Network

no code implementations4 Oct 2019 Lu Wang, Wenchao Yu, Wei Wang, Wei Cheng, Wei zhang, Hongyuan Zha, Xiaofeng He, Haifeng Chen

Graph representation learning, aiming to learn low-dimensional representations which capture the geometric dependencies between nodes in the original graph, has gained increasing popularity in a variety of graph analysis tasks, including node classification and link prediction.

Denoising Graph Representation Learning +2

Hybrid Zero Dynamics Inspired Feedback Control Policy Design for 3D Bipedal Locomotion using Reinforcement Learning

no code implementations3 Oct 2019 Guillermo A. Castillo, Bowen Weng, Wei zhang, Ayonga Hereid

This paper presents a novel model-free reinforcement learning (RL) framework to design feedback control policies for 3D bipedal walking.

Reinforcement Learning (RL)

Constrained Non-Affine Alignment of Embeddings

no code implementations13 Oct 2019 Yuwei Wang, Yan Zheng, Yanqing Peng, Chin-Chia Michael Yeh, Zhongfang Zhuang, Das Mahashweta, Bendre Mangesh, Feifei Li, Wei zhang, Jeff M. Phillips

Embeddings are already essential tools for large language models and image analysis, and their use is being extended to many other research domains.

History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

no code implementations ICML 2020 Kaiyi Ji, Zhe Wang, Bowen Weng, Yi Zhou, Wei zhang, Yingbin Liang

In this paper, we propose a novel scheme, which eliminates backtracking line search but still exploits the information along optimization path by adapting the batch size via history stochastic gradients.

A Decentralized Parallel Algorithm for Training Generative Adversarial Nets

no code implementations NeurIPS 2020 Mingrui Liu, Wei zhang, Youssef Mroueh, Xiaodong Cui, Jerret Ross, Tianbao Yang, Payel Das

Despite recent progress on decentralized algorithms for training deep neural networks, it remains unclear whether it is possible to train GANs in a decentralized manner.

A Machine Learning Nowcasting Method based on Real-time Reanalysis Data

no code implementations14 Sep 2016 Lei Han, Juanzhen Sun, Wei zhang, Yuanyuan Xiu, Hailei Feng, Yinjing Lin

Despite marked progress over the past several decades, convective storm nowcasting remains a challenge because most nowcasting systems are based on linear extrapolation of radar reflectivity without much consideration for other meteorological fields.

BIG-bench Machine Learning Open-Ended Question Answering

Long-distance distribution of atom-photon entanglement at telecom wavelength

no code implementations3 Sep 2019 Tim van Leent, Matthias Bock, Robert Garthoff, Kai Redeker, Wei zhang, Tobias Bauer, Wenjamin Rosenfeld, Christoph Becher, Harald Weinfurter

Here we report on the generation and observation of entanglement between a Rb-87 atom and a photon at telecom wavelength over 20 km optical fiber.

Quantum Physics

Relation Adversarial Network for Low Resource Knowledge Graph Completion

no code implementations8 Nov 2019 Ningyu Zhang, Shumin Deng, Zhanlin Sun, Jiaoayan Chen, Wei zhang, Huajun Chen

Specifically, the framework takes advantage of a relation discriminator to distinguish between samples from different relations, and help learn relation-invariant features more transferable from source relations to target relations.

Link Prediction Partial Domain Adaptation +2

SubCharacter Chinese-English Neural Machine Translation with Wubi encoding

no code implementations7 Nov 2019 Wei Zhang, Feifei Lin, Xiaodong Wang, Zhenshuang Liang, Zhen Huang

However, when the translation task involves Chinese, semantic granularity remains at the word and character level, so there is still need more fine-grained translation model of Chinese.

Machine Translation Model Compression +2

Furnishing Your Room by What You See: An End-to-End Furniture Set Retrieval Framework with Rich Annotated Benchmark Dataset

no code implementations21 Nov 2019 Bingyuan Liu, Jiantao Zhang, Xiaoting Zhang, Wei zhang, Chuanhui Yu, Yuan Zhou

However, few works focus on the understanding of furniture within the scenes and a large-scale dataset is also lacked to advance the field.

Retrieval

SM-NAS: Structural-to-Modular Neural Architecture Search for Object Detection

no code implementations22 Nov 2019 Lewei Yao, Hang Xu, Wei zhang, Xiaodan Liang, Zhenguo Li

In this paper, we present a two-stage coarse-to-fine searching strategy named Structural-to-Modular NAS (SM-NAS) for searching a GPU-friendly design of both an efficient combination of modules and better modular-level architecture for object detection.

Neural Architecture Search Object +2

Learning Efficient Video Representation with Video Shuffle Networks

no code implementations26 Nov 2019 Pingchuan Ma, Yao Zhou, Yu Lu, Wei zhang

To this end, we propose the video shuffle, a parameter-free plug-in component that efficiently reallocates the inputs of 2D convolution so that its receptive field can be extended to the temporal dimension.

Video Recognition

Improving Neural Relation Extraction with Positive and Unlabeled Learning

no code implementations28 Nov 2019 Zhengqiu He, Wenliang Chen, Yuyi Wang, Wei zhang, Guanchun Wang, Min Zhang

We present a novel approach to improve the performance of distant supervision relation extraction with Positive and Unlabeled (PU) Learning.

reinforcement-learning Reinforcement Learning (RL) +3

Attentive Representation Learning with Adversarial Training for Short Text Clustering

no code implementations8 Dec 2019 Wei Zhang, Chao Dong, Jianhua Yin, Jianyong Wang

Relying on this, the representation learning and clustering for short texts are seamlessly integrated into a unified model.

Clustering Information Retrieval +3

Zooming into Face Forensics: A Pixel-level Analysis

no code implementations12 Dec 2019 Jia Li, Tong Shen, Wei zhang, Hui Ren, Dan Zeng, Tao Mei

The stunning progress in face manipulation methods has made it possible to synthesize realistic fake face images, which poses potential threats to our society.

Classification General Classification +1

Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets

no code implementations ICLR 2020 Mingrui Liu, Youssef Mroueh, Jerret Ross, Wei zhang, Xiaodong Cui, Payel Das, Tianbao Yang

Then we propose an adaptive variant of OSG named Optimistic Adagrad (OAdagrad) and reveal an \emph{improved} adaptive complexity $O\left(\epsilon^{-\frac{2}{1-\alpha}}\right)$, where $\alpha$ characterizes the growth rate of the cumulative stochastic gradient and $0\leq \alpha\leq 1/2$.

Vision and Language: from Visual Perception to Content Creation

no code implementations26 Dec 2019 Tao Mei, Wei zhang, Ting Yao

The real-world deployment or services of vision and language are elaborated as well.

Question Answering valid +3

How Does BN Increase Collapsed Neural Network Filters?

no code implementations30 Jan 2020 Sheng Zhou, Xinjiang Wang, Ping Luo, Litong Feng, Wenjie Li, Wei zhang

This phenomenon is caused by the normalization effect of BN, which induces a non-trainable region in the parameter space and reduces the network capacity as a result.

object-detection Object Detection

Learning Long- and Short-Term User Literal-Preference with Multimodal Hierarchical Transformer Network for Personalized Image Caption

no code implementations AAAI Conference on Artificial Intelligence (AAAI 2020) 2020 Wei Zhang, Yue Ying, Pan Lu, Hongyuan Zha

Personalized image caption, a natural extension of the standard image caption task, requires to generate brief image descriptions tailored for users’ writing style and traits, and is more practical to meet users’ real demands.

Image Captioning

Improving Efficiency in Large-Scale Decentralized Distributed Training

no code implementations4 Feb 2020 Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David Kung, Michael Picheny

Decentralized Parallel SGD (D-PSGD) and its asynchronous variant Asynchronous Parallel SGD (AD-PSGD) is a family of distributed learning algorithms that have been demonstrated to perform well for large-scale deep learning tasks.

speech-recognition Speech Recognition

Non-asymptotic Convergence of Adam-type Reinforcement Learning Algorithms under Markovian Sampling

no code implementations15 Feb 2020 Huaqing Xiong, Tengyu Xu, Yingbin Liang, Wei zhang

Despite the wide applications of Adam in reinforcement learning (RL), the theoretical convergence of Adam-type RL algorithms has not been established.

reinforcement-learning Reinforcement Learning (RL)

Syndrome-aware Herb Recommendation with Multi-Graph Convolution Network

no code implementations20 Feb 2020 Yuanyuan Jin, Wei zhang, Xiangnan He, Xinyu Wang, Xiaoling Wang

Given a set of symptoms to treat, we aim to generate an overall syndrome representation by effectively fusing the embeddings of all the symptoms in the set, to mimic how a doctor induces the syndromes.

Adaptive Fractional Dilated Convolution Network for Image Aesthetics Assessment

no code implementations CVPR 2020 Qiuyu Chen, Wei zhang, Ning Zhou, Peng Lei, Yi Xu, Yu Zheng, Jianping Fan

Specifically, the fractional dilated kernel is adaptively constructed according to the image aspect ratios, where the interpolation of nearest two integers dilated kernels is used to cope with the misalignment of fractional sampling.

Learning from a Lightweight Teacher for Efficient Knowledge Distillation

no code implementations19 May 2020 Yuang Liu, Wei zhang, Jun Wang

Knowledge Distillation (KD) is an effective framework for compressing deep learning models, realized by a student-teacher paradigm requiring small student networks to mimic the soft target generated by well-trained teachers.

Knowledge Distillation

Map Generation from Large Scale Incomplete and Inaccurate Data Labels

no code implementations20 May 2020 Rui Zhang, Conrad Albrecht, Wei zhang, Xiaodong Cui, Ulrich Finkler, David Kung, Siyuan Lu

Accurately and globally mapping human infrastructure is an important and challenging task with applications in routing, regulation compliance monitoring, and natural disaster response management etc..

Disaster Response Management

FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications

no code implementations5 Jun 2020 Jieru Zhao, Tingyuan Liang, Liang Feng, Wenchao Ding, Sharad Sinha, Wei zhang, Shaojie Shen

To reduce the design effort and achieve the right balance, we propose FP-Stereo for building high-performance stereo matching pipelines on FPGAs automatically.

C++ code Depth Estimation +1

Multi-future Merchant Transaction Prediction

no code implementations10 Jul 2020 Chin-Chia Michael Yeh, Zhongfang Zhuang, Wei zhang, Liang Wang

We use experiments on real-world merchant transaction data to demonstrate the effectiveness of our proposed model.

Fraud Detection Future prediction +3

Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent

no code implementations15 Jul 2020 Bowen Weng, Huaqing Xiong, Yingbin Liang, Wei zhang

In this paper, we first characterize the convergence rate for Q-AMSGrad, which is the Q-learning algorithm with AMSGrad update (a commonly adopted alternative of Adam for theoretical analysis).

Atari Games Q-Learning

Automatic Image Labelling at Pixel Level

no code implementations15 Jul 2020 Xiang Zhang, Wei zhang, Jinye Peng, Jianping Fan

A Guided Filter Network (GFN) is first developed to learn the segmentation knowledge from a source domain, and such GFN then transfers such segmentation knowledge to generate coarse object masks in the target domain.

Image Segmentation Object +2

Research Progress of Convolutional Neural Network and its Application in Object Detection

no code implementations27 Jul 2020 Wei Zhang, Zuoxiang Zeng

With the improvement of computer performance and the increase of data volume, the object detection based on convolutional neural network (CNN) has become the main algorithm for object detection.

Object object-detection +1

Momentum Q-learning with Finite-Sample Convergence Guarantee

no code implementations30 Jul 2020 Bowen Weng, Huaqing Xiong, Lin Zhao, Yingbin Liang, Wei zhang

For the infinite state-action space case, we establish the convergence guarantee for MomentumQ with linear function approximations and Markovian sampling.

Q-Learning

Multi-stream RNN for Merchant Transaction Prediction

no code implementations25 Jul 2020 Zhongfang Zhuang, Chin-Chia Michael Yeh, Liang Wang, Wei zhang, Junpeng Wang

New challenges have surfaced in monitoring and guaranteeing the integrity of payment processing systems.

Fraud Detection Time Series +1

A(DP)$^2$SGD: Asynchronous Decentralized Parallel Stochastic Gradient Descent with Differential Privacy

no code implementations21 Aug 2020 Jie Xu, Wei zhang, Fei Wang

A popular distributed learning strategy is federated learning, where there is a central server storing the global model and a set of local computing nodes updating the model parameters with their corresponding data.

Federated Learning

Products-10K: A Large-scale Product Recognition Dataset

1 code implementation24 Aug 2020 Yalong Bai, Yuxiang Chen, Wei Yu, Linfang Wang, Wei zhang

With the rapid development of electronic commerce, the way of shopping has experienced a revolutionary evolution.

Learn by Observation: Imitation Learning for Drone Patrolling from Videos of A Human Navigator

no code implementations30 Aug 2020 Yue Fan, Shilei Chu, Wei zhang, Ran Song, Yibin Li

Extensive experiments are conducted to demonstrate the accuracy of the proposed imitating learning process as well as the reliability of the holistic system for autonomous drone navigation.

Drone navigation Imitation Learning

Decision Tree Based Hardware Power Monitoring for Run Time Dynamic Power Management in FPGA

no code implementations3 Sep 2020 Zhe Lin, Wei zhang, Sharad Sinha

A flexible architecture of the hardware power monitoring is proposed, which can be instrumented in any RTL design for runtime power estimation, dispensing with the need for extra power measurement devices.

Management

An Ensemble Learning Approach for In-situ Monitoring of FPGA Dynamic Power

no code implementations3 Sep 2020 Zhe Lin, Sharad Sinha, Wei zhang

As field-programmable gate arrays become prevalent in critical application domains, their power consumption is of high concern.

Ensemble Learning Management

Towards Efficient and Scalable Acceleration of Online Decision Tree Learning on FPGA

no code implementations3 Sep 2020 Zhe Lin, Sharad Sinha, Wei zhang

We further present a high-performance, hardware-efficient and scalable online decision tree learning system on a field-programmable gate array (FPGA) with system-level optimization techniques.

HARD-Net: Hardness-AwaRe Discrimination Network for 3D Early Activity Prediction

no code implementations ECCV 2020 Tianjiao Li, Jun Liu, Wei zhang, Ling-Yu Duan

In this paper, we propose a novel Hardness-AwaRe Discrimination Network (HARD-Net) to specifically investigate the relationships between the similar activity pairs that are hard to be discriminated.

Activity Prediction Skeleton Based Action Recognition

DualDE: Dually Distilling Knowledge Graph Embedding for Faster and Cheaper Reasoning

no code implementations13 Sep 2020 Yushan Zhu, Wen Zhang, Mingyang Chen, Hui Chen, Xu Cheng, Wei zhang, Huajun Chen

In DualDE, we propose a soft label evaluation mechanism to adaptively assign different soft label and hard label weights to different triples, and a two-stage distillation approach to improve the student's acceptance of the teacher.

Knowledge Distillation Knowledge Graph Embedding +2

Towards a Flexible Embedding Learning Framework

no code implementations23 Sep 2020 Chin-Chia Michael Yeh, Dhruv Gelda, Zhongfang Zhuang, Yan Zheng, Liang Gou, Wei zhang

Our proposed framework utilizes a set of entity-relation-matrices as the input, which quantifies the affinities among different entities in the database.

Relation Representation Learning

Kernel Based Progressive Distillation for Adder Neural Networks

no code implementations NeurIPS 2020 Yixing Xu, Chang Xu, Xinghao Chen, Wei zhang, Chunjing Xu, Yunhe Wang

A convolutional neural network (CNN) with the same architecture is simultaneously initialized and trained as a teacher network, features and weights of ANN and CNN will be transformed to a new space to eliminate the accuracy drop.

Knowledge Distillation

Finite-Time Analysis for Double Q-learning

no code implementations NeurIPS 2020 Huaqing Xiong, Lin Zhao, Yingbin Liang, Wei zhang

Although Q-learning is one of the most successful algorithms for finding the best action-value function (and thus the optimal policy) in reinforcement learning, its implementation often suffers from large overestimation of Q-function values incurred by random sampling.

Q-Learning

Double Q-learning: New Analysis and Sharper Finite-time Bound

no code implementations1 Jan 2021 Lin Zhao, Huaqing Xiong, Yingbin Liang, Wei zhang

Double Q-learning (Hasselt 2010) has gained significant success in practice due to its effectiveness in overcoming the overestimation issue of Q-learning.

Q-Learning

Why Does Decentralized Training Outperform Synchronous Training In The Large Batch Setting?

no code implementations1 Jan 2021 Wei zhang, Mingrui Liu, Yu Feng, Brian Kingsbury, Yuhai Tu

We conduct extensive studies over 12 state-of-the-art DL models/tasks and demonstrate that DPSGD consistently outperforms SSGD in the large batch setting; and DPSGD converges in cases where SSGD diverges for large learning rates.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

RRL: A Scalable Classifier for Interpretable Rule-Based Representation Learning

no code implementations1 Jan 2021 Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang

Rule-based models, e. g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity.

Representation Learning

How Can Self-Attention Networks Recognize Dyck-n Languages?

no code implementations Findings of the Association for Computational Linguistics 2020 Javid Ebrahimi, Dhruv Gelda, Wei zhang

For $\mathcal{D}_2$, we find that SA$^-$ completely breaks down on long sequences whereas the accuracy of SA$^+$ is 58. 82$\%$.

G-DARTS-A: Groups of Channel Parallel Sampling with Attention

no code implementations16 Oct 2020 Zhaowen Wang, Wei zhang, Zhiming Wang

Differentiable Architecture Search (DARTS) provides a baseline for searching effective network architectures based gradient, but it is accompanied by huge computational overhead in searching and training network architecture.

Bridging Text and Knowledge with Multi-Prototype Embedding for Few-Shot Relational Triple Extraction

no code implementations COLING 2020 Haiyang Yu, Ningyu Zhang, Shumin Deng, Hongbin Ye, Wei zhang, Huajun Chen

Current supervised relational triple extraction approaches require huge amounts of labeled data and thus suffer from poor performance in few-shot settings.

Merchant Category Identification Using Credit Card Transactions

no code implementations5 Nov 2020 Chin-Chia Michael Yeh, Zhongfang Zhuang, Yan Zheng, Liang Wang, Junpeng Wang, Wei zhang

In this work, we approach this problem from a multi-modal learning perspective, where we use not only the merchant time series data but also the information of merchant-merchant relationship (i. e., affinity) to verify the self-reported business type (i. e., merchant category) of a given merchant.

Time Series Time Series Analysis +1

Online Decision Based Visual Tracking via Reinforcement Learning

no code implementations NeurIPS 2020 Ke Song, Wei zhang, Ran Song, Yibin Li

A deep visual tracker is typically based on either object detection or template matching while each of them is only suitable for a particular group of scenes.

Hierarchical Reinforcement Learning object-detection +5

Saikosaponins with similar structures but different mechanisms lead to combined hepatotoxicity

no code implementations14 May 2018 Qianqian Zhang, Wanqiu Huang, Yiqiao Gao, Yingtong Lv, Wei zhang, Zunjian Zhang, Fengguo Xu

Previous studies have revealed that saikosaponins are the major types of components that contribute to the hepatotoxicity of Radix Bupleuri.

Latency Minimization for Multiuser Computation Offloading in Fog-Radio Access Networks

no code implementations20 Jul 2019 Wei zhang, Shafei Wang, Ye Pan, Qiang Li, Jingran Lin, Xiaoxiao Wu

This paper considers computation offloading in fog-radio access networks (F-RAN), where multiple user equipments (UEs) offload their computation tasks to the F-RAN through a number of fog nodes.

Sex Differences in Severity and Mortality Among Patients With COVID-19: Evidence from Pooled Literature Analysis and Insights from Integrated Bioinformatic Analysis

no code implementations30 Mar 2020 Xiyi Wei, Yu-Tian Xiao, Jian Wang, Rui Chen, Wei zhang, Yue Yang, Daojun Lv, Chao Qin, Di Gu, Bo Zhang, Weidong Chen, Jianquan Hou, Ninghong Song, Guohua Zeng, Shancheng Ren

Objective: To conduct a meta-analysis of current studies that examined sex differences in severity and mortality in patients with COVID-19, and identify potential mechanisms underpinning these differences.

Hard-ODT: Hardware-Friendly Online Decision Tree Learning Algorithm and System

no code implementations11 Dec 2020 Zhe Lin, Sharad Sinha, Wei zhang

Following this, we present Hard-ODT, a high-performance, hardware-efficient and scalable online decision tree learning system on a field-programmable gate array (FPGA) with system-level optimization techniques.

Anomalous Hall and Nernst Effects in FeRh

no code implementations28 Dec 2020 Hilal Saglam, Changjiang Liu, Yi Li, Joseph Sklenar, Jonathan Gibbons, Deshun Hong, Vedat Karakas, John E. Pearson, Ozhan Ozatay, Wei zhang, Anand Bhattacharya, Axel Hoffmann

Antiferromagnets with tunable phase transitions are promising for future spintronics applications.

Materials Science

Take More Positives: An Empirical Study of Contrastive Learing in Unsupervised Person Re-Identification

no code implementations12 Jan 2021 Xuanyu He, Wei zhang, Ran Song, Qian Zhang, Xiangyuan Lan, Lin Ma

By studying two unsupervised person re-ID methods in a cross-method way, we point out a hard negative problem is handled implicitly by their designs of data augmentations and PK sampler respectively.

Contrastive Learning Unsupervised Person Re-Identification

AdderNet and its Minimalist Hardware Design for Energy-Efficient Artificial Intelligence

no code implementations25 Jan 2021 Yunhe Wang, Mingqiang Huang, Kai Han, Hanting Chen, Wei zhang, Chunjing Xu, DaCheng Tao

With a comprehensive comparison on the performance, power consumption, hardware resource consumption and network generalization capability, we conclude the AdderNet is able to surpass all the other competitors including the classical CNN, novel memristor-network, XNOR-Net and the shift-kernel based network, indicating its great potential in future high performance and energy-efficient artificial intelligence applications.

Quantization

MUSE: Multi-Scale Temporal Features Evolution for Knowledge Tracing

no code implementations30 Jan 2021 Chengwei Zhang, Yangzhou Jiang, Wei zhang, Chengyu Gu

The proposed model is capable to capture the dynamic changes in users knowledge states at different temporal-ranges, and provides an efficient and powerful way to combine local and global features to make predictions.

Knowledge Tracing

Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation

no code implementations3 Feb 2021 Mingke Xu, Fan Zhang, Xiaodong Cui, Wei zhang

In this paper, we apply multiscale area attention in a deep convolutional neural network to attend emotional characteristics with varied granularities and therefore the classifier can benefit from an ensemble of attentions with different scales.

Data Augmentation Speech Emotion Recognition

Optically synchronized fiber links with spectrally pure integrated lasers

no code implementations11 Feb 2021 Grant M. Brodnik, Mark W. Harrington, John H. Dallyn, Debapam Bose, Wei zhang, Liron Stern, Paul A. Morton, Ryan O. Behunin, Scott B. Papp, Daniel J. Blumenthal

In this paper we report a record low 3x10^-4 rad^2 residual phase error variance for synchronization based on independent, spectrally pure, ultra-high mutual coherence, photonic integrated lasers.

Optics Applied Physics

Massive Self-Assembly in Grid Environments

no code implementations5 Feb 2021 Wenjie Chu, Wei zhang, Haiyan Zhao, Zhi Jin, Hong Mei

Self-assembly plays an essential role in many natural processes, involving the formation and evolution of living or non-living structures, and shows potential applications in many emerging domains.

Multiagent Systems Distributed, Parallel, and Cluster Computing Robotics

Graph-Based Tri-Attention Network for Answer Ranking in CQA

no code implementations5 Mar 2021 Wei zhang, Zeyuan Chen, Chao Dong, Wen Wang, Hongyuan Zha, Jianyong Wang

However, they encounter two main limitations: (1) Correlations between answers in the same question are often overlooked.

Question Answering

IPAPRec: A promising tool for learning high-performance mapless navigation skills with deep reinforcement learning

no code implementations22 Mar 2021 Wei zhang, Yunfeng Zhang, Ning Liu, Kai Ren, Pengfei Wang

This paper studies how to improve the generalization performance and learning speed of the navigation agents trained with deep reinforcement learning (DRL).

Reinforcement Learning (RL)

Social Link Inference via Multi-View Matching Network from Spatio-Temporal Trajectories

no code implementations20 Mar 2021 Wei zhang, Xin Lai, Jianyong Wang

In this paper, we investigate the problem of social link inference in a target Location-aware Social Network (LSN), which aims at predicting the unobserved links between users within the network.

Link Prediction Time Series Analysis

Zero-shot Adversarial Quantization

no code implementations CVPR 2021 Yuang Liu, Wei zhang, Jun Wang

To address the above issues, we propose a zero-shot adversarial quantization (ZAQ) framework, facilitating effective discrepancy estimation and knowledge transfer from a full-precision model to its quantized model.

Ranked #2 on Data Free Quantization on CIFAR-100 (CIFAR-100 W5A5 Top-1 Accuracy metric)

Data Free Quantization Transfer Learning

Fast Beam Training and Alignment for IRS-Assisted Millimeter Wave/Terahertz Systems

no code implementations10 Mar 2021 Peilan Wang, Jun Fang, Wei zhang, Hongbin Li

Intelligent reflecting surface (IRS) has emerged as a competitive solution to address blockage issues in millimeter wave (mmWave) and Terahertz (THz) communications due to its capability of reshaping wireless transmission environments.

Pneumonia after bacterial or viral infection preceded or followed by radiation exposure -- a reanalysis of older radiobiological data and implications for low dose radiotherapy for COVID-19 pneumonia

no code implementations6 Aug 2020 Mark P Little, Wei zhang, Roy van Dusen, Nobuyuki Hamada

For 7 studies that evaluated post-inoculation radiation exposure (more relevant to LDRT for COVID-19 pneumonia) the results are heterogeneous, with 2 studies showing a significant increase (p<0. 001) and another showing a significant decrease (p<0. 001) in mortality associated with radiation exposure.

Source-Free Domain Adaptation for Semantic Segmentation

no code implementations CVPR 2021 Yuang Liu, Wei zhang, Jun Wang

To cope with this issue, we propose a source-free domain adaptation framework for semantic segmentation, namely SFDA, in which only a well-trained source model and an unlabeled target domain dataset are available for adaptation.

Segmentation Self-Supervised Learning +4

Exploiting Relationship for Complex-scene Image Generation

no code implementations1 Apr 2021 Tianyu Hua, Hongdong Zheng, Yalong Bai, Wei zhang, Xiao-Ping Zhang, Tao Mei

Our method tends to synthesize plausible layouts and objects, respecting the interplay of multiple objects in an image.

Image Generation Scene Generation

Jittering Effects Analysis and Beam Training Design for UAV Millimeter Wave Communications

no code implementations13 Apr 2021 Wei Wang, Wei zhang

In beam training design, with the relationship between attitude angles and AoA/AoD, we propose to generate a rough estimate of AoA and AoD from UAV navigation information.

Hybrid Interference Mitigation Using Analog Prewhitening

no code implementations4 Mar 2021 Wei zhang, Yi Jiang, Bin Zhou, Die Hu

This paper proposes a novel scheme for mitigating strong interferences, which is applicable to various wireless scenarios, including full-duplex wireless communications and uncoordinated heterogenous networks.

Billion-scale Pre-trained E-commerce Product Knowledge Graph Model

no code implementations2 May 2021 Wen Zhang, Chi-Man Wong, Ganqiang Ye, Bo Wen, Wei zhang, Huajun Chen

As a backbone for online shopping platforms, we built a billion-scale e-commerce product knowledge graph for various item knowledge services such as item recommendation.

Knowledge Graphs

Joint-DetNAS: Upgrade Your Detector with NAS, Pruning and Dynamic Distillation

no code implementations CVPR 2021 Lewei Yao, Renjie Pi, Hang Xu, Wei zhang, Zhenguo Li, Tong Zhang

For student morphism, weight inheritance strategy is adopted, allowing the student to flexibly update its architecture while fully utilize the predecessor's weights, which considerably accelerates the search; To facilitate dynamic distillation, an elastic teacher pool is trained via integrated progressive shrinking strategy, from which teacher detectors can be sampled without additional cost in subsequent searches.

Knowledge Distillation Neural Architecture Search +2

Focus on Local: Detecting Lane Marker from Bottom Up via Key Point

no code implementations CVPR 2021 Zhan Qu, Huan Jin, Yang Zhou, Zhen Yang, Wei zhang

Mainstream lane marker detection methods are implemented by predicting the overall structure and deriving parametric curves through post-processing.

Lane Detection

Model Aided Deep Learning Based MIMO OFDM Receiver With Nonlinear Power Amplifiers

no code implementations30 May 2021 Liangyuan Xu, Feifei Gao, Wei zhang, Shaodan Ma

Multi-input multi-output orthogonal frequency division multiplexing (MIMO OFDM) is a key technology for mobile communication systems.

Joint Channel Estimation and Mixed-ADCs Allocation for Massive MIMO via Deep Learning

no code implementations8 Jun 2021 Liangyuan Xu, Feifei Gao, Ting Zhou, Shaodan Ma, Wei zhang

Instead of randomly assigning the mixed-ADCs, we then design a novel antenna selection network for mixed-ADCs allocation to further improve the channel estimation accuracy.

On Sample Based Explanation Methods for NLP:Efficiency, Faithfulness, and Semantic Evaluation

no code implementations9 Jun 2021 Wei zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui, Fan Zhang

In the recent advances of natural language processing, the scale of the state-of-the-art models and datasets is usually extensive, which challenges the application of sample-based explanation methods in many aspects, such as explanation interpretability, efficiency, and faithfulness.

SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving

no code implementations21 Jun 2021 Jianhua Han, Xiwen Liang, Hang Xu, Kai Chen, Lanqing Hong, Jiageng Mao, Chaoqiang Ye, Wei zhang, Zhenguo Li, Xiaodan Liang, Chunjing Xu

Experiments show that SODA10M can serve as a promising pre-training dataset for different self-supervised learning methods, which gives superior performance when fine-tuning with different downstream tasks (i. e., detection, semantic/instance segmentation) in autonomous driving domain.

Autonomous Driving Instance Segmentation +5

LPSNet: A Lightweight Solution for Fast Panoptic Segmentation

no code implementations CVPR 2021 Weixiang Hong, Qingpei Guo, Wei zhang, Jingdong Chen, Wei Chu

Panoptic segmentation is a challenging task aiming to simultaneously segment objects (things) at instance level and background contents (stuff) at semantic level.

Instance Segmentation Panoptic Segmentation +1

Discrimination-Aware Mechanism for Fine-Grained Representation Learning

no code implementations CVPR 2021 Furong Xu, Meng Wang, Wei zhang, Yuan Cheng, Wei Chu

Therefore, there is a need for a training mechanism that enforces the discriminativeness of all the elements in the feature to capture more the subtle visual cues.

Representation Learning Retrieval

Prior Aided Streaming Network for Multi-task Affective Recognitionat the 2nd ABAW2 Competition

no code implementations8 Jul 2021 Wei zhang, Zunhu Guo, Keyu Chen, Lincheng Li, Zhimeng Zhang, Yu Ding

Automatic affective recognition has been an important research topic in human computer interaction (HCI) area.

Emotion Recognition

Boosting the Convergence of Reinforcement Learning-based Auto-pruning Using Historical Data

no code implementations16 Jul 2021 Jiandong Mu, Mengdi Wang, Feiwen Zhu, Jun Yang, Wei Lin, Wei zhang

Reinforcement learning (RL)-based auto-pruning has been further proposed to automate the DNN pruning process to avoid expensive hand-crafted work.

Neural Network Compression reinforcement-learning +2

Greedy Network Enlarging

1 code implementation31 Jul 2021 Chuanjian Liu, Kai Han, An Xiao, Yiping Deng, Wei zhang, Chunjing Xu, Yunhe Wang

Recent studies on deep convolutional neural networks present a simple paradigm of architecture design, i. e., models with more MACs typically achieve better accuracy, such as EfficientNet and RegNet.

Performance assessment and tuning of PID control using TLBO: the single-loop case and PI/P cascade case

no code implementations31 Jul 2021 Wei zhang, He Dong, Yunlang Xu, Xiaoping Li

Minimum output variance (MOV) is used as a benchmark for CPA of PID, but it is difficult to be found due to the associated non-convex optimization problem.

Stochastic Optimization

Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video

no code implementations9 Aug 2021 Jie Wu, Wei zhang, Guanbin Li, Wenhao Wu, Xiao Tan, YingYing Li, Errui Ding, Liang Lin

In this paper, we introduce a novel task, referred to as Weakly-Supervised Spatio-Temporal Anomaly Detection (WSSTAD) in surveillance video.

Anomaly Detection

Binary Complex Neural Network Acceleration on FPGA

no code implementations10 Aug 2021 Hongwu Peng, Shanglin Zhou, Scott Weitze, Jiaxin Li, Sahidul Islam, Tong Geng, Ang Li, Wei zhang, Minghu Song, Mimi Xie, Hang Liu, Caiwen Ding

Deep complex networks (DCN), in contrast, can learn from complex data, but have high computational costs; therefore, they cannot satisfy the instant decision-making requirements of many deployable systems dealing with short observations or short signal bursts.

Decision Making

G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-guided Feature Imitation

no code implementations ICCV 2021 Lewei Yao, Renjie Pi, Hang Xu, Wei zhang, Zhenguo Li, Tong Zhang

In this paper, we investigate the knowledge distillation (KD) strategy for object detection and propose an effective framework applicable to both homogeneous and heterogeneous student-teacher pairs.

Knowledge Distillation object-detection +1

ARShoe: Real-Time Augmented Reality Shoe Try-on System on Smartphones

no code implementations24 Aug 2021 Shan An, Guangfu Che, Jinghao Guo, Haogang Zhu, Junjie Ye, Fangru Zhou, Zhaoqi Zhu, Dong Wei, Aishan Liu, Wei zhang

To this concern, this work proposes a real-time augmented reality virtual shoe try-on system for smartphones, namely ARShoe.

Pose Estimation Virtual Try-on

4-bit Quantization of LSTM-based Speech Recognition Models

no code implementations27 Aug 2021 Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei zhang, Zoltán Tüske, Kailash Gopalakrishnan

We investigate the impact of aggressive low-precision representations of weights and activations in two families of large LSTM-based architectures for Automatic Speech Recognition (ASR): hybrid Deep Bidirectional LSTM - Hidden Markov Models (DBLSTM-HMMs) and Recurrent Neural Network - Transducers (RNN-Ts).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

How Does Adversarial Fine-Tuning Benefit BERT?

no code implementations31 Aug 2021 Javid Ebrahimi, Hao Yang, Wei zhang

Adversarial training (AT) is one of the most reliable methods for defending against adversarial attacks in machine learning.

Continual Learning Dependency Parsing +3

Deep Person Generation: A Survey from the Perspective of Face, Pose and Cloth Synthesis

no code implementations5 Sep 2021 Tong Sha, Wei zhang, Tong Shen, Zhoujun Li, Tao Mei

Deep person generation has attracted extensive research attention due to its wide applications in virtual agents, video conferencing, online shopping and art/movie production.

Data Augmentation Talking Head Generation

On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation

no code implementations ACL 2021 Wei zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui, Fan Zhang

In the recent advances of natural language processing, the scale of the state-of-the-art models and datasets is usually extensive, which challenges the application of sample-based explanation methods in many aspects, such as explanation interpretability, efficiency, and faithfulness.

Online Multi-horizon Transaction Metric Estimation with Multi-modal Learning in Payment Networks

no code implementations21 Sep 2021 Chin-Chia Michael Yeh, Zhongfang Zhuang, Junpeng Wang, Yan Zheng, Javid Ebrahimi, Ryan Mercer, Liang Wang, Wei zhang

In this work, we study the problem of multivariate time series prediction for estimating transaction metrics associated with entities in the payment transaction database.

Time Series Time Series Prediction

MC$^2$-SF: Slow-Fast Learning for Mobile-Cloud Collaborative Recommendation

no code implementations25 Sep 2021 Zeyuan Chen, Jiangchao Yao, Feng Wang, Kunyang Jia, Bo Han, Wei zhang, Hongxia Yang

With the hardware development of mobile devices, it is possible to build the recommendation models on the mobile side to utilize the fine-grained features and the real-time feedbacks.

C3-SemiSeg: Contrastive Semi-Supervised Segmentation via Cross-Set Learning and Dynamic Class-Balancing

no code implementations ICCV 2021 Yanning Zhou, Hang Xu, Wei zhang, Bin Gao, Pheng-Ann Heng

The semi-supervised semantic segmentation methods utilize the unlabeled data to increase the feature discriminative ability to alleviate the burden of the annotated data.

Contrastive Learning Data Augmentation +1

Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection

no code implementations ICCV 2021 Hanxue Liang, Chenhan Jiang, Dapeng Feng, Xin Chen, Hang Xu, Xiaodan Liang, Wei zhang, Zhenguo Li, Luc van Gool

Here we present a novel self-supervised 3D Object detection framework that seamlessly integrates the geometry-aware contrast and clustering harmonization to lift the unsupervised 3D representation learning, named GCC-3D.

3D Object Detection Clustering +4

Embedding Compression with Hashing for Efficient Representation Learning in Graph

no code implementations29 Sep 2021 Chin-Chia Michael Yeh, Mengting Gu, Yan Zheng, Huiyuan Chen, Javid Ebrahimi, Zhongfang Zhuang, Junpeng Wang, Liang Wang, Wei zhang

When applying such type of networks on graph without node feature, one can extract simple graph-based node features (e. g., number of degrees) or learn the input node representation (i. e., embeddings) when training the network.

Representation Learning

Sensoring and Application of Multimodal Data for the Detection of Freezing of Gait in Parkinson's Disease

no code implementations9 Oct 2021 Wei zhang, Debin Huang, Hantao Li, Lipeng Wang, Yanzhao Wei, Kang Pan, Lin Ma, Huanhuan Feng, Jing Pan, Yuzhu Guo

The accurate and reliable detection or prediction of freezing of gaits (FOG) is important for fall prevention in Parkinson's Disease (PD) and studying the physiological transitions during the occurrence of FOG.

EEG valid

TSK Fuzzy System Towards Few Labeled Incomplete Multi-View Data Classification

no code implementations8 Oct 2021 Wei zhang, Zhaohong Deng, Qiongdan Lou, Te Zhang, Kup-Sze Choi, Shitong Wang

The proposed method has the following distinctive characteristics: 1) it can deal with the incomplete and few labeled multi-view data simultaneously; 2) it integrates the missing view imputation and model learning as a single process, which is more efficient than the traditional two-step strategy; 3) attributed to the interpretable fuzzy inference rules, this method is more interpretable.

Imputation MULTI-VIEW LEARNING +1

Asynchronous Decentralized Distributed Training of Acoustic Models

no code implementations21 Oct 2021 Xiaodong Cui, Wei zhang, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, David Kung

Specifically, we study three variants of asynchronous decentralized parallel SGD (ADPSGD), namely, fixed and randomized communication patterns on a ring as well as a delay-by-one scheme.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Directional Self-supervised Learning for Heavy Image Augmentations

no code implementations CVPR 2022 Yalong Bai, Yifan Yang, Wei zhang, Tao Mei

Specifically, we adapt heavy augmentation policies after the views lightly augmented by standard augmentations, to generate harder view (HV).

Representation Learning Self-Supervised Learning

ViDA-MAN: Visual Dialog with Digital Humans

no code implementations26 Oct 2021 Tong Shen, Jiawei Zuo, Fan Shi, Jin Zhang, Liqin Jiang, Meng Chen, Zhengchen Zhang, Wei zhang, Xiaodong He, Tao Mei

We demonstrate ViDA-MAN, a digital-human agent for multi-modal interaction, which offers realtime audio-visual responses to instant speech inquiries.

speech-recognition Speech Recognition +2

Consensus Graph Learning for Multi-view Clustering

1 code implementation IEEE Transactions on Multimedia 2021 Zhenglai Li, Chang Tang, Xinwang Liu, Xiao Zheng, Guanghui Yue, Wei zhang

Furthermore, we unify the spectral embedding and low rank tensor learning into a unified optimization framework to determine the spectral embedding matrices and tensor representation jointly.

Clustering Graph Learning

CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY

no code implementations25 Sep 2019 Bowen Weng, Huaqing Xiong, Yingbin Liang, Wei zhang

Differently from the popular Deep Q-Network (DQN) learning, Alternating Q-learning (AltQ) does not fully fit a target Q-function at each iteration, and is generally known to be unstable and inefficient.

Atari Games Q-Learning

Optimizing for In-memory Deep Learning with Emerging Memory Technology

no code implementations1 Dec 2021 Zhehui Wang, Tao Luo, Rick Siow Mong Goh, Wei zhang, Weng-Fai Wong

In-memory deep learning has already demonstrated orders of magnitude higher performance density and energy efficiency.

Using Reconfigurable Intelligent Surfaces for UE Positioning in mmWave MIMO Systems

no code implementations1 Dec 2021 Wei zhang, Wee Peng Tay

We develop a RIS-aided positioning framework to locate a UE in environments where the LOS path may or may not be available.

Multi-Domain Transformer-Based Counterfactual Augmentation for Earnings Call Analysis

no code implementations2 Dec 2021 Zixuan Yuan, Yada Zhu, Wei zhang, Ziming Huang, Guangnan Ye, Hui Xiong

Earnings call (EC), as a periodic teleconference of a publicly-traded company, has been extensively studied as an essential market indicator because of its high analytical value in corporate fundamentals.

counterfactual Data Augmentation

Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent

no code implementations2 Dec 2021 Wei zhang, Mingrui Liu, Yu Feng, Xiaodong Cui, Brian Kingsbury, Yuhai Tu

We conduct extensive studies over 18 state-of-the-art DL models/tasks and demonstrate that DPSGD often converges in cases where SSGD diverges for large learning rates in the large batch setting.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization

no code implementations10 Dec 2021 Zhiwei Chen, Changan Wang, Yabiao Wang, Guannan Jiang, Yunhang Shen, Ying Tai, Chengjie Wang, Wei zhang, Liujuan Cao

In this paper, we propose a novel framework built upon the transformer, termed LCTR (Local Continuity TRansformer), which targets at enhancing the local perception capability of global features among long-range feature dependencies.

Inductive Bias Object +1

Data-Free Knowledge Transfer: A Survey

no code implementations31 Dec 2021 Yuang Liu, Wei zhang, Jun Wang, Jianyong Wang

In this paper, we provide a comprehensive survey on data-free knowledge transfer from the perspectives of knowledge distillation and unsupervised domain adaptation, to help readers have a better understanding of the current research status and ideas.

Data-free Knowledge Distillation Model Compression +2

Responsive Listening Head Generation: A Benchmark Dataset and Baseline

no code implementations27 Dec 2021 Mohan Zhou, Yalong Bai, Wei zhang, Ting Yao, Tiejun Zhao, Tao Mei

Automatically synthesizing listening behavior that actively responds to a talking head, is critical to applications such as digital human, virtual agents and social robots.

Talking Head Generation Translation

New volatility evolution model after extreme events

no code implementations10 Jan 2022 Mei-Ling Cai, Zhang-HangJian Chen, Sai-Ping Li, Xiong Xiong, Wei zhang, Ming-Yuan Yang, Fei Ren

Empirical study of the evolutionary behaviors of volatility after endogenous and exogenous events further demonstrates the descriptive power of our new model.

Descriptive

Learning-From-Disagreement: A Model Comparison and Visual Analytics Framework

no code implementations19 Jan 2022 Junpeng Wang, Liang Wang, Yan Zheng, Chin-Chia Michael Yeh, Shubham Jain, Wei zhang

With these metrics, one can easily identify meta-features with the most complementary behaviors in two classifiers, and use them to better ensemble the classifiers.

Binary Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.