Search Results for author: Yu Wang

Found 370 papers, 147 papers with code

Unified Language Model Pre-training for Natural Language Understanding and Generation

9 code implementations NeurIPS 2019 Li Dong, Nan Yang, Wenhui Wang, Furu Wei, Xiaodong Liu, Yu Wang, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon

This paper presents a new Unified pre-trained Language Model (UniLM) that can be fine-tuned for both natural language understanding and generation tasks.

Ranked #2 on Generative Question Answering on CoQA (using extra training data)

Abstractive Text Summarization Document Summarization +7

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

3 code implementations28 Feb 2020 Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Songhao Piao, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon

We propose to pre-train a unified language model for both autoencoding and partially autoregressive language modeling tasks using a novel training procedure, referred to as a pseudo-masked language model (PMLM).

Ranked #4 on Question Generation on SQuAD1.1 (using extra training data)

Abstractive Text Summarization Decoder +4

Pseudo-Masked Language Models for Unified Language Model Pre-Training

1 code implementation ICML 2020 Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Jianfeng Gao, Songhao Piao, Ming Zhou, Hsiao-Wuen Hon

We propose to pre-train a unified language model for both autoencoding and partially autoregressive language modeling tasks using a novel training procedure, referred to as a pseudo-masked language model (PMLM).

Decoder Language Modelling +2

ESNet: An Efficient Symmetric Network for Real-time Semantic Segmentation

2 code implementations24 Jun 2019 Yu Wang, Quan Zhou, Xiaofu Wu

The whole network has nearly symmetric architecture, which is mainly composed of a series of factorized convolution unit (FCU) and its parallel counterparts (PFCU).

Real-Time Semantic Segmentation Segmentation

The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games

15 code implementations2 Mar 2021 Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre Bayen, Yi Wu

This is often due to the belief that PPO is significantly less sample efficient than off-policy methods in multi-agent systems.

Multi-agent Reinforcement Learning reinforcement-learning +3

Adversarial Training for Large Neural Language Models

3 code implementations20 Apr 2020 Xiaodong Liu, Hao Cheng, Pengcheng He, Weizhu Chen, Yu Wang, Hoifung Poon, Jianfeng Gao

In natural language processing (NLP), pre-training large neural language models such as BERT have demonstrated impressive gain in generalization for a variety of tasks, with further improvement from adversarial fine-tuning.

Ranked #6 on Natural Language Inference on ANLI test (using extra training data)

Natural Language Inference Natural Language Understanding

GE-SpMM: General-purpose Sparse Matrix-Matrix Multiplication on GPUs for Graph Neural Networks

2 code implementations7 Jul 2020 Guyue Huang, Guohao Dai, Yu Wang, Huazhong Yang

GE-SpMM performs SpMM-like operation on sparse matrices represented in the most common Compressed Sparse Row (CSR) format, so it can be embedded in GNN frameworks with no preprocessing overheads and support general GNN algorithms.

Distributed, Parallel, and Cluster Computing

CogDL: A Comprehensive Library for Graph Deep Learning

1 code implementation1 Mar 2021 Yukuo Cen, Zhenyu Hou, Yan Wang, Qibin Chen, Yizhen Luo, Zhongming Yu, Hengrui Zhang, Xingcheng Yao, Aohan Zeng, Shiguang Guo, Yuxiao Dong, Yang Yang, Peng Zhang, Guohao Dai, Yu Wang, Chang Zhou, Hongxia Yang, Jie Tang

In CogDL, we propose a unified design for the training and evaluation of GNN models for various graph tasks, making it unique among existing graph learning libraries.

Graph Classification Graph Embedding +5

DNNVM : End-to-End Compiler Leveraging Heterogeneous Optimizations on FPGA-based CNN Accelerators

1 code implementation20 Feb 2019 Yu Xing, Shuang Liang, Lingzhi Sui, Xijie Jia, Jiantao Qiu, Xin Liu, Yushun Wang, Yu Wang, Yi Shan

On the Xilinx ZU2 @330 MHz and ZU9 @330 MHz, we achieve equivalently state-of-the-art performance on our benchmarks by na\"ive implementations without optimizations, and the throughput is further improved up to 1. 26x by leveraging heterogeneous optimizations in DNNVM.

TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs

1 code implementation25 Oct 2023 Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han

On top of this, we design the Sparse Autotuner, which extends the design space of existing sparse convolution libraries and searches for the best dataflow configurations for training and inference workloads.

Autonomous Driving Recommendation Systems

ChemicalX: A Deep Learning Library for Drug Pair Scoring

1 code implementation10 Feb 2022 Benedek Rozemberczki, Charles Tapley Hoyt, Anna Gogleva, Piotr Grabowski, Klas Karis, Andrej Lamov, Andriy Nikolov, Sebastian Nilsson, Michael Ughetto, Yu Wang, Tyler Derr, Benjamin M Gyori

In this paper, we introduce ChemicalX, a PyTorch-based deep learning library designed for providing a range of state of the art models to solve the drug pair scoring task.

BIG-bench Machine Learning

Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator

2 code implementations13 Mar 2024 Yusheng Liao, Yutong Meng, Yuhao Wang, Hongcheng Liu, Yanfeng Wang, Yu Wang

Large Language Models (LLMs) have demonstrated remarkable proficiency in human interactions, yet their application within the medical field remains insufficiently explored.

MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts

2 code implementations13 Apr 2024 Yusheng Liao, Shuyang Jiang, Yu Wang, Yanfeng Wang

Large language models like ChatGPT have shown substantial progress in natural language understanding and generation, proving valuable across various disciplines, including the medical field.

Language Modelling Large Language Model +2

Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets

1 code implementation ECCV 2020 Tong Wu, Qingqiu Huang, Ziwei Liu, Yu Wang, Dahua Lin

We present a new loss function called Distribution-Balanced Loss for the multi-label recognition problems that exhibit long-tailed class distributions.

Binary Classification General Classification +2

Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training

3 code implementations ICLR 2018 Yujun Lin, Song Han, Huizi Mao, Yu Wang, William J. Dally

The situation gets even worse with distributed training on mobile devices (federated learning), which suffers from higher latency, lower throughput, and intermittent poor connections.

Federated Learning Image Classification +3

Evaluating Efficient Performance Estimators of Neural Architectures

1 code implementation NeurIPS 2021 Xuefei Ning, Changcheng Tang, Wenshuo Li, Zixuan Zhou, Shuang Liang, Huazhong Yang, Yu Wang

Conducting efficient performance estimations of neural architectures is a major challenge in neural architecture search (NAS).

Neural Architecture Search

aw_nas: A Modularized and Extensible NAS framework

1 code implementation25 Nov 2020 Xuefei Ning, Changcheng Tang, Wenshuo Li, Songyi Yang, Tianchen Zhao, Niansong Zhang, Tianyi Lu, Shuang Liang, Huazhong Yang, Yu Wang

Neural Architecture Search (NAS) has received extensive attention due to its capability to discover neural network architectures in an automated manner.

Adversarial Robustness Neural Architecture Search

Discovering Robust Convolutional Architecture at Targeted Capacity: A Multi-Shot Approach

1 code implementation22 Dec 2020 Xuefei Ning, Junbo Zhao, Wenshuo Li, Tianchen Zhao, Yin Zheng, Huazhong Yang, Yu Wang

In this paper, considering scenarios with capacity budget, we aim to discover adversarially robust architecture at targeted capacities.

Neural Architecture Search

CLOSE: Curriculum Learning On the Sharing Extent Towards Better One-shot NAS

1 code implementation16 Jul 2022 Zixuan Zhou, Xuefei Ning, Yi Cai, Jiashu Han, Yiping Deng, Yuhan Dong, Huazhong Yang, Yu Wang

Specifically, we train the supernet with a large sharing extent (an easier curriculum) at the beginning and gradually decrease the sharing extent of the supernet (a harder curriculum).

Neural Architecture Search

Curating a COVID-19 data repository and forecasting county-level death counts in the United States

1 code implementation16 May 2020 Nick Altieri, Rebecca L. Barter, James Duncan, Raaz Dwivedi, Karl Kumbier, Xiao Li, Robert Netzorg, Briton Park, Chandan Singh, Yan Shuo Tan, Tiffany Tang, Yu Wang, Chao Zhang, Bin Yu

We use this data to develop predictions and corresponding prediction intervals for the short-term trajectory of COVID-19 cumulative death counts at the county-level in the United States up to two weeks ahead.

COVID-19 Tracking Decision Making +2

Deep Gradient Compression Reduce the Communication Bandwidth For distributed Traning

1 code implementation The International Conference on Learning Representations 2017 Yujun Lin, Song Han, Huizi Mao, Yu Wang, W. Dally

Large-scale distributed training requires significant communication bandwidth for gradient exchange that limits the scalability of multi-node training, and requires expensive high-bandwidth network infrastructure.

Federated Learning Image Classification +3

Microsoft Dialogue Challenge: Building End-to-End Task-Completion Dialogue Systems

2 code implementations29 Jul 2018 Xiujun Li, Yu Wang, Siqi Sun, Sarah Panda, Jingjing Liu, Jianfeng Gao

This proposal introduces a Dialogue Challenge for building end-to-end task-completion dialogue systems, with the goal of encouraging the dialogue research community to collaborate and benchmark on standard datasets and unified experimental environment.

Knowledge Graph Prompting for Multi-Document Question Answering

1 code implementation22 Aug 2023 Yu Wang, Nedim Lipka, Ryan A. Rossi, Alexa Siu, Ruiyi Zhang, Tyler Derr

Concurrently, the graph traversal agent acts as a local navigator that gathers pertinent context to progressively approach the question and guarantee retrieval quality.

graph construction Open-Domain Question Answering +1

AFDet: Anchor Free One Stage 3D Object Detection

6 code implementations23 Jun 2020 Runzhou Ge, Zhuangzhuang Ding, Yihan Hu, Yu Wang, Sijia Chen, Li Huang, Yuan Li

High-efficiency point cloud 3D object detection operated on embedded systems is important for many robotics applications including autonomous driving.

3D Object Detection Autonomous Driving +2

Dual Vision Transformer

1 code implementation11 Jul 2022 Ting Yao, Yehao Li, Yingwei Pan, Yu Wang, Xiao-Ping Zhang, Tao Mei

Dual-ViT is henceforth able to reduce the computational complexity without compromising much accuracy.

Improving Out-of-Distribution Robustness via Selective Augmentation

2 code implementations2 Jan 2022 Huaxiu Yao, Yu Wang, Sai Li, Linjun Zhang, Weixin Liang, James Zou, Chelsea Finn

Machine learning algorithms typically assume that training and test examples are drawn from the same distribution.

Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

1 code implementation28 Jul 2023 Xuefei Ning, Zinan Lin, Zixuan Zhou, Zifu Wang, Huazhong Yang, Yu Wang

This work aims at decreasing the end-to-end generation latency of large language models (LLMs).

OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control

1 code implementation22 Sep 2023 Botian Xu, Feng Gao, Chao Yu, Ruize Zhang, Yi Wu, Yu Wang

In this work, we introduce OmniDrones, an efficient and flexible platform tailored for reinforcement learning in drone control, built on Nvidia's Omniverse Isaac Sim.

reinforcement-learning

Adversarial Robustness under Long-Tailed Distribution

1 code implementation CVPR 2021 Tong Wu, Ziwei Liu, Qingqiu Huang, Yu Wang, Dahua Lin

We then perform a systematic study on existing long-tailed recognition methods in conjunction with the adversarial training framework.

Adversarial Robustness

Large Trajectory Models are Scalable Motion Predictors and Planners

1 code implementation30 Oct 2023 Qiao Sun, Shiduo Zhang, Danjiao Ma, Jingzhe Shi, Derun Li, Simian Luo, Yu Wang, Ningyi Xu, Guangzhi Cao, Hang Zhao

STR reformulates the motion prediction and motion planning problems by arranging observations, states, and actions into one unified sequence modeling task.

Autonomous Driving Language Modelling +2

Learning a Unified Sample Weighting Network for Object Detection

1 code implementation CVPR 2020 Qi Cai, Yingwei Pan, Yu Wang, Jingen Liu, Ting Yao, Tao Mei

To this end, we devise a general loss function to cover most region-based object detectors with various sampling strategies, and then based on it we propose a unified sample weighting network to predict a sample's task weights.

General Classification Object +3

Efficient Context-Aware Network for Abdominal Multi-organ Segmentation

1 code implementation22 Sep 2021 Fan Zhang, Yu Wang, Hua Yang

For the context block, we propose strip pooling module to capture anisotropic and long-range contextual information, which exists in abdominal scene.

Decoder Organ Segmentation

Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation

1 code implementation15 Jun 2023 Ziyang Ma, Zhisheng Zheng, Guanrou Yang, Yu Wang, Chao Zhang, Xie Chen

Our models outperform other SSL models significantly on the LibriSpeech benchmark without the need for iterative re-clustering and re-training.

Automatic Speech Recognition Clustering +4

Learning Self-Supervised Low-Rank Network for Single-Stage Weakly and Semi-Supervised Semantic Segmentation

1 code implementation19 Mar 2022 Junwen Pan, Pengfei Zhu, Kaihua Zhang, Bing Cao, Yu Wang, Dingwen Zhang, Junwei Han, QinGhua Hu

Semantic segmentation with limited annotations, such as weakly supervised semantic segmentation (WSSS) and semi-supervised semantic segmentation (SSSS), is a challenging task that has attracted much attention recently.

Pseudo Label Segmentation +3

Evaluating Quantized Large Language Models

1 code implementation28 Feb 2024 Shiyao Li, Xuefei Ning, Luning Wang, Tengxuan Liu, Xiangsheng Shi, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

Post-training quantization (PTQ) has emerged as a promising technique to reduce the cost of large language models (LLMs).

Quantization

Multi-view Deep Subspace Clustering Networks

2 code implementations6 Aug 2019 Pengfei Zhu, Xinjie Yao, Yu Wang, Binyuan Hui, Dawei Du, QinGhua Hu

Dnet learns view-specific self-representation matrices, whereas Unet learns a common self-representation matrix for all views.

Clustering Model Selection +1

ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi

1 code implementation3 Apr 2021 Yu Wang, Chee Siang Leow, Akio Kobayashi, Takehito Utsuro, Hiromitsu Nishizaki

This paper describes the ExKaldi-RT online automatic speech recognition (ASR) toolkit that is implemented based on the Kaldi ASR toolkit and Python language.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Collaboration-Aware Graph Convolutional Network for Recommender Systems

1 code implementation3 Jul 2022 Yu Wang, Yuying Zhao, Yi Zhang, Tyler Derr

Graph Neural Networks (GNNs) have been successfully adopted in recommender systems by virtue of the message-passing that implicitly captures collaborative effect.

Recommendation Systems

Bidirectional Machine Reading Comprehension for Aspect Sentiment Triplet Extraction

2 code implementations13 Mar 2021 Shaowei Chen, Yu Wang, Jie Liu, Yuelin Wang

Aspect sentiment triplet extraction (ASTE), which aims to identify aspects from review sentences along with their corresponding opinion expressions and sentiments, is an emerging task in fine-grained opinion mining.

Aspect Sentiment Triplet Extraction Machine Reading Comprehension +2

Imbalanced Graph Classification via Graph-of-Graph Neural Networks

2 code implementations1 Dec 2021 Yu Wang, Yuying Zhao, Neil Shah, Tyler Derr

To this end, we introduce a novel framework, Graph-of-Graph Neural Networks (G$^2$GNN), which alleviates the graph imbalance issue by deriving extra supervision globally from neighboring graphs and locally from stochastic augmentations of graphs.

Graph Classification Node Classification

Joint Contrastive Learning with Infinite Possibilities

1 code implementation NeurIPS 2020 Qi Cai, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei

This paper explores useful modifications of the recent development in contrastive learning via novel probabilistic modeling.

Contrastive Learning

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

2 code implementations ICLR 2021 Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu

We propose a simple, general and effective technique, Reward Randomization for discovering diverse strategic policies in complex multi-agent games.

Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis

1 code implementation9 Oct 2023 Haoyu Zhang, Yu Wang, Guanghao Yin, Kejun Liu, Yuanyuan Liu, Tianshu Yu

Though Multimodal Sentiment Analysis (MSA) proves effective by utilizing rich information from multiple sources (e. g., language, video, and audio), the potential sentiment-irrelevant and conflicting information across modalities may hinder the performance from being further improved.

Multimodal Sentiment Analysis

A Bi-model based RNN Semantic Frame Parsing Model for Intent Detection and Slot Filling

1 code implementation NAACL 2018 Yu Wang, Yilin Shen, Hongxia Jin

The most effective algorithms are based on the structures of sequence to sequence models (or "encoder-decoder" models), and generate the intents and semantic tags either using separate models or a joint model.

Decoder +5

Few-Shot Specific Emitter Identification via Deep Metric Ensemble Learning

2 code implementations14 Jul 2022 Yu Wang, Guan Gui, Yun Lin, Hsiao-Chun Wu, Chau Yuen, Fumiyuki Adachi

Thus, we focus on few-shot SEI (FS-SEI) for aircraft identification via automatic dependent surveillance-broadcast (ADS-B) signals, and a novel FS-SEI method is proposed, based on deep metric ensemble learning (DMEL).

Ensemble Learning Metric Learning

Distance-wise Prototypical Graph Neural Network in Node Imbalance Classification

1 code implementation22 Oct 2021 Yu Wang, Charu Aggarwal, Tyler Derr

Recent years have witnessed the significant success of applying graph neural networks (GNNs) in learning effective node representations for classification.

Classification Metric Learning +2

ContrastVAE: Contrastive Variational AutoEncoder for Sequential Recommendation

1 code implementation27 Aug 2022 Yu Wang, Hengrui Zhang, Zhiwei Liu, Liangwei Yang, Philip S. Yu

Then we propose Contrastive Variational AutoEncoder (ContrastVAE in short), a two-branched VAE model with contrastive regularization as an embodiment of ContrastELBO for sequential recommendation.

Contrastive Learning Sequential Recommendation

Learning Concise and Descriptive Attributes for Visual Recognition

1 code implementation ICCV 2023 An Yan, Yu Wang, Yiwu Zhong, chengyu dong, Zexue He, Yujie Lu, William Wang, Jingbo Shang, Julian McAuley

Recent advances in foundation models present new opportunities for interpretable visual recognition -- one can first query Large Language Models (LLMs) to obtain a set of attributes that describe each class, then apply vision-language models to classify images via these attributes.

Descriptive

LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models

1 code implementation20 Aug 2023 Zihan Zhao, Yiyang Jiang, Heyang Liu, Yanfeng Wang, Yu Wang

While Large Language Models (LLMs) have demonstrated commendable performance across a myriad of domains and tasks, existing LLMs still exhibit a palpable deficit in handling multimodal functionalities, especially for the Spoken Question Answering (SQA) task which necessitates precise alignment and deep interaction between speech and text features.

Multiple-choice Question Answering

Synchronous Double-channel Recurrent Network for Aspect-Opinion Pair Extraction

1 code implementation ACL 2020 Shaowei Chen, Jie Liu, Yu Wang, Wenzheng Zhang, Ziming Chi

The opinion entity extraction unit and the relation detection unit are developed as two channels to extract opinion entities and relations simultaneously.

Entity Extraction using GAN Opinion Mining +4

Learning to Adapt to Evolving Domains

1 code implementation NeurIPS 2020 Hong Liu, Mingsheng Long, Jianmin Wang, Yu Wang

(2) Since the target data arrive online, the agent should also maintain competence on previous target domains, i. e. to adapt without forgetting.

Meta-Learning Transfer Learning +1

Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation

1 code implementation ICCV 2023 Yichen Yuan, Yifan Wang, Lijun Wang, Xiaoqi Zhao, Huchuan Lu, Yu Wang, Weibo Su, Lei Zhang

Recent leading zero-shot video object segmentation (ZVOS) works devote to integrating appearance and motion information by elaborately designing feature fusion modules and identically applying them in multiple feature stages.

Semantic Segmentation Video Object Segmentation +2

Improving Fairness in Graph Neural Networks via Mitigating Sensitive Attribute Leakage

1 code implementation7 Jun 2022 Yu Wang, Yuying Zhao, Yushun Dong, Huiyuan Chen, Jundong Li, Tyler Derr

Motivated by our analysis, we propose Fair View Graph Neural Network (FairVGNN) to generate fair views of features by automatically identifying and masking sensitive-correlated features considering correlation variation after feature propagation.

Attribute Fairness +1

PiCO: Peer Review in LLMs based on the Consistency Optimization

1 code implementation2 Feb 2024 Kun-Peng Ning, Shuo Yang, Yu-Yang Liu, Jia-Yu Yao, Zhen-Hui Liu, Yu Wang, Ming Pang, Li Yuan

Existing large language models (LLMs) evaluation methods typically focus on testing the performance on some closed-environment and domain-specific benchmarks with human annotations.

PICO

LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

1 code implementation6 Feb 2024 Tao Yuan, Xuefei Ning, Dong Zhou, Zhijie Yang, Shiyao Li, Minghui Zhuang, Zheyue Tan, Zhuyu Yao, Dahua Lin, Boxun Li, Guohao Dai, Shengen Yan, Yu Wang

In contrast, the average context lengths of mainstream benchmarks are insufficient (5k-21k), and they suffer from potential knowledge leakage and inaccurate metrics, resulting in biased evaluation.

16k

Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning

2 code implementations16 Sep 2019 Alper Kamil Bozkurt, Yu Wang, Michael M. Zavlanos, Miroslav Pajic

We present a reinforcement learning (RL) framework to synthesize a control policy from a given linear temporal logic (LTL) specification in an unknown stochastic environment that can be modeled as a Markov Decision Process (MDP).

Motion Planning reinforcement-learning +1

Efficient Federated Learning on Knowledge Graphs via Privacy-preserving Relation Embedding Aggregation

1 code implementation17 Mar 2022 Kai Zhang, Yu Wang, Hongyi Wang, Lifu Huang, Carl Yang, Xun Chen, Lichao Sun

Furthermore, we propose a Federated learning paradigm with privacy-preserving Relation embedding aggregation (FedR) to tackle the privacy issue in FedE.

Entity Embeddings Federated Learning +4

AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting

1 code implementation14 Mar 2024 Yu Wang, Xiaogeng Liu, Yu Li, Muhao Chen, Chaowei Xiao

However, with the integration of additional modalities, MLLMs are exposed to new vulnerabilities, rendering them prone to structured-based jailbreak attacks, where semantic content (e. g., "harmful text") has been injected into the images to mislead MLLMs.

ParCo: Part-Coordinating Text-to-Motion Synthesis

1 code implementation27 Mar 2024 Qiran Zou, Shangyuan Yuan, Shian Du, Yu Wang, Chang Liu, Yi Xu, Jie Chen, Xiangyang Ji

However, these methods encounter challenges such as the lack of coordination between different part motions and difficulties for networks to understand part concepts.

Motion Synthesis

Face Hallucination via Split-Attention in Split-Attention Network

1 code implementation22 Oct 2020 Tao Lu, Yuanzhi Wang, Yanduo Zhang, Yu Wang, Wei Liu, Zhongyuan Wang, Junjun Jiang

However, most of them fail to take into account the overall facial profile and fine texture details simultaneously, resulting in reduced naturalness and fidelity of the reconstructed face, and further impairing the performance of downstream tasks (e. g., face detection, facial recognition).

Face Detection Face Hallucination +4

DSKReG: Differentiable Sampling on Knowledge Graph for Recommendation with Relational GNN

1 code implementation26 Aug 2021 Yu Wang, Zhiwei Liu, Ziwei Fan, Lichao Sun, Philip S. Yu

In the information explosion era, recommender systems (RSs) are widely studied and applied to discover user-preferred information.

Knowledge Graphs Recommendation Systems

Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning

1 code implementation CVPR 2023 Yu Wang, Pengchong Qiao, Chang Liu, Guoli Song, Xiawu Zheng, Jie Chen

We argue that an overlooked problem of robust SSL is its corrupted information on semantic level, practically limiting the development of the field.

Tree Decomposed Graph Neural Network

1 code implementation25 Aug 2021 Yu Wang, Tyler Derr

Nevertheless, iterative propagation restricts the information of higher-layer neighborhoods to be transported through and fused with the lower-layer neighborhoods', which unavoidably results in feature smoothing between neighborhoods in different layers and can thus compromise the performance, especially on heterophily networks.

Node Classification Tree Decomposition

OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models

1 code implementation15 Jun 2023 Enshu Liu, Xuefei Ning, Zinan Lin, Huazhong Yang, Yu Wang

Diffusion probabilistic models (DPMs) are a new class of generative models that have achieved state-of-the-art generation quality in various domains.

Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems

1 code implementation NeurIPS 2021 Jiayu Chen, Yuanxin Zhang, Yuanfan Xu, Huimin Ma, Huazhong Yang, Jiaming Song, Yu Wang, Yi Wu

We motivate our paradigm through a variational perspective, where the learning objective can be decomposed into two terms: task learning on the current task distribution, and curriculum update to a new task distribution.

Multi-agent Reinforcement Learning

A Survey on Privacy in Graph Neural Networks: Attacks, Preservation, and Applications

1 code implementation31 Aug 2023 Yi Zhang, Yuying Zhao, Zhaoqing Li, Xueqi Cheng, Yu Wang, Olivera Kotevska, Philip S. Yu, Tyler Derr

Despite this progress, there is a lack of a comprehensive overview of the attacks and the techniques for preserving privacy in the graph domain.

Privacy Preserving

Meta-learning with an Adaptive Task Scheduler

2 code implementations NeurIPS 2021 Huaxiu Yao, Yu Wang, Ying WEI, Peilin Zhao, Mehrdad Mahdavi, Defu Lian, Chelsea Finn

In ATS, for the first time, we design a neural scheduler to decide which meta-training tasks to use next by predicting the probability being sampled for each candidate task, and train the scheduler to optimize the generalization capacity of the meta-model to unseen tasks.

Drug Discovery Meta-Learning

Unsupervised Remote Sensing Super-Resolution via Migration Image Prior

1 code implementation8 May 2021 JiaMing Wang, Zhenfeng Shao, Tao Lu, Xiao Huang, Ruiqian Zhang, Yu Wang

Despite their success, however, low/high spatial resolution pairs are usually difficult to obtain in satellites with a high temporal resolution, making such approaches in SR impractical to use.

Generative Adversarial Network Super-Resolution

Contribution of Radio Halos to the Foreground for SKA EoR Experiments

4 code implementations14 May 2019 Weitian Li, Haiguang Xu, Zhixian Ma, Dan Hu, Zhenghao Zhu, Chenxi Shan, Jingying Wang, Junhua Gu, Dongchao Zheng, Xiaoli Lian, Qian Zheng, Yu Wang, Jie Zhu, Xiang-Ping Wu

The overwhelming foreground contamination is one of the primary impediments to probing the EoR through measuring the redshifted 21 cm signal.

Cosmology and Nongalactic Astrophysics

Ultra Lite Convolutional Neural Network for Fast Automatic Modulation Classification in Low-Resource Scenarios

1 code implementation9 Aug 2022 Lantu Guo, Yu Wang, Yun Lin, Haitao Zhao, Guan Gui

Automatic modulation classification (AMC) is a key technique for designing non-cooperative communication systems, and deep learning (DL) is applied effectively to AMC for improving classification accuracy.

Classification Data Augmentation

DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting

1 code implementation16 Aug 2023 Tianyu Fu, Chiyue Wei, Yu Wang, Rex Ying

We introduce DeSCo, a scalable neural deep subgraph counting pipeline, designed to accurately predict both the count and occurrence position of queries on target graphs post single training.

Graph Regression Position +1

Large-scale Personalized Video Game Recommendation via Social-aware Contextualized Graph Neural Network

1 code implementation7 Feb 2022 Liangwei Yang, Zhiwei Liu, Yu Wang, Chen Wang, Ziwei Fan, Philip S. Yu

We conduct a comprehensive analysis of users' online game behaviors, which motivates the necessity of handling those three characteristics in the online game recommendation.

Recommendation Systems

LINFA: a Python library for variational inference with normalizing flow and annealing

1 code implementation10 Jul 2023 Yu Wang, Emma R. Cobian, Jubilee Lee, Fang Liu, Jonathan D. Hauenstein, Daniele E. Schiavazzi

Variational inference is an increasingly popular method in statistics and machine learning for approximating probability distributions.

Variational Inference

A Debiased MDI Feature Importance Measure for Random Forests

3 code implementations NeurIPS 2019 Xiao Li, Yu Wang, Sumanta Basu, Karl Kumbier, Bin Yu

Based on the original definition of MDI by Breiman et al. for a single tree, we derive a tight non-asymptotic bound on the expected bias of MDI importance of noisy features, showing that deep trees have higher (expected) feature selection bias than shallow ones.

Feature Importance feature selection +1

GypSum: Learning Hybrid Representations for Code Summarization

1 code implementation26 Apr 2022 Yu Wang, Yu Dong, Xuesong Lu, Aoying Zhou

Current deep learning models for code summarization generally follow the principle in neural machine translation and adopt the encoder-decoder framework, where the encoder learns the semantic representations from source code and the decoder transforms the learnt representations into human-readable text that describes the functionality of code snippets.

Code Summarization Decoder +4

LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination

1 code implementation23 Dec 2023 Jijia Liu, Chao Yu, Jiaxuan Gao, Yuqing Xie, Qingmin Liao, Yi Wu, Yu Wang

AI agents powered by Large Language Models (LLMs) have made significant advances, enabling them to assist humans in diverse complex tasks and leading to a revolution in human-AI coordination.

Code Generation

Steklov Spectral Geometry for Extrinsic Shape Analysis

1 code implementation21 Jul 2017 Yu Wang, Mirela Ben-Chen, Iosif Polterovich, Justin Solomon

We propose using the Dirichlet-to-Neumann operator as an extrinsic alternative to the Laplacian for spectral geometry processing and shape analysis.

Graphics

SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement

1 code implementation15 Nov 2022 Zhaofan Qiu, Yehao Li, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei

In this paper, we propose a novel deep architecture tailored for 3D point cloud applications, named as SPE-Net.

Exploring Diverse Representations for Open Set Recognition

1 code implementation12 Jan 2024 Yu Wang, Junxian Mu, Pengfei Zhu, QinGhua Hu

We show that the differences in attention maps can lead to diverse representations so that the fused representations can well handle the open space.

Open Set Learning

PepLand: a large-scale pre-trained peptide representation model for a comprehensive landscape of both canonical and non-canonical amino acids

1 code implementation8 Nov 2023 Ruochi Zhang, Haoran Wu, Yuting Xiu, Kewei Li, Ningning Chen, Yu Wang, Yan Wang, Xin Gao, Fengfeng Zhou

In recent years, the scientific community has become increasingly interested on peptides with non-canonical amino acids due to their superior stability and resistance to proteolytic degradation.

Learning a Static Bug Finder from Data

1 code implementation12 Jul 2019 Yu Wang, Fengjuan Gao, Linzhang Wang, Ke Wang

In a cross-project prediction task, three neural bug detectors we instantiate from NeurSA are effective in catching null pointer dereference, array index out of bound and class cast bugs in unseen code.

LVCHAT: Facilitating Long Video Comprehension

1 code implementation19 Feb 2024 Yu Wang, Zeyuan Zhang, Julian McAuley, Zexue He

To address this issue, we propose Long Video Chat (LVChat), where Frame-Scalable Encoding (FSE) is introduced to dynamically adjust the number of embeddings in alignment with the duration of the video to ensure long videos are not overly compressed into a few embeddings.

Video Captioning

DSA: More Efficient Budgeted Pruning via Differentiable Sparsity Allocation

1 code implementation ECCV 2020 Xuefei Ning, Tianchen Zhao, Wenshuo Li, Peng Lei, Yu Wang, Huazhong Yang

In budgeted pruning, how to distribute the resources across layers (i. e., sparsity allocation) is the key problem.

BoolNet: Minimizing The Energy Consumption of Binary Neural Networks

1 code implementation13 Jun 2021 Nianhui Guo, Joseph Bethge, Haojin Yang, Kai Zhong, Xuefei Ning, Christoph Meinel, Yu Wang

Recent works on Binary Neural Networks (BNNs) have made promising progress in narrowing the accuracy gap of BNNs to their 32-bit counterparts.

Out-of-Distribution Detection with Hilbert-Schmidt Independence Optimization

1 code implementation26 Sep 2022 Jingyang Lin, Yu Wang, Qi Cai, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei

Existing works attempt to solve the problem by explicitly imposing uncertainty on classifiers when OOD inputs are exposed to the classifier during training.

Outlier Detection Out-of-Distribution Detection +1

Binding-Adaptive Diffusion Models for Structure-Based Drug Design

1 code implementation15 Jan 2024 Zhilin Huang, Ling Yang, Zaixi Zhang, Xiangxin Zhou, Yu Bao, Xiawu Zheng, Yuwei Yang, Yu Wang, Wenming Yang

Then the selected protein-ligand subcomplex is processed with SE(3)-equivariant neural networks, and transmitted back to each atom of the complex for augmenting the target-aware 3D molecule diffusion generation with binding interaction information.

Avg

On Structural Explanation of Bias in Graph Neural Networks

1 code implementation24 Jun 2022 Yushun Dong, Song Wang, Yu Wang, Tyler Derr, Jundong Li

The low transparency on how the structure of the input network influences the bias in GNN outcome largely limits the safe adoption of GNNs in various decision-critical scenarios.

Decision Making Fairness

Relational Prior Knowledge Graphs for Detection and Instance Segmentation

1 code implementation11 Oct 2023 Osman Ülger, Yu Wang, Ysbrand Galama, Sezer Karaoglu, Theo Gevers, Martin R. Oswald

Humans have a remarkable ability to perceive and reason about the world around them by understanding the relationships between objects.

Instance Segmentation Knowledge Graphs +5

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

1 code implementation2 Apr 2024 Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Sergey Yekhanin, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

For example, LCSC achieves better performance using 1 number of function evaluation (NFE) than the base model with 2 NFE on consistency distillation, and decreases the NFE of DM from 15 to 9 while maintaining the generation quality on CIFAR-10.

Fairness and Explainability: Bridging the Gap Towards Fair Model Explanations

1 code implementation7 Dec 2022 Yuying Zhao, Yu Wang, Tyler Derr

Although research efforts have been devoted to measuring and mitigating bias, they mainly study bias from the result-oriented perspective while neglecting the bias encoded in the decision-making procedure.

Decision Making Fairness

Toward Extremely Lightweight Distracted Driver Recognition With Distillation-Based Neural Architecture Search and Knowledge Transfer

1 code implementation9 Feb 2023 Dichao Liu, Toshihiko Yamasaki, Yu Wang, Kenji Mase, Jien Kato

Experimental results on the Statefarm Distracted Driver Detection Dataset and AUC Distracted Driver Dataset show that the proposed approach is highly effective for recognizing distracted driving behaviors from photos: (1) the teacher network's accuracy surpasses the previous best accuracy; (2) the student network achieves very high accuracy with only 0. 42M parameters (around 55% of the previous most lightweight model).

Knowledge Distillation Neural Architecture Search +1

Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation

1 code implementation CVPR 2023 Tianli Zhang, Mengqi Xue, Jiangtao Zhang, Haofei Zhang, Yu Wang, Lechao Cheng, Jie Song, Mingli Song

Most existing online knowledge distillation(OKD) techniques typically require sophisticated modules to produce diverse knowledge for improving students' generalization ability.

Knowledge Distillation

COLA: Cross-city Mobility Transformer for Human Trajectory Simulation

1 code implementation4 Mar 2024 Yu Wang, Tongya Zheng, Yuxuan Liang, Shunyu Liu, Mingli Song

To address these challenges, we have tailored a Cross-city mObiLity trAnsformer (COLA) with a dedicated model-agnostic transfer framework by effectively transferring cross-city knowledge for human trajectory simulation.

CoLA Transfer Learning

A Topological Perspective on Demystifying GNN-Based Link Prediction Performance

1 code implementation6 Oct 2023 Yu Wang, Tong Zhao, Yuying Zhao, Yunchao Liu, Xueqi Cheng, Neil Shah, Tyler Derr

Despite the widespread belief that low-degree nodes exhibit poorer LP performance, our empirical findings provide nuances to this viewpoint and prompt us to propose a better metric, Topological Concentration (TC), based on the intersection of the local subgraph of each node with the ones of its neighbors.

Link Prediction

Spatial Knowledge-Infused Hierarchical Learning: An Application in Flood Mapping on Earth Imagery

1 code implementation12 Dec 2023 Zelin Xu, Tingsong Xiao, Wenchong He, Yu Wang, Zhe Jiang

The problem is challenging due to the sparse and noisy input labels, spatial uncertainty within the label inference process, and high computational costs associated with a large number of sample locations.

Every Node is Different: Dynamically Fusing Self-Supervised Tasks for Attributed Graph Clustering

1 code implementation12 Jan 2024 Pengfei Zhu, Qian Wang, Yu Wang, Jialu Li, QinGhua Hu

In this paper, we propose to dynamically learn the weights of SSL tasks for different nodes and fuse the embeddings learned from different SSL tasks to boost performance.

Clustering Graph Clustering +1

Disentangled Condensation for Large-scale Graphs

1 code implementation18 Jan 2024 Zhenbang Xiao, Shunyu Liu, Yu Wang, Tongya Zheng, Mingli Song

Graph condensation has emerged as an intriguing technique to provide Graph Neural Networks for large-scale graphs with a more compact yet informative small graph to save the expensive costs of large-scale graph learning.

Graph Learning Link Prediction +1

Multi-Granularity Regularized Re-Balancing for Class Incremental Learning

1 code implementation30 Jun 2022 Huitong Chen, Yu Wang, QinGhua Hu

Re-balancing methods are used to alleviate the influence of data imbalance; however, we empirically discover that they would under-fit new classes.

Class Incremental Learning Incremental Learning

Few-Shot Specific Emitter Identification via Hybrid Data Augmentation and Deep Metric Learning

1 code implementation1 Dec 2022 Cheng Wang, Xue Fu, Yu Wang, Guan Gui, Haris Gacanin, Hikmet Sari, Fumiyuki Adachi

Specific emitter identification (SEI) is a potential physical layer authentication technology, which is one of the most critical complements of upper layer authentication.

Data Augmentation Metric Learning

Dynamic Ensemble of Low-fidelity Experts: Mitigating NAS "Cold-Start"

1 code implementation2 Feb 2023 Junbo Zhao, Xuefei Ning, Enshu Liu, Binxin Ru, Zixuan Zhou, Tianchen Zhao, Chen Chen, Jiajin Zhang, Qingmin Liao, Yu Wang

In the first step, we train different sub-predictors on different types of available low-fidelity information to extract beneficial knowledge as low-fidelity experts.

Neural Architecture Search

Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation

1 code implementation8 Feb 2023 Xinyi Yang, Shiyu Huang, Yiwen Sun, Yuxiang Yang, Chao Yu, Wei-Wei Tu, Huazhong Yang, Yu Wang

Goal-conditioned hierarchical reinforcement learning (HRL) provides a promising direction to tackle this challenge by introducing a hierarchical structure to decompose the search space, where the low-level policy predicts primitive actions in the guidance of the goals derived from the high-level policy.

Hierarchical Reinforcement Learning Multi-agent Reinforcement Learning +2

MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception

1 code implementation15 Jan 2024 Yuhao Wang, Yusheng Liao, Heyang Liu, Hongcheng Liu, Yu Wang, Yanfeng Wang

We believe that these hallucinations are partially due to the models' struggle with understanding what they can and cannot perceive from images, a capability we refer to as self-awareness in perception.

Generalizable control for quantum parameter estimation through reinforcement learning

1 code implementation25 Apr 2019 Han Xu, Junning Li, Liqiang Liu, Yu Wang, Haidong Yuan, Xin Wang

Measurement and estimation of parameters are essential for science and engineering, where one of the main quests is to find systematic schemes that can achieve high precision.

Quantum Physics Mesoscale and Nanoscale Physics

Variational Inference with NoFAS: Normalizing Flow with Adaptive Surrogate for Computationally Expensive Models

1 code implementation28 Aug 2021 Yu Wang, Fang Liu, Daniele E. Schiavazzi

To reduce the computational cost without sacrificing inferential accuracy, we propose Normalizing Flow with Adaptive Surrogate (NoFAS), an optimization strategy that alternatively updates the normalizing flow parameters and surrogate model parameters.

Bayesian Inference Variational Inference

Multiway Ensemble Kalman Filter

1 code implementation8 Dec 2021 Yu Wang, Alfred Hero

In this work, we study the emergence of sparsity and multiway structures in second-order statistical characterizations of dynamical processes governed by partial differential equations (PDEs).

LAR-SR: A Local Autoregressive Model for Image Super-Resolution

1 code implementation CVPR 2022 Baisong Guo, Xiaoyun Zhang, HaoNing Wu, Yu Wang, Ya zhang, Yan-Feng Wang

Previous super-resolution (SR) approaches often formulate SR as a regression problem and pixel wise restoration, which leads to a blurry and unreal SR output.

Image Super-Resolution

Deciphering Compatibility Relationships with Textual Descriptions via Extraction and Explanation

1 code implementation17 Dec 2023 Yu Wang, Zexue He, Zhankui He, Hao Xu, Julian McAuley

This fine-tuning allows the model to generate explanations that convey the compatibility relationships between items.

Hidden Talents of the Variational Autoencoder

1 code implementation16 Jun 2017 Bin Dai, Yu Wang, John Aston, Gang Hua, David Wipf

Variational autoencoders (VAE) represent a popular, flexible form of deep generative model that can be stochastically fit to samples from a given random process using an information-theoretic variational bound on the true underlying distribution.

Dimensionality Reduction

Predicting Solar Flares Using CNN and LSTM on Two Solar Cycles of Active Region Data

1 code implementation7 Apr 2022 Zeyu Sun, Monica G. Bobra, Xiantong Wang, Yu Wang, Hu Sun, Tamas Gombosi, Yang Chen, Alfred Hero

We consider the flare prediction problem that distinguishes flare-imminent active regions that produce an M- or X-class flare in the future 24 hours, from quiet active regions that do not produce any flare within $\pm 24$ hours.

Attribute

Dynamic Sub-graph Distillation for Robust Semi-supervised Continual Learning

1 code implementation27 Dec 2023 Yan Fan, Yu Wang, Pengfei Zhu, QinGhua Hu

In this work, we focus on semi-supervised continual learning (SSCL), where the model progressively learns from partially labeled data with unknown categories.

Continual Learning graph construction +1

Variational Resampling Based Assessment of Deep Neural Networks under Distribution Shift

1 code implementation7 Jun 2019 Xudong Sun, Alexej Gossmann, Yu Wang, Bernd Bischl

A novel variational inference based resampling framework is proposed to evaluate the robustness and generalization capability of deep learning models with respect to distribution shift.

Domain Generalization General Classification +3

The Sylvester Graphical Lasso (SyGlasso)

1 code implementation1 Feb 2020 Yu Wang, Byoungwook Jang, Alfred Hero

We apply the SyGlasso to an electroencephalography (EEG) study to compare the brain connectivity of alcoholic and nonalcoholic subjects.

EEG

Energy-Aware DNN Graph Optimization

1 code implementation12 May 2020 Yu Wang, Rong Ge, Shuang Qiu

Unlike existing work in deep neural network (DNN) graphs optimization for inference performance, we explore DNN graph optimization for energy awareness and savings for power- and resource-constrained machine learning devices.

Learning Dynamic Preference Structure Embedding From Temporal Networks

1 code implementation23 Nov 2021 Tongya Zheng, Zunlei Feng, Yu Wang, Chengchao Shen, Mingli Song, Xingen Wang, Xinyu Wang, Chun Chen, Hao Xu

Our proposed Dynamic Preference Structure (DPS) framework consists of two stages: structure sampling and graph fusion.

Graph Sampling

OpenMix+: Revisiting Data Augmentation for Open Set Recognition

1 code implementation IEEE Transactions on Circuits and Systems for Video Technology 2023 Guosong Jiang, Pengfei Zhu, Yu Wang, QinGhua Hu

In this paper, we point out that balancing between structural risk and open space risk is crucial for open set recognition, and re-formalize it as open set structural risk.

Data Augmentation Open Set Learning

Nonparametric Topic Modeling with Neural Inference

no code implementations18 Jun 2018 Xuefei Ning, Yin Zheng, Zhuxi Jiang, Yu Wang, Huazhong Yang, Junzhou Huang

Moreover, we also propose HiTM-VAE, where the document-specific topic distributions are generated in a hierarchical manner.

Topic Models

A Deeply-Recursive Convolutional Network for Crowd Counting

no code implementations15 May 2018 Xinghao Ding, Zhirui Lin, Fujin He, Yu Wang, Yue Huang

The estimation of crowd count in images has a wide range of applications such as video surveillance, traffic monitoring, public safety and urban planning.

Crowd Counting

Hu-Fu: Hardware and Software Collaborative Attack Framework against Neural Networks

no code implementations14 May 2018 Wenshuo Li, Jincheng Yu, Xuefei Ning, Pengjun Wang, Qi Wei, Yu Wang, Huazhong Yang

So, in this paper, we propose a hardware-software collaborative attack framework to inject hidden neural network Trojans, which works as a back-door without requiring manipulating input images and is flexible for different scenarios.

Autonomous Driving Cloud Computing +6

A Counter-Forensic Method for CNN-Based Camera Model Identification

no code implementations6 May 2018 David Güera, Yu Wang, Luca Bondi, Paolo Bestagini, Stefano Tubaro, Edward J. Delp

We examine in this paper the problem of identifying the camera model or type that was used to take an image and that can be spoofed.

PoTrojan: powerful neural-level trojan designs in deep learning models

no code implementations8 Feb 2018 Minhui Zou, Yang Shi, Chengliang Wang, Fangyu Li, WenZhan Song, Yu Wang

With the popularity of deep learning (DL), artificial intelligence (AI) has been applied in many areas of human life.

Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription

no code implementations1 Feb 2018 Yu Wang, Xie Chen, Mark Gales, Anton Ragni, Jeremy Wong

As the combination approaches become more complicated the difference between the phonetic and graphemic systems further decreases.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Telepath: Understanding Users from a Human Vision Perspective in Large-Scale Recommender Systems

no code implementations1 Sep 2017 Yu Wang, Jixing Xu, Aohan Wu, Mantian Li, Yang He, Jinghe Hu, Weipeng P. Yan

This paper proposes Telepath, a vision-based bionic recommender system model, which understands users from such perspective.

Recommendation Systems

LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

no code implementations18 Aug 2017 Yu Wang, Jiayi Liu, Yuxiang Liu, Jun Hao, Yang He, Jinghe Hu, Weipeng P. Yan, Mantian Li

We present LADDER, the first deep reinforcement learning agent that can successfully learn control policies for large-scale real-world problems directly from raw inputs composed of high-level semantic information.

Future Word Contexts in Neural Network Language Models

no code implementations18 Aug 2017 Xie Chen, Xunying Liu, Anton Ragni, Yu Wang, Mark Gales

Instead of using a recurrent unit to capture the complete future word contexts, a feedforward unit is used to model a finite number of succeeding, future, words.

speech-recognition Speech Recognition

A Deep Learning Approach for Blind Drift Calibration of Sensor Networks

no code implementations16 Jun 2017 Yuzhi Wang, Anqi Yang, Xiaoming Chen, Pengjun Wang, Yu Wang, Huazhong Yang

Temporal drift of sensory data is a severe problem impacting the data quality of wireless sensor networks (WSNs).

Exploring the Regularity of Sparse Structure in Convolutional Neural Networks

no code implementations24 May 2017 Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, William J. Dally

Since memory reference is more than two orders of magnitude more expensive than arithmetic operations, the regularity of sparse structure leads to more efficient hardware design.

Fast Change Point Detection on Dynamic Social Networks

no code implementations20 May 2017 Yu Wang, Aniket Chakrabarti, David Sivakoff, Srinivasan Parthasarathy

In this work we devise an effective and efficient three-step-approach for detecting change points in dynamic networks under the snapshot model.

Change Point Detection Sociology

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

no code implementations1 Dec 2016 Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

Evaluated on the LSTM for speech recognition benchmark, ESE is 43x and 3x faster than Core i7 5930k CPU and Pascal Titan X GPU implementations.

Quantization speech-recognition +1

Do They All Look the Same? Deciphering Chinese, Japanese and Koreans by Fine-Grained Deep Learning

no code implementations6 Oct 2016 Yu Wang, Haofu Liao, Yang Feng, Xiangyang Xu, Jiebo Luo

We find that Chinese, Japanese and Koreans do exhibit substantial differences in certain attributes, such as bangs, smiling, and bushy eyebrows.

Attribute Marketing

Improving the Speed of Response of Learning Algorithms Using Multiple Models

no code implementations16 Oct 2015 Kumpati S. Narendra, Snehasis Mukhopadyhay, Yu Wang

During the past two decades, the first author has worked on numerous methods for improving the stability, robustness, and performance of adaptive systems using multiple models and the other authors have collaborated with him on some of them.

Sparse K-Means with $\ell_{\infty}/\ell_0$ Penalty for High-Dimensional Data Clustering

no code implementations31 Mar 2014 Xiangyu Chang, Yu Wang, Rongjian Li, Zongben Xu

Nevertheless, this framework has two serious drawbacks: One is that the solution of the framework unavoidably involves a considerable portion of redundant noise features in many situations, and the other is that the framework neither offers intuitive explanations on why this framework can select relevant features nor leads to any theoretical guarantee for feature selection consistency.

Clustering feature selection

A Survey of FPGA Based Neural Network Accelerator

no code implementations24 Dec 2017 Kaiyuan Guo, Shulin Zeng, Jincheng Yu, Yu Wang, Huazhong Yang

Various FPGA based accelerator designs have been proposed with software and hardware optimization techniques to achieve high speed and energy efficiency.

Hardware Architecture

User Information Augmented Semantic Frame Parsing using Coarse-to-Fine Neural Networks

no code implementations18 Sep 2018 Yilin Shen, Xiangyu Zeng, Yu Wang, Hongxia Jin

The results show that our approach leverages such simple user information to outperform state-of-the-art approaches by 0. 25% for intent detection and 0. 31% for slot filling using standard training data.

Intent Detection Semantic Frame Parsing +3

Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks

no code implementations30 Oct 2018 Anton Ragni, Qiujia Li, Mark Gales, Yu Wang

These errors are not accounted for by the standard confidence estimation schemes and are hard to rectify in the upstream and downstream processing.

NEURAL MALWARE CONTROL WITH DEEP REINFORCEMENT LEARNING

no code implementations ICLR 2019 Yu Wang, Jack W. Stokes, Mady Marinescu

Antimalware products are a key component in detecting malware attacks, and their engines typically execute unknown programs in a sandbox prior to running them on the native operating system.

reinforcement-learning Reinforcement Learning (RL)

CNNSAT: Fast, Accurate Boolean Satisfiability using Convolutional Neural Networks

no code implementations ICLR 2019 Yu Wang, Fengjuan Gao, Amin Alipour, Linzhang Wang, Xuandong Li, Zhendong Su

Boolean satisfiability (SAT) is one of the most well-known NP-complete problems and has been extensively studied.

A Bayesian Nonparametric Topic Model with Variational Auto-Encoders

no code implementations ICLR 2018 Xuefei Ning, Yin Zheng, Zhuxi Jiang, Yu Wang, Huazhong Yang, Junzhou Huang

On the other hand, different with the other BNP topic models, the inference of iTM-VAE is modeled by neural networks, which has rich representation capacity and can be computed in a simple feed-forward manner.

Representation Learning Retrieval +1

Unique Sharp Local Minimum in $\ell_1$-minimization Complete Dictionary Learning

no code implementations22 Feb 2019 Yu Wang, Siqi Wu, Bin Yu

First, we obtain a necessary and sufficient norm condition for the reference dictionary $D^*$ to be a sharp local minimum of the expected $\ell_1$ objective function.

Dictionary Learning

Transferrable Prototypical Networks for Unsupervised Domain Adaptation

no code implementations CVPR 2019 Yingwei Pan, Ting Yao, Yehao Li, Yu Wang, Chong-Wah Ngo, Tao Mei

Specifically, we present Transferrable Prototypical Networks (TPN) for adaptation such that the prototypes for each class in source and target domains are close in the embedding space and the score distributions predicted by prototypes separately on source and target data are similar.

Pseudo Label Unsupervised Domain Adaptation

New Era of Deeplearning-Based Malware Intrusion Detection: The Malware Detection and Prediction Based On Deep Learning

no code implementations19 Jul 2019 Shuqiang Lu, Lingyun Ying, Wenjie Lin, Yu Wang, Meining Nie, Kaiwen Shen, Lu Liu, Haixin Duan

With the development of artificial intelligence algorithms like deep learning models and the successful applications in many different fields, further similar trails of deep learning technology have been made in cyber security area.

Clustering General Classification +3

Single Training Dimension Selection for Word Embedding with PCA

no code implementations IJCNLP 2019 Yu Wang

In this paper, we present a fast and reliable method based on PCA to select the number of dimensions for word embeddings.

Question Answering Sentiment Analysis +1

Review of Learning-based Longitudinal Motion Planning for Autonomous Vehicles: Research Gaps between Self-driving and Traffic Congestion

no code implementations2 Oct 2019 Hao Zhou, Jorge Laval, Anye Zhou, Yu Wang, Wenchao Wu, Zhu Qing, Srinivas Peeta

Some suggestions towards congestion mitigation for future mMP studies are proposed: i) enrich data collection to facilitate the congestion learning, ii) incorporate non-imitation learning methods to combine traffic efficiency into a safety-oriented technical route, and iii) integrate domain knowledge from the traditional car following (CF) theory to improve the string stability of mMP.

Autonomous Vehicles BIG-bench Machine Learning +3

Application of Pre-training Models in Named Entity Recognition

no code implementations9 Feb 2020 Yu Wang, Yining Sun, Zuchang Ma, Lisheng Gao, Yang Xu, Ting Sun

Then, we apply these pre-training models to a NER task by fine-tuning, and compare the effects of the different model architecture and pre-training tasks on the NER task.

named-entity-recognition Named Entity Recognition +1

FTT-NAS: Discovering Fault-Tolerant Convolutional Neural Architecture

no code implementations20 Mar 2020 Xuefei Ning, Guangjun Ge, Wenshuo Li, Zhenhua Zhu, Yin Zheng, Xiaoming Chen, Zhen Gao, Yu Wang, Huazhong Yang

By inspecting the discovered architectures, we find that the operation primitives, the weight quantization range, the capacity of the model, and the connection pattern have influences on the fault resilience capability of NN models.

Neural Architecture Search Quantization

Enabling Efficient and Flexible FPGA Virtualization for Deep Learning in the Cloud

no code implementations26 Mar 2020 Shulin Zeng, Guohao Dai, Hanbo Sun, Kai Zhong, Guangjun Ge, Kaiyuan Guo, Yu Wang, Huazhong Yang

Currently, the majority of FPGA-based DNN accelerators in the cloud run in a time-division multiplexing way for multiple users sharing a single FPGA, and require re-compilation with $\sim$100 s overhead.

Differentially Private Algorithms for Statistical Verification of Cyber-Physical Systems

no code implementations1 Apr 2020 Yu Wang, Hussein Sibai, Mark Yen, Sayan Mitra, Geir E. Dullerud

We also show that the standard exponential mechanism that randomizes the output of an algorithm to achieve differential privacy fails to do so in the context of sequential algorithms.

Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning

no code implementations1 Apr 2020 Yu Wang, Nima Roohi, Matthew West, Mahesh Viswanathan, Geir E. Dullerud

Probabilistic Computation Tree Logic (PCTL) is frequently used to formally specify control objectives such as probabilistic reachability and safety.

Negation Q-Learning +2

The Ivory Tower Lost: How College Students Respond Differently than the General Public to the COVID-19 Pandemic

no code implementations21 Apr 2020 Viet Duong, Phu Pham, Tongyu Yang, Yu Wang, Jiebo Luo

Recently, the pandemic of the novel Coronavirus Disease-2019 (COVID-19) has presented governments with ultimate challenges.

In the Eyes of the Beholder: Analyzing Social Media Use of Neutral and Controversial Terms for COVID-19

no code implementations21 Apr 2020 Long Chen, Hanjia Lyu, Tongyu Yang, Yu Wang, Jiebo Luo

To model the substantive difference of tweets with controversial terms and those with non-controversial terms, we apply topic modeling and LIWC-based sentiment analysis.

Sentiment Analysis

Communication Lower Bound in Convolution Accelerators

no code implementations8 Nov 2019 Xiaoming Chen, Yinhe Han, Yu Wang

Evaluations based on the 65nm technology demonstrate that the proposed architecture nearly reaches the theoretical minimum communication in a three-level memory hierarchy and it is computation dominant.

Distributed, Parallel, and Cluster Computing Hardware Architecture

A Comprehensive Survey of Grammar Error Correction

no code implementations2 May 2020 Yu Wang, Yuelin Wang, Jie Liu, Zhuo Liu

More importantly, we discuss four kinds of basic approaches, including statistical machine translation based approach, neural machine translation based approach, classification based approach and language model based approach, six commonly applied performance boosting techniques for GEC systems and two data augmentation methods.

Data Augmentation Language Modelling +3

Learning Semantic Program Embeddings with Graph Interval Neural Network

no code implementations18 May 2020 Yu Wang, Fengjuan Gao, Linzhang Wang, Ke Wang

We have also created a neural bug detector based on GINN to catch null pointer deference bugs in Java code.

Method name prediction Variable misuse

Precisely Predicting Acute Kidney Injury with Convolutional Neural Network Based on Electronic Health Record Data

no code implementations27 May 2020 Yu Wang, Junpeng Bao, JianQiang Du, Yongfeng Li

Compared with the existing AKI predictors, the predictor in this work greatly improves the precision of early prediction of AKI by using the Convolutional Neural Network architecture and a more concise input vector.

Exploring the Potential of Low-bit Training of Convolutional Neural Networks

no code implementations4 Jun 2020 Kai Zhong, Xuefei Ning, Guohao Dai, Zhenhua Zhu, Tianchen Zhao, Shulin Zeng, Yu Wang, Huazhong Yang

For training a variety of models on CIFAR-10, using 1-bit mantissa and 2-bit exponent is adequate to keep the accuracy loss within $1\%$.

Quantization

Learning Monotone Dynamics by Neural Networks

no code implementations11 Jun 2020 Yu Wang, Qitong Gao, Miroslav Pajic

For monotonicity constraints, we propose to use nonnegative neural networks and batch normalization.

1st Place Solutions for Waymo Open Dataset Challenges -- 2D and 3D Tracking

no code implementations28 Jun 2020 Yu Wang, Sijia Chen, Li Huang, Runzhou Ge, Yihan Hu, Zhuangzhuang Ding, Jie Liao

This technical report presents the online and real-time 2D and 3D multi-object tracking (MOT) algorithms that reached the 1st places on both Waymo Open Dataset 2D tracking and 3D tracking challenges.

3D Multi-Object Tracking

1st Place Solution for Waymo Open Dataset Challenge -- 3D Detection and Domain Adaptation

no code implementations28 Jun 2020 Zhuangzhuang Ding, Yihan Hu, Runzhou Ge, Li Huang, Sijia Chen, Yu Wang, Jie Liao

We proposed a one-stage, anchor-free and NMS-free 3D point cloud object detector AFDet, using object key-points to encode the 3D attributes, and to learn an end-to-end point cloud object detection without the need of hand-engineering or learning the anchors.

Domain Adaptation Object +2

Monitoring Depression Trend on Twitter during the COVID-19 Pandemic

no code implementations1 Jul 2020 Yi-Peng Zhang, Hanjia Lyu, Yubao Liu, Xiyang Zhang, Yu Wang, Jiebo Luo

The COVID-19 pandemic has severely affected people's daily lives and caused tremendous economic loss worldwide.

Expert Training: Task Hardness Aware Meta-Learning for Few-Shot Classification

no code implementations13 Jul 2020 Yucan Zhou, Yu Wang, Jianfei Cai, Yu Zhou, QinGhua Hu, Weiping Wang

Some works in the optimization of deep neural networks have shown that a better arrangement of training data can make the classifier converge faster and perform better.

General Classification Meta-Learning

Physical Adversarial Attack on Vehicle Detector in the Carla Simulator

no code implementations31 Jul 2020 Tong Wu, Xuefei Ning, Wenshuo Li, Ranran Huang, Huazhong Yang, Yu Wang

In this paper, we tackle the issue of physical adversarial examples for object detectors in the wild.

Adversarial Attack

Shift Equivariance in Object Detection

no code implementations13 Aug 2020 Marco Manfredi, Yu Wang

Robustness to small image translations is a highly desirable property for object detectors.

Object object-detection +1

SONYC-UST-V2: An Urban Sound Tagging Dataset with Spatiotemporal Context

no code implementations11 Sep 2020 Mark Cartwright, Jason Cramer, Ana Elisa Mendez Mendez, Yu Wang, Ho-Hsiang Wu, Vincent Lostanlen, Magdalena Fuentes, Graham Dove, Charlie Mydlarz, Justin Salamon, Oded Nov, Juan Pablo Bello

In this article, we describe our data collection procedure and propose evaluation metrics for multilabel classification of urban sound tags.

A Survey of FPGA-Based Robotic Computing

no code implementations13 Sep 2020 Zishen Wan, Bo Yu, Thomas Yuang Li, Jie Tang, Yuhao Zhu, Yu Wang, Arijit Raychowdhury, Shaoshan Liu

On the other hand, FPGA-based robotic accelerators are becoming increasingly competitive alternatives, especially in latency-critical and power-limited scenarios.

Autonomous Vehicles

Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms

no code implementations1 Jan 2021 Chao Yu, Akash Velu, Eugene Vinitsky, Yu Wang, Alexandre Bayen, Yi Wu

We benchmark commonly used multi-agent deep reinforcement learning (MARL) algorithms on a variety of cooperative multi-agent games.

Benchmarking reinforcement-learning +2

Explore the Potential of CNN Low Bit Training

no code implementations1 Jan 2021 Kai Zhong, Xuefei Ning, Tianchen Zhao, Zhenhua Zhu, Shulin Zeng, Guohao Dai, Yu Wang, Huazhong Yang

Through this dynamic precision framework, we can reduce the bit-width of convolution, which is the most computational cost, while keeping the training process close to the full precision floating-point training.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.