Search Results for author: Jun Wang

Found 530 papers, 169 papers with code

Product-based Neural Networks for User Response Prediction

11 code implementations1 Nov 2016 Yanru Qu, Han Cai, Kan Ren, Wei-Nan Zhang, Yong Yu, Ying Wen, Jun Wang

Predicting user responses, such as clicks and conversions, is of great importance and has found its usage in many Web applications including recommender systems, web search and online advertising.

Click-Through Rate Prediction Recommendation Systems

Deep Learning over Multi-field Categorical Data: A Case Study on User Response Prediction

5 code implementations11 Jan 2016 Wei-Nan Zhang, Tianming Du, Jun Wang

Different from continuous raw features that we usually found in the image and audio domains, the input features in web space are always of multi-field and are mostly discrete and categorical while their dependencies are little known.

Click-Through Rate Prediction

Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation

1 code implementation14 Feb 2022 Aivar Sootla, Alexander I. Cowen-Rivers, Taher Jafferjee, Ziyan Wang, David Mguni, Jun Wang, Haitham Bou-Ammar

Satisfying safety constraints almost surely (or with probability one) can be critical for the deployment of Reinforcement Learning (RL) in real-life applications.

reinforcement-learning Reinforcement Learning (RL) +1

Effects of Safety State Augmentation on Safe Exploration

1 code implementation6 Jun 2022 Aivar Sootla, Alexander I. Cowen-Rivers, Jun Wang, Haitham Bou Ammar

We further show that Simmer can stabilize training and improve the performance of safe RL with average constraints.

Reinforcement Learning (RL) Safe Exploration +1

Pelee: A Real-Time Object Detection System on Mobile Devices

2 code implementations NeurIPS 2018 Jun Wang, Tanner Bohn, Charles Ling

In this study, we propose an efficient architecture named PeleeNet, which is built with conventional convolution instead.

object-detection Real-Time Object Detection

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

23 code implementations18 Sep 2016 Lantao Yu, Wei-Nan Zhang, Jun Wang, Yong Yu

As a new way of training generative models, Generative Adversarial Nets (GAN) that uses a discriminative model to guide the training of the generative model has enjoyed considerable success in generating real-valued data.

Reinforcement Learning (RL) Text Generation

FaceX-Zoo: A PyTorch Toolbox for Face Recognition

2 code implementations12 Jan 2021 Jun Wang, Yinglu Liu, Yibo Hu, Hailin Shi, Tao Mei

For example, the production of face representation network desires a modular training scheme to consider the proper choice from various candidates of state-of-the-art backbone and training supervision subject to the real-world face recognition demand; for performance analysis and comparison, the standard and automatic evaluation with a bunch of models on multiple benchmarks will be a desired tool as well; besides, a public groundwork is welcomed for deploying the face recognition in the shape of holistic pipeline.

Face Recognition

Dive into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition

1 code implementation CVPR 2021 Jiahui She, Yibo Hu, Hailin Shi, Jun Wang, Qiu Shen, Tao Mei

Due to the subjective annotation and the inherent interclass similarity of facial expressions, one of key challenges in Facial Expression Recognition (FER) is the annotation ambiguity.

Facial Expression Recognition Facial Expression Recognition (FER)

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence

3 code implementations2 Dec 2017 Lianmin Zheng, Jiacheng Yang, Han Cai, Wei-Nan Zhang, Jun Wang, Yong Yu

Unlike previous research platforms on single or multi-agent reinforcement learning, MAgent focuses on supporting the tasks and the applications that require hundreds to millions of agents.

Multi-agent Reinforcement Learning reinforcement-learning +1

Privacy-Preserving Face Recognition Using Trainable Feature Subtraction

1 code implementation19 Mar 2024 Yuxi Mi, Zhizhou Zhong, Yuge Huang, Jiazhen Ji, Jianqing Xu, Jun Wang, Shaoming Wang, Shouhong Ding, Shuigeng Zhou

Recognizable identity features within the image are encouraged by co-training a recognition model on its high-dimensional feature representation.

Face Recognition Image Compression +1

CausalVAE: Structured Causal Disentanglement in Variational Autoencoder

2 code implementations CVPR 2021 Mengyue Yang, Furui Liu, Zhitang Chen, Xinwei Shen, Jianye Hao, Jun Wang

Learning disentanglement aims at finding a low dimensional representation which consists of multiple explanatory and generative factors of the observational data.

counterfactual Disentanglement

Ordering-Based Causal Discovery with Reinforcement Learning

1 code implementation14 May 2021 Xiaoqiang Wang, Yali Du, Shengyu Zhu, Liangjun Ke, Zhitang Chen, Jianye Hao, Jun Wang

It is a long-standing question to discover causal relations among a set of variables in many empirical sciences.

Causal Discovery reinforcement-learning +2

Texygen: A Benchmarking Platform for Text Generation Models

1 code implementation6 Feb 2018 Yaoming Zhu, Sidi Lu, Lei Zheng, Jiaxian Guo, Wei-Nan Zhang, Jun Wang, Yong Yu

We introduce Texygen, a benchmarking platform to support research on open-domain text generation models.

Benchmarking Text Generation

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models

3 code implementations30 May 2017 Jun Wang, Lantao Yu, Wei-Nan Zhang, Yu Gong, Yinghui Xu, Benyou Wang, Peng Zhang, Dell Zhang

This paper provides a unified account of two schools of thinking in information retrieval modelling: the generative retrieval focusing on predicting relevant documents given a query, and the discriminative retrieval focusing on predicting relevancy given a query-document pair.

Document Ranking Information Retrieval +2

Long Text Generation via Adversarial Training with Leaked Information

6 code implementations24 Sep 2017 Jiaxian Guo, Sidi Lu, Han Cai, Wei-Nan Zhang, Yong Yu, Jun Wang

Automatically generating coherent and semantically meaningful text has many applications in machine translation, dialogue systems, image captioning, etc.

Sentence Text Generation

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

1 code implementation5 Jun 2021 Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Weinan Zhang, Jun Wang

Our framework is comprised of three key components: (1) a centralized task dispatching model, which supports the self-generated tasks and scalable training with heterogeneous policy combinations; (2) a programming architecture named Actor-Evaluator-Learner, which achieves high parallelism for both training and sampling, and meets the evaluation requirement of auto-curriculum learning; (3) a higher-level abstraction of MARL training paradigms, which enables efficient code reuse and flexible deployments on different distributed computing paradigms.

Atari Games Distributed Computing +3

FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis

2 code implementations21 Apr 2022 Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao

Also, FastDiff enables a sampling speed of 58x faster than real-time on a V100 GPU, making diffusion models practically applicable to speech synthesis deployment for the first time.

Ranked #7 on Text-To-Speech Synthesis on LJSpeech (using extra training data)

Denoising Speech Synthesis +2

A Review of Safe Reinforcement Learning: Methods, Theory and Applications

1 code implementation20 May 2022 Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois Knoll

To establish a good foundation for future research in this thread, in this paper, we provide a review for safe RL from the perspectives of methods, theory and applications.

Autonomous Driving Decision Making +3

SiamCAR: Siamese Fully Convolutional Classification and Regression for Visual Tracking

2 code implementations CVPR 2020 Dongyan Guo, Jun Wang, Ying Cui, Zhenhua Wang, Sheng-Yong Chen

The proposed framework SiamCAR consists of two simple subnetworks: one Siamese subnetwork for feature extraction and one classification-regression subnetwork for bounding box prediction.

Classification General Classification +3

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

1 code implementation30 May 2022 Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang

In this paper, we introduce a novel architecture named Multi-Agent Transformer (MAT) that effectively casts cooperative multi-agent reinforcement learning (MARL) into SM problems wherein the task is to map agents' observation sequence to agents' optimal action sequence.

Decision Making Multi-agent Reinforcement Learning +2

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

1 code implementation ICLR 2022 Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu

We propose a new bilateral denoising diffusion model (BDDM) that parameterizes both the forward and reverse processes with a schedule network and a score network, which can train with a novel bilateral modeling objective.

Image Generation Speech Synthesis

User Behavior Simulation with Large Language Model based Agents

1 code implementation5 Jun 2023 Lei Wang, Jingsen Zhang, Hao Yang, ZhiYuan Chen, Jiakai Tang, Zeyu Zhang, Xu Chen, Yankai Lin, Ruihua Song, Wayne Xin Zhao, Jun Xu, Zhicheng Dou, Jun Wang, Ji-Rong Wen

Simulating high quality user behavior data has always been a fundamental problem in human-centered applications, where the major difficulty originates from the intricate mechanism of human decision process.

Language Modelling Large Language Model +2

Real-Time Bidding by Reinforcement Learning in Display Advertising

1 code implementation10 Jan 2017 Han Cai, Kan Ren, Wei-Nan Zhang, Kleanthis Malialis, Jun Wang, Yong Yu, Defeng Guo

In this paper, we formulate the bid decision process as a reinforcement learning problem, where the state space is represented by the auction information and the campaign's real-time parameters, while an action is the bid price to set.

reinforcement-learning Reinforcement Learning (RL)

Efficient Architecture Search by Network Transformation

3 code implementations16 Jul 2017 Han Cai, Tianyao Chen, Wei-Nan Zhang, Yong Yu, Jun Wang

Techniques for automatically designing deep neural network architectures such as reinforcement learning based approaches have recently shown promising results.

Image Classification Neural Architecture Search +2

SAMBA: Safe Model-Based & Active Reinforcement Learning

1 code implementation12 Jun 2020 Alexander I. Cowen-Rivers, Daniel Palenicek, Vincent Moens, Mohammed Abdullah, Aivar Sootla, Jun Wang, Haitham Ammar

In this paper, we propose SAMBA, a novel framework for safe reinforcement learning that combines aspects from probabilistic modelling, information theory, and statistics.

Reinforcement Learning (RL) Safe Reinforcement Learning

Lung Infection Quantification of COVID-19 in CT Images with Deep Learning

1 code implementation10 Mar 2020 Fei Shan, Yaozong Gao, Jun Wang, Weiya Shi, Nannan Shi, Miaofei Han, Zhong Xue, Dinggang Shen, Yuxin Shi

The performance of the system was evaluated by comparing the automatically segmented infection regions with the manually-delineated ones on 300 chest CT scans of 300 COVID-19 patients.

COVID-19 Image Segmentation Segmentation

Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach

1 code implementation19 Dec 2023 Weiyu Ma, Qirui Mi, Xue Yan, Yuqiao Wu, Runji Lin, Haifeng Zhang, Jun Wang

StarCraft II is a challenging benchmark for AI agents due to the necessity of both precise micro level operations and strategic macro awareness.

Language Modelling Large Language Model +2

Multi-Agent Constrained Policy Optimisation

3 code implementations6 Oct 2021 Shangding Gu, Jakub Grudzien Kuba, Munning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois Knoll, Yaodong Yang

To fill these gaps, in this work, we formulate the safe MARL problem as a constrained Markov game and solve it with policy optimisation methods.

Multi-agent Reinforcement Learning reinforcement-learning +1

Real-Time Bidding Benchmarking with iPinYou Dataset

2 code implementations25 Jul 2014 Wei-Nan Zhang, Shuai Yuan, Jun Wang, Xuehua Shen

This dataset directly supports the experiments of some important research problems such as bid optimisation and CTR estimation.

Computer Science and Game Theory Computers and Society

On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective

1 code implementation24 Dec 2022 Ying Wen, Ziyu Wan, Ming Zhou, Shufang Hou, Zhe Cao, Chenyang Le, Jingxiao Chen, Zheng Tian, Weinan Zhang, Jun Wang

The pervasive uncertainty and dynamic nature of real-world environments present significant challenges for the widespread implementation of machine-driven Intelligent Decision-Making (IDM) systems.

Decision Making Image Captioning +2

How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances

1 code implementation11 Oct 2023 Zihan Zhang, Meng Fang, Ling Chen, Mohammad-Reza Namazi-Rad, Jun Wang

Although large language models (LLMs) are impressive in solving various tasks, they can quickly be outdated after deployment.

World Knowledge

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training

1 code implementation29 Sep 2023 Xidong Feng, Ziyu Wan, Muning Wen, Stephen Marcus McAleer, Ying Wen, Weinan Zhang, Jun Wang

Empirical results across reasoning, planning, alignment, and decision-making tasks show that TS-LLM outperforms existing approaches and can handle trees with a depth of 64.

Decision Making Language Modelling +1

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

3 code implementations18 Dec 2020 Peng Shi, Patrick Ng, Zhiguo Wang, Henghui Zhu, Alexander Hanbo Li, Jun Wang, Cicero Nogueira dos santos, Bing Xiang

Most recently, there has been significant interest in learning contextual representations for various NLP tasks, by leveraging large scale text corpora to train large neural language models with self-supervised learning objectives, such as Masked Language Model (MLM).

Ranked #7 on Text-To-SQL on spider (Exact Match Accuracy (Dev) metric)

Language Modelling Self-Supervised Learning +2

Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks

1 code implementation6 Dec 2021 Linghui Meng, Muning Wen, Yaodong Yang, Chenyang Le, Xiyun Li, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Bo Xu

In this paper, we facilitate the research by providing large-scale datasets, and use them to examine the usage of the Decision Transformer in the context of MARL.

Offline RL reinforcement-learning +4

M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers

1 code implementation24 Apr 2021 Tianrui Guan, Jun Wang, Shiyi Lan, Rohan Chandra, Zuxuan Wu, Larry Davis, Dinesh Manocha

We present a novel architecture for 3D object detection, M3DeTR, which combines different point cloud representations (raw, voxels, bird-eye view) with different feature scales based on multi-scale feature pyramids.

3D Object Detection object-detection +1

An Empirical Study on Google Research Football Multi-agent Scenarios

1 code implementation16 May 2023 Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang

Few multi-agent reinforcement learning (MARL) research on Google Research Football (GRF) focus on the 11v11 multi-agent full-game scenario and to the best of our knowledge, no open benchmark on this scenario has been released to the public.

Benchmarking Multi-agent Reinforcement Learning +1

Attention-aware Multi-stroke Style Transfer

1 code implementation CVPR 2019 Yuan Yao, Jianqiang Ren, Xuansong Xie, Weidong Liu, Yong-Jin Liu, Jun Wang

Neural style transfer has drawn considerable attention from both academic and industrial field.

Style Transfer

ChessGPT: Bridging Policy Learning and Language Modeling

1 code implementation NeurIPS 2023 Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang

Thus, we propose ChessGPT, a GPT model bridging policy learning and language modeling by integrating data from these two sources in Chess games.

Decision Making Language Modelling

HRBF-Fusion: Accurate 3D reconstruction from RGB-D data using on-the-fly implicits

1 code implementation3 Feb 2022 Yabin Xu, Liangliang Nan, Laishui Zhou, Jun Wang, Charlie C. L. Wang

However, due to the discrete nature and limited resolution of their surface representations (e. g., point- or voxel-based), existing approaches suffer from the accumulation of errors in camera tracking and distortion in the reconstruction, which leads to an unsatisfactory 3D reconstruction.

3D Reconstruction

Feedback Control of Real-Time Display Advertising

1 code implementation3 Mar 2016 Wei-Nan Zhang, Yifei Rong, Jun Wang, Tianchi Zhu, Xiaofan Wang

In this paper, we propose a feedback control mechanism for RTB which helps advertisers dynamically adjust the bids to effectively control the KPIs, e. g., the auction winning ratio and the effective cost per click.

Computer Science and Game Theory Systems and Control

Empirical or Invariant Risk Minimization? A Sample Complexity Perspective

3 code implementations ICLR 2021 Kartik Ahuja, Jun Wang, Amit Dhurandhar, Karthikeyan Shanmugam, Kush R. Varshney

Recently, invariant risk minimization (IRM) was proposed as a promising solution to address out-of-distribution (OOD) generalization.

Cross-modal Prototype Driven Network for Radiology Report Generation

1 code implementation11 Jul 2022 Jun Wang, Abhir Bhalerao, Yulan He

Radiology report generation (RRG) aims to describe automatically a radiology image with human-like language and could potentially support the work of radiologists, reducing the burden of manual reporting.

Learning Multi-touch Conversion Attribution with Dual-attention Mechanisms for Online Advertising

1 code implementation11 Aug 2018 Kan Ren, Yuchen Fang, Wei-Nan Zhang, Shuhao Liu, Jiajun Li, Ya zhang, Yong Yu, Jun Wang

To achieve this, we utilize sequence-to-sequence prediction for user clicks, and combine both post-view and post-click attribution patterns together for the final conversion estimation.

PointAttN: You Only Need Attention for Point Cloud Completion

1 code implementation16 Mar 2022 Jun Wang, Ying Cui, Dongyan Guo, Junxia Li, Qingshan Liu, Chunhua Shen

To solve the problems, we leverage the cross-attention and self-attention mechanisms to design novel neural network for processing point cloud in a per-point manner to eliminate kNNs.

Point Cloud Completion

DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning

1 code implementation27 Feb 2024 Siyuan Guo, Cheng Deng, Ying Wen, Hechang Chen, Yi Chang, Jun Wang

In the development stage, DS-Agent follows the CBR framework to structure an automatic iteration pipeline, which can flexibly capitalize on the expert knowledge from Kaggle, and facilitate consistent performance improvement through the feedback mechanism.

Code Generation

A Deep Recurrent Survival Model for Unbiased Ranking

1 code implementation30 Apr 2020 Jiarui Jin, Yuchen Fang, Wei-Nan Zhang, Kan Ren, Guorui Zhou, Jian Xu, Yong Yu, Jun Wang, Xiaoqiang Zhu, Kun Gai

Position bias is a critical problem in information retrieval when dealing with implicit yet biased user feedback data.

Information Retrieval Position +2

Bi-level Actor-Critic for Multi-agent Coordination

1 code implementation8 Sep 2019 Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Wei-Nan Zhang, Jun Wang

Coordination is one of the essential problems in multi-agent systems.

Multiagent Systems

Adaptive Multi-Teacher Multi-level Knowledge Distillation

1 code implementation6 Mar 2021 Yuang Liu, Wei zhang, Jun Wang

Knowledge distillation~(KD) is an effective learning paradigm for improving the performance of lightweight student networks by utilizing additional supervision knowledge distilled from teacher networks.

Knowledge Distillation

Multi-Agent Determinantal Q-Learning

1 code implementation ICML 2020 Yaodong Yang, Ying Wen, Li-Heng Chen, Jun Wang, Kun Shao, David Mguni, Wei-Nan Zhang

Though practical, current methods rely on restrictive assumptions to decompose the centralized value function across agents for execution.

Q-Learning

An effective self-supervised framework for learning expressive molecular global representations to drug discovery

1 code implementation Briefings in Bioinformatics 2021 Pengyong Li, Jun Wang, Yixuan Qiao, Hao Chen, Yihuan Yu, Xiaojun Yao, Peng Gao, Guotong Xie, Sen Song

In MPG, we proposed a powerful GNN for modelling molecular graph named MolGNet, and designed an effective self-supervised strategy for pre-training the model at both the node and graph-level.

Drug Discovery

Unsupervised Generative Modeling Using Matrix Product States

1 code implementation6 Sep 2017 Zhao-Yu Han, Jun Wang, Heng Fan, Lei Wang, Pan Zhang

Generative modeling, which learns joint probability distribution from data and generates samples according to it, is an important task in machine learning and artificial intelligence.

BIG-bench Machine Learning

Semi-Siamese Training for Shallow Face Learning

3 code implementations ECCV 2020 Hang Du, Hailin Shi, Yuchi Liu, Jun Wang, Zhen Lei, Dan Zeng, Tao Mei

Extensive experiments on various benchmarks of face recognition show the proposed method significantly improves the training, not only in shallow face learning, but also for conventional deep face data.

Face Recognition

UNITE: A Unified Benchmark for Text-to-SQL Evaluation

1 code implementation25 May 2023 Wuwei Lan, Zhiguo Wang, Anuj Chauhan, Henghui Zhu, Alexander Li, Jiang Guo, Sheng Zhang, Chung-Wei Hang, Joseph Lilien, Yiqun Hu, Lin Pan, Mingwen Dong, Jun Wang, Jiarong Jiang, Stephen Ash, Vittorio Castelli, Patrick Ng, Bing Xiang

A practical text-to-SQL system should generalize well on a wide variety of natural language questions, unseen database schemas, and novel SQL query structures.

Text-To-SQL

Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search

1 code implementation ECCV 2020 Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink

In this paper, we introduce a new reinforcement learning (RL) based neural architecture search (NAS) methodology for effective and efficient generative adversarial network (GAN) architecture search.

Generative Adversarial Network Image Generation +3

GT U-Net: A U-Net Like Group Transformer Network for Tooth Root Segmentation

1 code implementation30 Sep 2021 Yunxiang Li, Shuai Wang, Jun Wang, Guodong Zeng, Wenjun Liu, Qianni Zhang, Qun Jin, Yaqi Wang

In this paper, we propose a novel end-to-end U-Net like Group Transformer Network (GT U-Net) for the tooth root segmentation.

Anatomy Segmentation

CT-Net: Channel Tensorization Network for Video Classification

1 code implementation ICLR 2021 Kunchang Li, Xianhang Li, Yali Wang, Jun Wang, Yu Qiao

It can learn to exploit spatial, temporal and channel attention in a high-dimensional manner, to improve the cooperative power of all the feature dimensions in our CT-Module.

Action Classification Classification +1

Character-level Convolutional Network for Text Classification Applied to Chinese Corpus

1 code implementation14 Nov 2016 Wei-Jie Huang, Jun Wang

This article provides an interesting exploration of character-level convolutional neural network solving Chinese corpus text classification problem.

General Classification text-classification +1

ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language

2 code implementations ECCV 2020 Zhe Wang, Zhiyuan Fang, Jun Wang, Yezhou Yang

Person search by natural language aims at retrieving a specific person in a large-scale image pool that matches the given textual descriptions.

Attribute Contrastive Learning +2

Scalable Model-based Policy Optimization for Decentralized Networked Systems

2 code implementations13 Jul 2022 Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang

Reinforcement learning algorithms require a large amount of samples; this often limits their real-world applications on even simple tasks.

Reinforcement Learning-based Visual Navigation with Information-Theoretic Regularization

1 code implementation9 Dec 2019 Qiaoyun Wu, Kai Xu, Jun Wang, Mingliang Xu, Dinesh Manocha

The regularization maximizes the mutual information between navigation actions and visual observation transforms of an agent, thus promoting more informed navigation decisions.

Robotics

Settling the Variance of Multi-Agent Policy Gradients

1 code implementation NeurIPS 2021 Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang

In multi-agent RL (MARL), although the PG theorem can be naturally extended, the effectiveness of multi-agent PG (MAPG) methods degrades as the variance of gradient estimates increases rapidly with the number of agents.

Reinforcement Learning (RL) Starcraft

Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech

1 code implementation ACL 2022 Yang Li, Cheng Yu, Guangzhi Sun, Hua Jiang, Fanglei Sun, Weiqin Zu, Ying Wen, Yang Yang, Jun Wang

Modelling prosody variation is critical for synthesizing natural and expressive speech in end-to-end text-to-speech (TTS) systems.

Neural Auto-Curricula

1 code implementation4 Jun 2021 Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen Mcaleer, Ying Wen, Jun Wang, Yaodong Yang

When solving two-player zero-sum games, multi-agent reinforcement learning (MARL) algorithms often create populations of agents where, at each iteration, a new agent is discovered as the best response to a mixture over the opponent population.

Multi-agent Reinforcement Learning

Neural Auto-Curricula in Two-Player Zero-Sum Games

1 code implementation NeurIPS 2021 Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen Mcaleer, Ying Wen, Jun Wang, Yaodong Yang

When solving two-player zero-sum games, multi-agent reinforcement learning (MARL) algorithms often create populations of agents where, at each iteration, a new agent is discovered as the best response to a mixture over the opponent population.

Multi-agent Reinforcement Learning Vocal Bursts Valence Prediction

A Regularized Opponent Model with Maximum Entropy Objective

1 code implementation17 May 2019 Zheng Tian, Ying Wen, Zhichen Gong, Faiz Punakkath, Shihao Zou, Jun Wang

In a single-agent setting, reinforcement learning (RL) tasks can be cast into an inference problem by introducing a binary random variable o, which stands for the "optimality".

Multi-agent Reinforcement Learning reinforcement-learning +1

DistilPose: Tokenized Pose Regression with Heatmap Distillation

1 code implementation CVPR 2023 Suhang Ye, Yingyi Zhang, Jie Hu, Liujuan Cao, Shengchuan Zhang, Lei Shen, Jun Wang, Shouhong Ding, Rongrong Ji

Specifically, DistilPose maximizes the transfer of knowledge from the teacher model (heatmap-based) to the student model (regression-based) through Token-distilling Encoder (TDE) and Simulated Heatmaps.

Knowledge Distillation Pose Estimation +1

DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases

1 code implementation30 Sep 2022 Donghan Yu, Sheng Zhang, Patrick Ng, Henghui Zhu, Alexander Hanbo Li, Jun Wang, Yiqun Hu, William Wang, Zhiguo Wang, Bing Xiang

Question answering over knowledge bases (KBs) aims to answer natural language questions with factual information such as entities and relations in KBs.

Entity Linking Question Answering +2

UTOPIC: Uncertainty-aware Overlap Prediction Network for Partial Point Cloud Registration

1 code implementation4 Aug 2022 Zhilei Chen, Honghua Chen, Lina Gong, Xuefeng Yan, Jun Wang, Yanwen Guo, Jing Qin, Mingqiang Wei

High-confidence overlap prediction and accurate correspondences are critical for cutting-edge models to align paired point clouds in a partial-to-partial manner.

Point Cloud Registration

TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation

1 code implementation3 Aug 2022 Jun Wang, Mingfei Gao, Yuqian Hu, Ramprasaath R. Selvaraju, Chetan Ramaiah, ran Xu, Joseph F. JaJa, Larry S. Davis

To address this deficiency, we develop a new method to generate high-quality and diverse QA pairs by explicitly utilizing the existing rich text available in the scene context of each image.

Answer Generation Question-Answer-Generation +3

Multi-Agent Interactions Modeling with Correlated Policies

1 code implementation ICLR 2020 Minghuan Liu, Ming Zhou, Wei-Nan Zhang, Yuzheng Zhuang, Jun Wang, Wulong Liu, Yong Yu

In this paper, we cast the multi-agent interactions modeling problem into a multi-agent imitation learning framework with explicit modeling of correlated policies by approximating opponents' policies, which can recover agents' policies that can regenerate similar interactions.

Imitation Learning

IH-GAN: A Conditional Generative Model for Implicit Surface-Based Inverse Design of Cellular Structures

1 code implementation3 Mar 2021 Jun Wang, Wei Wayne Chen, Daicong Da, Mark Fuge, Rahul Rai

Results show that our method can 1) generate various unit cells that satisfy given material properties with high accuracy ($R^2$-scores between target properties and properties of generated unit cells $>98\%$) and 2) improve the optimized structural performance over the conventional variable-density single-type structure.

Generative Adversarial Network

Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation

2 code implementations1 Mar 2021 Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu

One of the leading single-channel speech separation (SS) models is based on a TasNet with a dual-path segmentation technique, where the size of each segment remains unchanged throughout all layers.

Computational Efficiency Speech Separation

Modelling Behavioural Diversity for Learning in Open-Ended Games

3 code implementations14 Mar 2021 Nicolas Perez Nieves, Yaodong Yang, Oliver Slumbers, David Henry Mguni, Ying Wen, Jun Wang

Promoting behavioural diversity is critical for solving games with non-transitive dynamics where strategic cycles exist, and there is no consistent winner (e. g., Rock-Paper-Scissors).

Point Processes

Learning to Model Opponent Learning

1 code implementation6 Jun 2020 Ian Davies, Zheng Tian, Jun Wang

In this work, we develop a novel approach to modelling an opponent's learning dynamics which we term Learning to Model Opponent Learning (LeMOL).

Decision Making Multi-agent Reinforcement Learning

Learning the Implicit Semantic Representation on Graph-Structured Data

1 code implementation16 Jan 2021 Likang Wu, Zhi Li, Hongke Zhao, Qi Liu, Jun Wang, Mengdi Zhang, Enhong Chen

Existing representation learning methods in graph convolutional networks are mainly designed by describing the neighborhood of each node as a perceptual whole, while the implicit semantic associations behind highly complex interactions of graphs are largely unexploited.

Representation Learning

Neural Variational Inference For Estimating Uncertainty in Knowledge Graph Embeddings

1 code implementation12 Jun 2019 Alexander I. Cowen-Rivers, Pasquale Minervini, Tim Rocktaschel, Matko Bosnjak, Sebastian Riedel, Jun Wang

Recent advances in Neural Variational Inference allowed for a renaissance in latent variable models in a variety of domains involving high-dimensional data.

Knowledge Graph Embeddings Knowledge Graphs +2

Optimizing Vision Transformers for Medical Image Segmentation

1 code implementation14 Oct 2022 Qianying Liu, Chaitanya Kaul, Jun Wang, Christos Anagnostopoulos, Roderick Murray-Smith, Fani Deligianni

For medical image semantic segmentation (MISS), Vision Transformers have emerged as strong alternatives to convolutional neural networks thanks to their inherent ability to capture long-range correlations.

Domain Adaptation Image Segmentation +2

Multi-Agent Trust Region Learning

1 code implementation1 Jan 2021 Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang

We derive the lower bound of agents' payoff improvements for MATRL methods, and also prove the convergence of our method on the meta-game fixed points.

Atari Games Multi-agent Reinforcement Learning +3

A Game-Theoretic Approach to Multi-Agent Trust Region Optimization

1 code implementation12 Jun 2021 Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang

Trust region methods are widely applied in single-agent reinforcement learning problems due to their monotonic performance-improvement guarantee at every iteration.

Atari Games Multi-agent Reinforcement Learning +2

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models

1 code implementation8 Oct 2023 Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai

This paper introduces a distributed, GPU-centric experience replay system, GEAR, designed to perform scalable reinforcement learning (RL) with large sequence models (such as transformers).

Reinforcement Learning (RL)

Token-level Direct Preference Optimization

1 code implementation18 Apr 2024 Yongcheng Zeng, Guoqing Liu, Weiyu Ma, Ning Yang, Haifeng Zhang, Jun Wang

Fine-tuning pre-trained Large Language Models (LLMs) is essential to align them with human values and intentions.

Multi-View Reinforcement Learning

1 code implementation NeurIPS 2019 Minne Li, Lisheng Wu, Haitham Bou Ammar, Jun Wang

This paper is concerned with multi-view reinforcement learning (MVRL), which allows for decision making when agents share common dynamics but adhere to different observation models.

Decision Making reinforcement-learning +1

Multi-modal Summarization for Video-containing Documents

1 code implementation17 Sep 2020 Xiyan Fu, Jun Wang, Zhenglu Yang

Summarization of multimedia data becomes increasingly significant as it is the basis for many real-world applications, such as question answering, Web search, and so forth.

Question Answering Video Summarization

Boosting Semi-Supervised Face Recognition with Noise Robustness

1 code implementation10 May 2021 Yuchi Liu, Hailin Shi, Hang Du, Rui Zhu, Jun Wang, Liang Zheng, Tao Mei

This paper presents an effective solution to semi-supervised face recognition that is robust to the label noise aroused by the auto-labelling.

Face Recognition

Cross Attention-guided Dense Network for Images Fusion

1 code implementation23 Sep 2021 Zhengwen Shen, Jun Wang, Zaiyu Pan, Yulian Li, Jiangyu Wang

In this paper, we propose a novel cross-attention-guided image fusion network, which is a unified and unsupervised framework for multi-modal image fusion, multi-exposure image fusion, and multi-focus image fusion.

Multi-Exposure Image Fusion

FR: Folded Rationalization with a Unified Encoder

1 code implementation17 Sep 2022 Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Chao Yue, Yuankai Zhang

Conventional works generally employ a two-phase model in which a generator selects the most important pieces, followed by a predictor that makes predictions based on the selected pieces.

Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning

1 code implementation NeurIPS 2018 Rui Luo, Jianhong Wang, Yaodong Yang, Zhanxing Zhu, Jun Wang

We propose a new sampling method, the thermostat-assisted continuously-tempered Hamiltonian Monte Carlo, for Bayesian learning on large datasets and multimodal distributions.

An Overview of Multi-Agent Reinforcement Learning from Game Theoretical Perspective

1 code implementation1 Nov 2020 Yaodong Yang, Jun Wang

In this work, we provide a monograph on MARL that covers both the fundamentals and the latest developments in the research frontier.

Multi-agent Reinforcement Learning reinforcement-learning +1

Online Double Oracle

1 code implementation13 Mar 2021 Le Cong Dinh, Yaodong Yang, Stephen Mcaleer, Zheng Tian, Nicolas Perez Nieves, Oliver Slumbers, David Henry Mguni, Haitham Bou Ammar, Jun Wang

Solving strategic games with huge action space is a critical yet under-explored topic in economics, operations research and artificial intelligence.

An Adversarial Imitation Click Model for Information Retrieval

1 code implementation13 Apr 2021 Xinyi Dai, Jianghao Lin, Weinan Zhang, Shuai Li, Weiwen Liu, Ruiming Tang, Xiuqiang He, Jianye Hao, Jun Wang, Yong Yu

Modern information retrieval systems, including web search, ads placement, and recommender systems, typically rely on learning from user feedback.

Imitation Learning Information Retrieval +2

MGR: Multi-generator Based Rationalization

1 code implementation8 May 2023 Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Xinyang Li, Yuankai Zhang, Yang Qiu

Rationalization is to employ a generator and a predictor to construct a self-explaining NLP model in which the generator selects a subset of human-intelligible pieces of the input text to the following predictor.

Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting

1 code implementation7 Oct 2016 Jun Wang, Wei-Nan Zhang, Shuai Yuan

The most significant progress in recent years in online display advertising is what is known as the Real-Time Bidding (RTB) mechanism to buy and sell ads.

Computer Science and Game Theory

NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected Observations

1 code implementation17 Jun 2019 Qiaoyun Wu, Dinesh Manocha, Jun Wang, Kai Xu

First, the latent distribution is conditioned on current observations and the target view, leading to a model-based, target-driven navigation.

Visual Navigation

Mask Guided Attention For Fine-Grained Patchy Image Classification

2 code implementations4 Feb 2021 Jun Wang, Xiaohan Yu, Yongsheng Gao

Specifically, the proposed MGA integrates a pre-trained semantic segmentation model that produces auxiliary supervision signal, i. e., patchy attention mask, enabling a discriminative representation learning.

Classification General Classification +3

AA-Forecast: Anomaly-Aware Forecast for Extreme Events

1 code implementation21 Aug 2022 Ashkan Farhangi, Jiang Bian, Arthur Huang, Haoyi Xiong, Jun Wang, Zhishan Guo

Moreover, the framework employs a dynamic uncertainty optimization algorithm that reduces the uncertainty of forecasts in an online manner.

Anomaly Forecasting Management +3

XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations

1 code implementation7 Jun 2023 Yusen Zhang, Jun Wang, Zhiguo Wang, Rui Zhang

However, existing CLSP models are separately proposed and evaluated on datasets of limited tasks and applications, impeding a comprehensive and unified evaluation of CLSP on a diverse range of NLs and MRs. To this end, we present XSemPLR, a unified benchmark for cross-lingual semantic parsing featured with 22 natural languages and 8 meaning representations by examining and selecting 9 existing datasets to cover 5 tasks and 164 domains.

Cross-Lingual Transfer Semantic Parsing +2

FusionU-Net: U-Net with Enhanced Skip Connection for Pathology Image Segmentation

1 code implementation17 Oct 2023 Zongyi Li, Hongbing Lyu, Jun Wang

One of the key designs of U-Net is the use of skip connections between the encoder and decoder, which helps to recover detailed information after upsampling.

Image Segmentation Semantic Segmentation

$H_\infty$ Model-free Reinforcement Learning with Robust Stability Guarantee

1 code implementation7 Nov 2019 Minghao Han, Yuan Tian, Lixian Zhang, Jun Wang, Wei Pan

In this paper, we introduce and extend the idea of robust stability and $H_\infty$ control to design policies with both stability and robustness guarantee.

Autonomous Driving reinforcement-learning +2

Multiscale Attention Guided Network for COVID-19 Diagnosis Using Chest X-ray Images

1 code implementation11 Nov 2020 Jingxiong Li, Yaqi Wang, Shuai Wang, Jun Wang, Jun Liu, Qun Jin, Lingling Sun

Moreover, massive data collection is impractical for a newly emerged disease, which limited the performance of data thirsty deep learning models.

Classification COVID-19 Diagnosis +1

Measuring Correlation-to-Causation Exaggeration in Press Releases

1 code implementation COLING 2020 Bei Yu, Jun Wang, Lu Guo, Yingya Li

By comparing the claims made in a press release with the corresponding claims in the original research paper, we found that 22{\%} of press releases made exaggerated causal claims from correlational findings in observational studies.

Learning to Identify Top Elo Ratings: A Dueling Bandits Approach

1 code implementation12 Jan 2022 Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen

The Elo rating system is widely adopted to evaluate the skills of (chess) game and sports players.

Scheduling

PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination

1 code implementation16 Jan 2023 Xingzhou Lou, Jiaxian Guo, Junge Zhang, Jun Wang, Kaiqi Huang, Yali Du

We conduct experiments on the Overcooked environment, and evaluate the zero-shot human-AI coordination performance of our method with both behavior-cloned human proxies and real humans.

Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks

2 code implementations13 Jan 2021 Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu

Recent research on the time-domain audio separation networks (TasNets) has brought great success to speech separation.

Speech Separation

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization

2 code implementations4 Mar 2022 Minghuan Liu, Zhengbang Zhu, Yuzheng Zhuang, Weinan Zhang, Jianye Hao, Yong Yu, Jun Wang

Recent progress in state-only imitation learning extends the scope of applicability of imitation learning to real-world settings by relieving the need for observing expert actions.

Imitation Learning Transfer Learning

Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds

1 code implementation23 Mar 2022 Haoran Zhou, Honghua Chen, Yingkui Zhang, Mingqiang Wei, Haoran Xie, Jun Wang, Tong Lu, Jing Qin, Xiao-Ping Zhang

Differently, our network is designed to refine the initial normal of each point by extracting additional information from multiple feature representations.

Decoupled Rationalization with Asymmetric Learning Rates: A Flexible Lipschitz Restraint

1 code implementation23 May 2023 Wei Liu, Jun Wang, Haozhao Wang, Ruixuan Li, Yang Qiu, Yuankai Zhang, Jie Han, Yixiong Zou

However, such a cooperative game may incur the degeneration problem where the predictor overfits to the uninformative pieces generated by a not yet well-trained generator and in turn, leads the generator to converge to a sub-optimal model that tends to select senseless pieces.

Invariant Learning via Probability of Sufficient and Necessary Causes

1 code implementation NeurIPS 2023 Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang

To capture the information of sufficient and necessary causes, we employ a classical concept, the probability of sufficiency and necessary causes (PNS), which indicates the probability of whether one is the necessary and sufficient cause.

A simple normalization technique using window statistics to improve the out-of-distribution generalization on medical images

1 code implementation7 Jul 2022 Chengfeng Zhou, Songchang Chen, Chenming Xu, Jun Wang, Feng Liu, Chun Zhang, Juan Ye, Hefeng Huang, Dahong Qian

In this study, we present a novel normalization technique called window normalization (WIN) to improve the model generalization on heterogeneous medical images, which is a simple yet effective alternative to existing normalization methods.

Breast Cancer Detection Out-of-Distribution Generalization

Matrix Recovery with Implicitly Low-Rank Data

1 code implementation9 Nov 2018 Xingyu Xie, Jianlong Wu, Guangcan Liu, Jun Wang

To tackle this issue, we propose a novel method for matrix recovery in this paper, which could well handle the case where the target matrix is low-rank in an implicit feature space but high-rank or even full-rank in its original form.

Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation and Diagnosis for COVID-19

1 code implementation6 Apr 2020 Feng Shi, Jun Wang, Jun Shi, Ziyan Wu, Qian Wang, Zhenyu Tang, Kelei He, Yinghuan Shi, Dinggang Shen

In this review paper, we thus cover the entire pipeline of medical imaging and analysis techniques involved with COVID-19, including image acquisition, segmentation, diagnosis, and follow-up.

Computed Tomography (CT)

EAR-NET: Error Attention Refining Network For Retinal Vessel Segmentation

1 code implementation3 Jul 2021 Jun Wang, Yang Zhao, Linglong Qian, Xiaohan Yu, Yongsheng Gao

The precise detection of blood vessels in retinal images is crucial to the early diagnosis of the retinal vascular diseases, e. g., diabetic, hypertensive and solar retinopathies.

Retinal Vessel Segmentation Segmentation +1

MLVSNet: Multi-Level Voting Siamese Network for 3D Visual Tracking

1 code implementation ICCV 2021 Zhoutao Wang, Qian Xie, Yu-Kun Lai, Jing Wu, Kun Long, Jun Wang

To deal with sparsity in outdoor 3D point clouds, we propose to perform Hough voting on multi-level features to get more vote centers and retain more useful information, instead of voting only on the final level feature as in previous methods.

3D Object Detection object-detection +1

Large Language Models Are Neurosymbolic Reasoners

1 code implementation17 Jan 2024 Meng Fang, Shilong Deng, Yudi Zhang, Zijing Shi, Ling Chen, Mykola Pechenizkiy, Jun Wang

A wide range of real-world applications is characterized by their symbolic nature, necessitating a strong capability for symbolic reasoning.

Common Sense Reasoning Math +2

SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation

1 code implementation4 Apr 2024 Sichen Chen, Yingyi Zhang, Siming Huang, Ran Yi, Ke Fan, Ruixin Zhang, Peixian Chen, Jun Wang, Shouhong Ding, Lizhuang Ma

To mitigate the problem of under-fitting, we design a transformer module named Multi-Cycled Transformer(MCT) based on multiple-cycled forwards to more fully exploit the potential of small model parameters.

Edge-computing Pose Estimation

Scalable Quantum Tomography with Fidelity Estimation

1 code implementation8 Dec 2017 Jun Wang, Zhao-Yu Han, Song-Bo Wang, Zeyang Li, Liang-Zhu Mu, Heng Fan, Lei Wang

We propose a quantum tomography scheme for pure qudit systems which adopts random base measurements and generative learning methods, along with a built-in fidelity estimation approach to assess the reliability of the tomographic states.

Quantum Physics

Learning State Representations via Retracing in Reinforcement Learning

1 code implementation ICLR 2022 Changmin Yu, Dong Li, Jianye Hao, Jun Wang, Neil Burgess

We propose learning via retracing, a novel self-supervised approach for learning the state representation (and the associated dynamics model) for reinforcement learning tasks.

Continuous Control Model-based Reinforcement Learning +3

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning

1 code implementation31 Dec 2021 Xidong Feng, Bo Liu, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang

Gradient-based Meta-RL (GMRL) refers to methods that maintain two-level optimisation procedures wherein the outer-loop meta-learner guides the inner-loop gradient-based reinforcement learner to achieve fast adaptations.

Atari Games Meta Reinforcement Learning +3

Lane Change Classification and Prediction with Action Recognition Networks

1 code implementation24 Aug 2022 Kai Liang, Jun Wang, Abhir Bhalerao

Previous works often adopt physical variables such as driving speed, acceleration and so forth for lane change classification.

Action Recognition Autonomous Driving +2

MODNet: Multi-offset Point Cloud Denoising Network Customized for Multi-scale Patches

1 code implementation30 Aug 2022 Anyi Huang, Qian Xie, Zhoutao Wang, Dening Lu, Mingqiang Wei, Jun Wang

Second, a multi-scale perception module is designed to embed multi-scale geometric information for each scale feature and regress multi-scale weights to guide a multi-offset denoising displacement.

Denoising

CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation

1 code implementation2 Nov 2022 Jun Wang, Abhir Bhalerao, Terry Yin, Simon See, Yulan He

Radiology report generation (RRG) has gained increasing research attention because of its huge potential to mitigate medical resource shortages and aid the process of disease decision making by radiologists.

Decision Making

D-Separation for Causal Self-Explanation

1 code implementation NeurIPS 2023 Wei Liu, Jun Wang, Haozhao Wang, Ruixuan Li, Zhiying Deng, Yuankai Zhang, Yang Qiu

Instead of attempting to rectify the issues of the MMI criterion, we propose a novel criterion to uncover the causal rationale, termed the Minimum Conditional Dependence (MCD) criterion, which is grounded on our finding that the non-causal features and the target label are \emph{d-separated} by the causal rationale.

Entropy-Regularized Token-Level Policy Optimization for Large Language Models

1 code implementation9 Feb 2024 Muning Wen, Cheng Deng, Jun Wang, Weinan Zhang, Ying Wen

At the heart of ETPO is our novel per-token soft Bellman update, designed to harmonize the RL process with the principles of language modeling.

Code Generation Decision Making +3

Feature Concatenation Multi-view Subspace Clustering

1 code implementation30 Jan 2019 Qinghai Zheng, Jihua Zhu, Zhongyu Li, Shanmin Pang, Jun Wang, Yaochen Li

To this end, this paper proposes a novel multi-view subspace clustering approach dubbed Feature Concatenation Multi-view Subspace Clustering (FCMSC), which boosts the clustering performance by exploring the consensus information of multi-view data.

Clustering Multi-view Subspace Clustering

DRD-Net: Detail-recovery Image Deraining via Context Aggregation Networks

1 code implementation27 Aug 2019 Sen Deng, Mingqiang Wei, Jun Wang, Luming Liang, Haoran Xie, Meng Wang

We have validated our approach on four recognized datasets (three synthetic and one real-world).

Rain Removal

Detecting Health Advice in Medical Research Literature

1 code implementation EMNLP 2021 Yingya Li, Jun Wang, Bei Yu

We also conducted a case study that applied this prediction model to retrieve specific health advice on COVID-19 treatments from LitCovid, a large COVID research literature portal, demonstrating the usefulness of retrieving health advice sentences as an advanced research literature navigation function for health researchers and the general public.

Retrieval Sentence

Obtaining Dyadic Fairness by Optimal Transport

1 code implementation9 Feb 2022 Moyi Yang, Junjie Sheng, Xiangfeng Wang, Wenyan Liu, Bo Jin, Jun Wang, Hongyuan Zha

Fairness has been taken as a critical metric in machine learning models, which is considered as an important component of trustworthy machine learning.

Fairness Link Prediction

Single-cell Multi-view Clustering via Community Detection with Unknown Number of Clusters

1 code implementation28 Nov 2023 Dayu Hu, Zhibin Dong, Ke Liang, Jun Wang, Siwei Wang, Xinwang Liu

To this end, we introduce scUNC, an innovative multi-view clustering approach tailored for single-cell data, which seamlessly integrates information from different views without the need for a predefined number of clusters.

Clustering Community Detection

Self Promotion in US Congressional Tweets

1 code implementation NAACL 2021 Jun Wang, Kelly Cui, Bei Yu

Prior studies have found that women self-promote less than men due to gender stereotypes.

Multi-Agent Feedback Enabled Neural Networks for Intelligent Communications

1 code implementation22 May 2022 Fanglei Sun, Yang Li, Ying Wen, Jingchen Hu, Jun Wang, Yang Yang, Kai Li

The design of MAFENN framework and algorithm are dedicated to enhance the learning capability of the feedfoward DL networks or their variations with the simple data feedback.

Denoising Intelligent Communication

Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation

1 code implementation19 May 2023 Xuanli He, Qiongkai Xu, Jun Wang, Benjamin Rubinstein, Trevor Cohn

Modern NLP models are often trained over large untrusted datasets, raising the potential for a malicious adversary to compromise model behaviour.

Specify Robust Causal Representation from Mixed Observations

1 code implementation21 Oct 2023 Mengyue Yang, Xinyu Cai, Furui Liu, Weinan Zhang, Jun Wang

Under the hypothesis that the intrinsic latent factors follow some casual generative models, we argue that by learning a causal representation, which is the minimal sufficient causes of the whole system, we can improve the robustness and generalization performance of machine learning models.

Understanding Adversarial Robustness from Feature Maps of Convolutional Layers

1 code implementation25 Feb 2022 Cong Xu, Wei zhang, Jun Wang, Min Yang

Our theoretical analysis discovers that larger convolutional feature maps before average pooling can contribute to better resistance to perturbations, but the conclusion is not true for max pooling.

Adversarial Robustness

M2N: Mesh Movement Networks for PDE Solvers

1 code implementation24 Apr 2022 Wenbin Song, Mingrui Zhang, Joseph G. Wallwork, Junpeng Gao, Zheng Tian, Fanglei Sun, Matthew D. Piggott, Junqing Chen, Zuoqiang Shi, Xiang Chen, Jun Wang

However, mesh movement methods, such as the Monge-Ampere method, require the solution of auxiliary equations, which can be extremely expensive especially when the mesh is adapted frequently.

Graph Attention

Dual Representation Learning for One-Step Clustering of Multi-View Data

1 code implementation30 Aug 2022 Wei zhang, Zhaohong Deng, Kup-Sze Choi, Jun Wang, Shitong Wang

Meanwhile, to make the representation learning more specific to the clustering task, a one-step learning framework is proposed to integrate representation learning and clustering partition as a whole.

Clustering Representation Learning

Which country is this picture from? New data and methods for DNN-based country recognition

1 code implementation2 Sep 2022 Omran Alamayreh, Giovanna Maria Dimitri, Jun Wang, Benedetta Tondi, Mauro Barni

Notably, we found that asking the network to identify the country provides better results than estimating the geo-coordinates and then tracing them back to the country where the picture was taken.

IMBERT: Making BERT Immune to Insertion-based Backdoor Attacks

1 code implementation25 May 2023 Xuanli He, Jun Wang, Benjamin Rubinstein, Trevor Cohn

Backdoor attacks are an insidious security threat against machine learning models.

Enhanced Latent Multi-view Subspace Clustering

1 code implementation22 Dec 2023 Long Shi, Lei Cao, Jun Wang, Badong Chen

Specifically, we stack the data matrices from various views into the block-diagonal locations of the augmented matrix to exploit the complementary information.

Clustering Multi-view Subspace Clustering

Detecting Adversarial Examples via Key-based Network

no code implementations2 Jun 2018 Pinlong Zhao, Zhouyu Fu, Ou wu, QinGhua Hu, Jun Wang

In contrast to existing defense methods, the proposed method does not require knowledge of the process for generating adversarial examples and can be applied to defend against different types of attacks.

Learning to Design Games: Strategic Environments in Reinforcement Learning

no code implementations5 Jul 2017 Haifeng Zhang, Jun Wang, Zhiming Zhou, Wei-Nan Zhang, Ying Wen, Yong Yu, Wenxin Li

In typical reinforcement learning (RL), the environment is assumed given and the goal of the learning is to identify an optimal policy for the agent taking actions through its interactions with the environment.

reinforcement-learning Reinforcement Learning (RL)

A Study of AI Population Dynamics with Million-agent Reinforcement Learning

no code implementations13 Sep 2017 Yaodong Yang, Lantao Yu, Yiwei Bai, Jun Wang, Wei-Nan Zhang, Ying Wen, Yong Yu

We conduct an empirical study on discovering the ordered collective dynamics obtained by a population of intelligence agents, driven by million-agent reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Multi-view Registration Based on Weighted Low Rank and Sparse Matrix Decomposition of Motions

no code implementations25 Sep 2017 Congcong Jin, Jihua Zhu, Yaochen Li, Shanmin Pang, Lei Chen, Jun Wang

Then, it proposes the weighted LRS decomposition, where each block element is assigned with one estimated weight to denote its reliability.

Neural Text Generation: Past, Present and Beyond

no code implementations15 Mar 2018 Sidi Lu, Yaoming Zhu, Wei-Nan Zhang, Jun Wang, Yong Yu

This paper presents a systematic survey on recent development of neural text generation models.

Benchmarking reinforcement-learning +2

Bidding Machine: Learning to Bid for Directly Optimizing Profits in Display Advertising

no code implementations1 Mar 2018 Kan Ren, Wei-Nan Zhang, Ke Chang, Yifei Rong, Yong Yu, Jun Wang

From the learning perspective, we show that the bidding machine can be updated smoothly with both offline periodical batch or online sequential training schemes.

BIG-bench Machine Learning

Exponential Discriminative Metric Embedding in Deep Learning

no code implementations7 Mar 2018 Bowen Wu, Zhangling Chen, Jun Wang, Huaming Wu

With the remarkable success achieved by the Convolutional Neural Networks (CNNs) in object recognition recently, deep learning is being widely used in the computer vision community.

Face Recognition Face Verification +3

Inception Score, Label Smoothing, Gradient Vanishing and -log(D(x)) Alternative

no code implementations5 Aug 2017 Zhiming Zhou, Wei-Nan Zhang, Jun Wang

In this article, we mathematically study several GAN related topics, including Inception score, label smoothing, gradient vanishing and the -log(D(x)) alternative.

Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks

no code implementations25 Feb 2015 Yu-Gang Jiang, Zuxuan Wu, Jun Wang, xiangyang xue, Shih-Fu Chang

In this paper, we study the challenging problem of categorizing videos according to high-level semantics such as the existence of a particular human action or a complex event.

tau-FPL: Tolerance-Constrained Learning in Linear Time

no code implementations15 Jan 2018 Ao Zhang, Nan Li, Jian Pu, Jun Wang, Junchi Yan, Hongyuan Zha

Learning a classifier with control on the false-positive rate plays a critical role in many machine learning applications.

Learning Continuous User Representations through Hybrid Filtering with doc2vec

no code implementations31 Dec 2017 Simon Stiebellehner, Jun Wang, Shuai Yuan

In order to maximize the predictive performance of our look-alike modeling algorithms, we propose two novel hybrid filtering techniques that utilize the recent neural probabilistic language model algorithm doc2vec.

Language Modelling

Happiness Pursuit: Personality Learning in a Society of Agents

no code implementations29 Nov 2017 Rafał Muszyński, Jun Wang

We find that the agents that achieve higher happiness during testing against hand-coded AI, have lower happiness when competing against each other.

A Neural Stochastic Volatility Model

no code implementations30 Nov 2017 Rui Luo, Wei-Nan Zhang, Xiaojun Xu, Jun Wang

In this paper, we show that the recent integration of statistical models with deep recurrent neural networks provides a new way of formulating volatility (the degree of variation of time series) models that have been widely used in time series analysis and prediction in finance.

Time Series Time Series Analysis

Set-to-Set Hashing with Applications in Visual Recognition

no code implementations2 Nov 2017 I-Hong Jhuo, Jun Wang

In this paper, we consider the fundamental problem of finding a nearest set from a collection of sets, to a query set.

Retrieval

Effective scaling registration approach by imposing the emphasis on the scale factor

no code implementations28 Apr 2017 Minmin Xu, Siyu Xu, Jihua Zhu, Yaochen Li, Jun Wang, Huimin Lu

This paper proposes an effective approach for the scaling registration of $m$-D point sets.

Learning text representation using recurrent convolutional neural network with highway layers

no code implementations22 Jun 2016 Ying Wen, Wei-Nan Zhang, Rui Luo, Jun Wang

Recently, the rapid development of word embedding and neural networks has brought new inspiration to various NLP and IR tasks.

Sentiment Analysis

A Survey on Soft Subspace Clustering

no code implementations19 Sep 2014 Zhaohong Deng, Kup-Sze Choi, Yizhang Jiang, Jun Wang, Shitong Wang

Subspace clustering (SC) is a promising clustering technology to identify clusters based on their associations with subspaces in high dimensional spaces.

Clustering

Feature Selection as a Multiagent Coordination Problem

no code implementations16 Mar 2016 Kleanthis Malialis, Jun Wang, Gary Brooks, George Frangou

In this paper, we formulate feature selection as a multiagent coordination problem and propose a novel feature selection method using multiagent reinforcement learning.

feature selection reinforcement-learning +1

Implicit Look-alike Modelling in Display Ads: Transfer Collaborative Filtering to CTR Estimation

no code implementations11 Jan 2016 Wei-Nan Zhang, Lingxi Chen, Jun Wang

In this work, we propose a general framework which learns the user profiles based on their online browsing behaviour, and transfers the learned knowledge onto prediction of their ad response.

Collaborative Filtering Transfer Learning

Factorizing LambdaMART for cold start recommendations

no code implementations4 Nov 2015 Phong Nguyen, Jun Wang, Alexandros Kalousis

Motivated by the fact that very often the users' and items' descriptions as well as the preference behavior can be well summarized by a small number of hidden factors, we propose a novel algorithm, LambdaMART Matrix Factorization (LambdaMART-MF), that learns a low rank latent representation of users and items using gradient boosted trees.

Learning-To-Rank Matrix Completion +1

Learning to Hash for Indexing Big Data - A Survey

no code implementations17 Sep 2015 Jun Wang, Wei Liu, Sanjiv Kumar, Shih-Fu Chang

Such learning to hash methods exploit information such as data distributions or class labels when optimizing the hash codes or functions.

Deep Attributes from Context-Aware Regional Neural Codes

no code implementations8 Sep 2015 Jianwei Luo, Jianguo Li, Jun Wang, Zhiguo Jiang, Yurong Chen

Results show that deep attribute approaches achieve state-of-the-art results, and outperforms existing peer methods with a significant margin, even though some benchmarks have little overlap of concepts with the pre-trained CNN models.

Attribute General Classification +2

Two-Stage Metric Learning

no code implementations12 May 2014 Jun Wang, Ke Sun, Fei Sha, Stephane Marchand-Maillet, Alexandros Kalousis

This induces in the input data space a new family of distance metric with unique properties.

Metric Learning Vocal Bursts Valence Prediction

Question Answering Against Very-Large Text Collections

no code implementations26 Apr 2013 Leon Derczynski, Richard Shaw, Ben Solway, Jun Wang

Question answering involves developing methods to extract useful information from large collections of documents.

Information Retrieval Question Answering +1

A dynamic pricing model for unifying programmatic guarantee and real-time bidding in display advertising

no code implementations20 May 2014 Bo-Wei Chen, Shuai Yuan, Jun Wang

From the experiments we find that, in a less competitive market, lower prices of the guaranteed contracts will encourage the purchase in advance and the revenue gain is mainly contributed by the increased competition in future RTB.

Computer Science and Game Theory

Learning Adaptive Display Exposure for Real-Time Advertising

no code implementations10 Sep 2018 Weixun Wang, Junqi Jin, Jianye Hao, Chunjie Chen, Chuan Yu, Wei-Nan Zhang, Jun Wang, Xiaotian Hao, Yixi Wang, Han Li, Jian Xu, Kun Gai

In this paper, we investigate the problem of advertising with adaptive exposure: can we dynamically determine the number and positions of ads for each user visit under certain business constraints so that the platform revenue can be increased?

Learning to Communicate Implicitly By Actions

no code implementations10 Oct 2018 Zheng Tian, Shihao Zou, Ian Davies, Tim Warr, Lisheng Wu, Haitham Bou Ammar, Jun Wang

The auxiliary reward for communication is integrated into the learning of the policy module.

Learning Shared Dynamics with Meta-World Models

no code implementations5 Nov 2018 Lisheng Wu, Minne Li, Jun Wang

Humans have consciousness as the ability to perceive events and objects: a mental model of the world developed from the most impoverished of visual stimuli, enabling humans to make rapid decisions and take actions.

Atari Games Multi-Task Learning

Layout Design for Intelligent Warehouse by Evolution with Fitness Approximation

no code implementations14 Nov 2018 Haifeng Zhang, Zilong Guo, Han Cai, Chris Wang, Wei-Nan Zhang, Yong Yu, Wenxin Li, Jun Wang

With the rapid growth of the express industry, intelligent warehouses that employ autonomous robots for carrying parcels have been widely used to handle the vast express volume.

Layout Design

Space-Time Local Embeddings

no code implementations NeurIPS 2015 Ke Sun, Jun Wang, Alexandros Kalousis, Stephane Marchand-Maillet

We give theoretical propositions to show that space-time is a more powerful representation than Euclidean space.

Dimensionality Reduction

Parametric Local Metric Learning for Nearest Neighbor Classification

no code implementations NeurIPS 2012 Jun Wang, Alexandros Kalousis, Adam Woznica

We present a new parametric local metric learning method in which we learn a smooth metric matrix function over the data manifold.

Classification General Classification +1

Transfer Representation Learning with TSK Fuzzy System

no code implementations9 Jan 2019 Peng Xu, Zhaohong Deng, Jun Wang, Qun Zhang, Shitong Wang

A core issue in transfer learning is to learn a shared feature space in where the distributions of the data from two domains are matched.

Dimensionality Reduction Representation Learning +1

Predicting the Mumble of Wireless Channel with Sequence-to-Sequence Models

no code implementations14 Jan 2019 Yourui Huangfu, Jian Wang, Rong Li, Chen Xu, Xianbin Wang, Huazi Zhang, Jun Wang

Accurate prediction of fading channel in future is essential to realize adaptive transmission and other methods that can save power and provide gains.

Caption Generation Language Modelling +5

Cannot find the paper you are looking for? You can Submit a new open access paper.