Search Results for author: Zhe Xu

Found 74 papers, 14 papers with code

Dynamic Demonstration Retrieval and Cognitive Understanding for Emotional Support Conversation

no code implementations3 Apr 2024 Zhe Xu, Daoyuan Chen, Jiayi Kuang, Zihao Yi, Yaliang Li, Ying Shen

Emotional Support Conversation (ESC) systems are pivotal in providing empathetic interactions, aiding users through negative emotional states by understanding and addressing their unique experiences.

Empathetic Response Generation In-Context Learning +2

Distributed Robust Learning based Formation Control of Mobile Robots based on Bioinspired Neural Dynamics

no code implementations23 Mar 2024 Zhe Xu, Tao Yan, Simon X. Yang, S. Andrew Gadsden, Mohammad Biglarbegian

This paper addresses the challenges of distributed formation control in multiple mobile robots, introducing a novel approach that enhances real-world practicability.

Diversified and Personalized Multi-rater Medical Image Segmentation

1 code implementation20 Mar 2024 Yicheng Wu, Xiangde Luo, Zhe Xu, Xiaoqing Guo, Lie Ju, ZongYuan Ge, Wenjun Liao, Jianfei Cai

To address it, the common practice is to gather multiple annotations from different experts, leading to the setting of multi-rater medical image segmentation.

Image Segmentation Medical Image Segmentation +2

A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems

no code implementations28 Feb 2024 Zihao Yi, Jiarui Ouyang, YuWen Liu, Tianhao Liao, Zhe Xu, Ying Shen

This survey provides a comprehensive review of research on multi-turn dialogue systems, with a particular focus on multi-turn dialogue systems based on large language models (LLMs).

Using Large Language Models to Automate and Expedite Reinforcement Learning with Reward Machine

no code implementations11 Feb 2024 Shayan Meshkat Alsadat, Jean-Raphael Gaglione, Daniel Neider, Ufuk Topcu, Zhe Xu

Our method uses Large Language Models (LLM) to obtain high-level domain-specific knowledge using prompt engineering instead of providing the reinforcement learning algorithm directly with the high-level knowledge which requires an expert to encode the automaton.

Language Modelling Large Language Model +2

Do You Guys Want to Dance: Zero-Shot Compositional Human Dance Generation with Multiple Persons

no code implementations24 Jan 2024 Zhe Xu, Kun Wei, Xu Yang, Cheng Deng

Human dance generation (HDG) aims to synthesize realistic videos from images and sequences of driving poses.

Pontryagin Neural Operator for Solving Parametric General-Sum Differential Games

no code implementations3 Jan 2024 Lei Zhang, Mukesh Ghimire, Zhe Xu, Wenlong Zhang, Yi Ren

To address these challenges, we propose in this paper a Pontryagin-mode neural operator that outperforms existing state-of-the-art (SOTA) on safety performance across games with parametric state constraints.

Semi-supervised Semantic Segmentation Meets Masked Modeling:Fine-grained Locality Learning Matters in Consistency Regularization

no code implementations14 Dec 2023 Wentao Pan, Zhe Xu, Jiangpeng Yan, Zihan Wu, Raymond Kai-yu Tong, Xiu Li, Jianhua Yao

Semi-supervised semantic segmentation aims to utilize limited labeled images and abundant unlabeled images to achieve label-efficient learning, wherein the weak-to-strong consistency regularization framework, popularized by FixMatch, is widely used as a benchmark scheme.

Image Classification Pseudo Label +2

Invariant Graph Transformer

no code implementations13 Dec 2023 Zhe Xu, Menghai Pan, Yuzhong Chen, Huiyuan Chen, Yuchen Yan, Mahashweta Das, Hanghang Tong

Based on the self-attention module, our proposed invariant graph Transformer (IGT) can achieve fine-grained, more specifically, node-level and virtual node-level intervention.

Value Approximation for Two-Player General-Sum Differential Games with State Constraints

1 code implementation28 Nov 2023 Lei Zhang, Mukesh Ghimire, Wenlong Zhang, Zhe Xu, Yi Ren

Solving Hamilton-Jacobi-Isaacs (HJI) PDEs numerically enables equilibrial feedback control in two-player differential games, yet faces the curse of dimensionality (CoD).

Physics-informed machine learning

Topological Augmentation for Class-Imbalanced Node Classification

no code implementations27 Aug 2023 Zhining Liu, Zhichen Zeng, Ruizhong Qiu, Hyunsik Yoo, David Zhou, Zhe Xu, Yada Zhu, Kommy Weldemariam, Jingrui He, Hanghang Tong

Class imbalance is prevalent in real-world node classification tasks and often biases graph learning models toward majority classes.

Classification Graph Learning +1

Distributed Robust Learning-Based Backstepping Control Aided with Neurodynamics for Consensus Formation Tracking of Underwater Vessels

no code implementations18 Aug 2023 Tao Yan, Zhe Xu, Simon X. Yang

This paper addresses distributed robust learning-based control for consensus formation tracking of multiple underwater vessels, in which the system parameters of the marine vessels are assumed to be entirely unknown and subject to the modeling mismatch, oceanic disturbances, and noises.

Distributed Neurodynamics-Based Backstepping Optimal Control for Robust Constrained Consensus of Underactuated Underwater Vehicles Fleet

no code implementations18 Aug 2023 Tao Yan, Zhe Xu, Simon X. Yang, S. Andrew Gadsden

Robust constrained formation tracking control of underactuated underwater vehicles (UUVs) fleet in three-dimensional space is a challenging but practical problem.

You've Got Two Teachers: Co-evolutionary Image and Report Distillation for Semi-supervised Anatomical Abnormality Detection in Chest X-ray

no code implementations18 Jul 2023 Jinghan Sun, Dong Wei, Zhe Xu, Donghuan Lu, Hong Liu, Liansheng Wang, Yefeng Zheng

Inversely, we also use the prediction of the vision detection model for abnormality-guided pseudo classification label refinement (APCLR) in the auxiliary report classification task, and propose a co-evolution strategy where the vision and report models mutually promote each other with RPDLR and APCLR performed alternatively.

Anomaly Detection Pseudo Label

Data-Driven Model Discrimination of Switched Nonlinear Systems with Temporal Logic Inference

no code implementations16 Jun 2023 Zeyuan Jin, Nasim Baharisangari, Zhe Xu, Sze Zheng Yong

To tackle this problem, we propose data-driven methods to over-approximate the unknown dynamics and to infer the unknown specifications such that both set-membership models of the unknown dynamics and LTL formulas are guaranteed to include the ground truth model and specification/task.

Computational Efficiency

Reinforcement Learning With Reward Machines in Stochastic Games

no code implementations27 May 2023 Jueming Hu, Jean-Raphael Gaglione, Yanze Wang, Zhe Xu, Ufuk Topcu, Yongming Liu

We develop an algorithm called Q-learning with reward machines for stochastic games (QRM-SG), to learn the best-response strategy at Nash equilibrium for each agent.

Multi-agent Reinforcement Learning Q-Learning +1

Distributed Leader Follower Formation Control of Mobile Robots based on Bioinspired Neural Dynamics and Adaptive Sliding Innovation Filter

no code implementations3 May 2023 Zhe Xu, Tao Yan, Simon X. Yang, S. Andrew Gadsden

This paper investigated the distributed leader follower formation control problem for multiple differentially driven mobile robots.

CTT-Net: A Multi-view Cross-token Transformer for Cataract Postoperative Visual Acuity Prediction

1 code implementation12 Dec 2022 Jinhong Wang, Jingwen Wang, Tingting Chen, Wenhao Zheng, Zhe Xu, Xingdi Wu, Wen Xu, Haochao Ying, Danny Chen, Jian Wu

Clinically, to assess the necessity of cataract surgery, accurately predicting postoperative VA before surgery by analyzing multi-view optical coherence tomography (OCT) images is crucially needed.

regression

Learning Temporal Logic Properties: an Overview of Two Recent Methods

no code implementations2 Dec 2022 Jean-Raphaël Gaglione, Rajarshi Roy, Nasim Baharisangari, Daniel Neider, Zhe Xu, Ufuk Topcu

Learning linear temporal logic (LTL) formulas from examples labeled as positive or negative has found applications in inferring descriptions of system behavior.

Specificity Vocal Bursts Valence Prediction

Human-machine Interactive Tissue Prototype Learning for Label-efficient Histopathology Image Segmentation

1 code implementation26 Nov 2022 Wentao Pan, Jiangpeng Yan, Hanbo Chen, Jiawei Yang, Zhe Xu, Xiu Li, Jianhua Yao

Then, the encoder is used to map the images into the embedding space and generate pixel-level pseudo tissue masks by querying the tissue prototype dictionary.

Contrastive Learning Image Segmentation +5

Distributed Differentially Private Control Synthesis for Multi-Agent Systems with Metric Temporal Logic Specifications

no code implementations4 Oct 2022 Nasim Baharisangari, Zhe Xu

In this paper, we propose a distributed differentially private receding horizon control (RHC) approach for multi-agent systems (MAS) with metric temporal logic (MTL) specifications.

Learning Interpretable Temporal Properties from Positive Examples Only

1 code implementation6 Sep 2022 Rajarshi Roy, Jean-Raphaël Gaglione, Nasim Baharisangari, Daniel Neider, Zhe Xu, Ufuk Topcu

To learn meaningful models from positive examples only, we design algorithms that rely on conciseness and language minimality of models as regularizers.

A Hybrid Tracking Control Strategy for an Unmanned Underwater Vehicle Aided with Bioinspired Neural Dynamics

no code implementations3 Sep 2022 Zhe Xu, Tao Yan, Simon X. Yang, S. Andrew Gadsden

In comparative studies, the proposed combined hybrid control strategy has ensured control signals smoothness, which is critical in real world applications, especially for an unmanned underwater vehicle that needs to operate in complex underwater environments.

Non-Parametric Neuro-Adaptive Formation Control

no code implementations17 Jul 2022 Christos K. Verginis, Zhe Xu, Ufuk Topcu

Most existing algorithms either assume certain parametric forms for the unknown dynamic terms or resort to unnecessarily large control inputs in order to provide theoretical guarantees.

Towards Better Dermoscopic Image Feature Representation Learning for Melanoma Classification

1 code implementation15 Jul 2022 Chenghui Yu, Mingkang Tang, ShengGe Yang, Mingqing Wang, Zhe Xu, Jiangpeng Yan, HanMo Chen, Yu Yang, Xiao-jun Zeng, Xiu Li

Deep learning-based melanoma classification with dermoscopic images has recently shown great potential in automatic early-stage melanoma diagnosis.

Data Augmentation Denoising +2

Approximating Discontinuous Nash Equilibrial Values of Two-Player General-Sum Differential Games

no code implementations5 Jul 2022 Lei Zhang, Mukesh Ghimire, Wenlong Zhang, Zhe Xu, Yi Ren

This paper investigates two potential solutions to this problem: a hybrid method that leverages both supervised Nash equilibria and the HJI PDE, and a value-hardening method where a sequence of HJIs are solved with a gradually hardening reward.

Autonomous Driving Self-Supervised Learning

Consensus Formation Tracking for Multiple AUV Systems Using Distributed Bioinspired Sliding Mode Control

no code implementations16 Jun 2022 Tao Yan, Zhe Xu, Simon X. Yang

Consensus formation tracking of multiple autonomous underwater vehicles (AUVs) subject to nonlinear and uncertain dynamics is a challenging problem in robotics.

Seeking Common Ground While Reserving Differences: Multiple Anatomy Collaborative Framework for Undersampled MRI Reconstruction

no code implementations15 Jun 2022 Jiangpeng Yan, Chenghui Yu, Hanbo Chen, Zhe Xu, Junzhou Huang, Xiu Li, Jianhua Yao

Four different implementations of anatomy-specific learners are presented and explored on the top of our framework in two MRI reconstruction networks.

Anatomy De-aliasing +1

Optimal Propagation for Graph Neural Networks

no code implementations6 May 2022 Beidi Zhao, Boxin Du, Zhe Xu, Liangyue Li, Hanghang Tong

Graph Neural Networks (GNNs) have achieved tremendous success in a variety of real-world applications by relying on the fixed graph data as input.

Node Classification

Data Augmentation for Deep Graph Learning: A Survey

1 code implementation16 Feb 2022 Kaize Ding, Zhe Xu, Hanghang Tong, Huan Liu

In this survey, we formally formulate the problem of graph data augmentation and further review the representative techniques and their applications in different deep graph learning problems.

Data Augmentation Graph Learning

Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems

no code implementations11 Oct 2021 Christos K. Verginis, Zhe Xu, Ufuk Topcu

Most existing algorithms either assume certain parametric forms for the unknown dynamic terms or resort to unnecessarily large control inputs in order to provide theoretical guarantees.

Multi-Trigger-Key: Towards Multi-Task Privacy Preserving In Deep Learning

no code implementations6 Oct 2021 Ren Wang, Zhe Xu, Alfred Hero

Deep learning-based Multi-Task Classification (MTC) is widely used in applications like facial attributes and healthcare that warrant strong privacy guarantees.

Privacy Preserving

Multi-Trigger-Key: Towards Multi-Task Privacy-Preserving In Deep Learning

no code implementations29 Sep 2021 Ren Wang, Zhe Xu, Alfred Hero

Deep learning-based Multi-Task Classification (MTC) is widely used in applications like facial attribute and healthcare that warrant strong privacy guarantees.

Attribute Privacy Preserving

All-Around Real Label Supervision: Cyclic Prototype Consistency Learning for Semi-supervised Medical Image Segmentation

1 code implementation28 Sep 2021 Zhe Xu, Yixin Wang, Donghuan Lu, Lequan Yu, Jiangpeng Yan, Jie Luo, Kai Ma, Yefeng Zheng, Raymond Kai-yu Tong

Observing this, we ask an unexplored but interesting question: can we exploit the unlabeled data via explicit real label supervision for semi-supervised training?

Brain Tumor Segmentation Image Segmentation +3

Weighted Graph-Based Signal Temporal Logic Inference Using Neural Networks

no code implementations16 Sep 2021 Nasim Baharisangari, Kazuma Hirota, Ruixuan Yan, Agung Julius, Zhe Xu

It is important that the obtained knowledge is human-interpretable and amenable to formal analysis.

Classification

Double-Uncertainty Guided Spatial and Temporal Consistency Regularization Weighting for Learning-based Abdominal Registration

no code implementations6 Jul 2021 Zhe Xu, Jie Luo, Donghuan Lu, Jiangpeng Yan, Sarah Frisken, Jayender Jagadeesan, William Wells III, Xiu Li, Yefeng Zheng, Raymond Tong

Such convention has two limitations: (i) Besides the laborious grid search for the optimal fixed weight, the regularization strength of a specific image pair should be associated with the content of the images, thus the "one value fits all" training scheme is not ideal; (ii) Only spatially regularizing the transformation may neglect some informative clues related to the ill-posedness.

Image Registration

Probabilistic Control of Heterogeneous Swarms Subject to Graph Temporal Logic Specifications: A Decentralized and Scalable Approach

no code implementations29 Jun 2021 Franck Djeumou, Zhe Xu, Murat Cubuktepe, Ufuk Topcu

Specifically, we study a setting in which the agents move along the nodes of a graph, and the high-level task specifications for the swarm are expressed in a recently-proposed language called graph temporal logic (GTL).

Trust It or Not: Confidence-Guided Automatic Radiology Report Generation

no code implementations21 Jun 2021 Yixin Wang, Zihao Lin, Zhe Xu, Haoyu Dong, Jiang Tian, Jie Luo, Zhongchao shi, Yang Zhang, Jianping Fan, Zhiqiang He

Experimental results have demonstrated that the proposed method for model uncertainty characterization and estimation can produce more reliable confidence scores for radiology report generation, and the modified loss function, which takes into account the uncertainties, leads to better model performance on two public radiology report datasets.

Decision Making Image Captioning +2

A Deep Value-network Based Approach for Multi-Driver Order Dispatching

no code implementations8 Jun 2021 Xiaocheng Tang, Zhiwei Qin, Fan Zhang, Zhaodong Wang, Zhe Xu, Yintai Ma, Hongtu Zhu, Jieping Ye

In this work, we propose a deep reinforcement learning based solution for order dispatching and we conduct large scale online A/B tests on DiDi's ride-dispatching platform to show that the proposed method achieves significant improvement on both total driver income and user experience related metrics.

reinforcement-learning Reinforcement Learning (RL) +1

Noisy Labels are Treasure: Mean-Teacher-Assisted Confident Learning for Hepatic Vessel Segmentation

1 code implementation3 Jun 2021 Zhe Xu, Donghuan Lu, Yixin Wang, Jie Luo, Jayender Jagadeesan, Kai Ma, Yefeng Zheng, Xiu Li

Manually segmenting the hepatic vessels from Computer Tomography (CT) is far more expertise-demanding and laborious than other structures due to the low-contrast and complex morphology of vessels, resulting in the extreme lack of high-quality labeled data.

Uncertainty-Aware Signal Temporal Logic Inference

1 code implementation24 May 2021 Nasim Baharisangari, Jean-Raphaël Gaglione, Daniel Neider, Ufuk Topcu, Zhe Xu

In this paper, we first investigate the uncertainties associated with trajectories of a system and represent such uncertainties in the form of interval trajectories.

Graph Sanitation with Application to Node Classification

no code implementations19 May 2021 Zhe Xu, Boxin Du, Hanghang Tong

Generally speaking, the vast majority of the existing works aim to answer the following question, that is, given a graph, what is the best way to mine it?

Anomaly Detection Bilevel Optimization +6

Learning Linear Temporal Properties from Noisy Data: A MaxSAT Approach

no code implementations30 Apr 2021 Jean-Raphaël Gaglione, Daniel Neider, Rajarshi Roy, Ufuk Topcu, Zhe Xu

Our first algorithm infers minimal LTL formulas by reducing the inference problem to a problem in maximum satisfiability and then using off-the-shelf MaxSAT solvers to find a solution.

Temporal-Logic-Based Intermittent, Optimal, and Safe Continuous-Time Learning for Trajectory Tracking

no code implementations6 Apr 2021 Aris Kanellopoulos, Filippos Fotiadis, Chuangchuang Sun, Zhe Xu, Kyriakos G. Vamvoudakis, Ufuk Topcu, Warren E. Dixon

In this paper, we develop safe reinforcement-learning-based controllers for systems tasked with accomplishing complex missions that can be expressed as linear temporal logic specifications, similar to those required by search-and-rescue missions.

Reinforcement Learning (RL) Safe Reinforcement Learning

Robust Pandemic Control Synthesis with Formal Specifications: A Case Study on COVID-19 Pandemic

no code implementations26 Mar 2021 Zhe Xu, Xiaoming Duan

We provide simulation results in two different scenarios for robust control of the COVID-19 pandemic: one for vaccination control, and another for shield immunity control, with the model parameters estimated from data in Lombardy, Italy.

Provably Correct Controller Synthesis of Switched Stochastic Systems with Metric Temporal Logic Specifications: A Case Study on Power Systems

no code implementations26 Mar 2021 Zhe Xu, Yichen Zhang

In this paper, we present a provably correct controller synthesis approach for switched stochastic control systems with metric temporal logic (MTL) specifications with provable probabilistic guarantees.

Magneto-acoustic oscillations observed in a solar plage region

no code implementations28 Jan 2021 Haisheng Ji, Parida Hashim, Zhenxiang Hong, Zhe Xu, Jinhua Shen, Kaifan Ji, Wenda Cao

All findings show that the magnetic perturbations are actually magneto-acoustic oscillations on the solar surface, the photosphere, powered by p-mode oscillations.

Solar and Stellar Astrophysics

Pseudo-Loss Confidence Metric for Semi-Supervised Few-Shot Learning

no code implementations ICCV 2021 Kai Huang, Jie Geng, Wen Jiang, Xinyang Deng, Zhe Xu

Most semi-supervised few-shot learning methods select pseudo-labeled data of unlabeled set by task-specific confidence estimation.

Few-Shot Learning

Unimodal Cyclic Regularization for Training Multimodal Image Registration Networks

no code implementations12 Nov 2020 Zhe Xu, Jiangpeng Yan, Jie Luo, William Wells, Xiu Li, Jayender Jagadeesan

The loss function of an unsupervised multimodal image registration framework has two terms, i. e., a metric for similarity measure and regularization.

Image Registration

Unsupervised Multimodal Image Registration with Adaptative Gradient Guidance

no code implementations12 Nov 2020 Zhe Xu, Jiangpeng Yan, Jie Luo, Xiu Li, Jayender Jagadeesan

Multimodal image registration (MIR) is a fundamental procedure in many image-guided therapies.

Image Registration

F3RNet: Full-Resolution Residual Registration Network for Deformable Image Registration

no code implementations15 Sep 2020 Zhe Xu, Jie Luo, Jiangpeng Yan, Xiu Li, Jagadeesan Jayender

In this paper, we propose a novel unsupervised registration network, namely the Full-Resolution Residual Registration Network (F3RNet), for deformable registration of severely deformed organs.

Image Registration

Real-time and Large-scale Fleet Allocation of Autonomous Taxis: A Case Study in New York Manhattan Island

no code implementations6 Sep 2020 Yue Yang, Wencang Bao, Mohsen Ramezani, Zhe Xu

Nowadays, autonomous taxis become a highly promising transportation mode, which helps relieve traffic congestion and avoid road accidents.

Constrained Active Classification Using Partially Observable Markov Decision Processes

no code implementations10 Aug 2020 Bo Wu, Niklas Lauffer, Mohamadreza Ahmadi, Suda Bharadwaj, Zhe Xu, Ufuk Topcu

The proposed framework relies on assigning a classification belief (a probability distribution) to the attributes of interest.

Attribute Classification +1

Byzantine-Resilient Distributed Hypothesis Testing With Time-Varying Network Topology

no code implementations1 Aug 2020 Bo Wu, Steven Carr, Suda Bharadwaj, Zhe Xu, Ufuk Topcu

We study the problem of distributed hypothesis testing over a network of mobile agents with limited communication and sensing ranges to infer the true hypothesis collaboratively.

Adversarial Uni- and Multi-modal Stream Networks for Multimodal Image Registration

no code implementations6 Jul 2020 Zhe Xu, Jie Luo, Jiangpeng Yan, Ritvik Pulya, Xiu Li, William Wells III, Jayender Jagadeesan

Deformable image registration between Computed Tomography (CT) images and Magnetic Resonance (MR) imaging is essential for many image-guided therapies.

Computed Tomography (CT) Image Registration +2

Reward Machines for Cooperative Multi-Agent Reinforcement Learning

2 code implementations3 Jul 2020 Cyrus Neary, Zhe Xu, Bo Wu, Ufuk Topcu

In cooperative multi-agent reinforcement learning, a collection of agents learns to interact in a shared environment to achieve a common goal.

Multi-agent Reinforcement Learning Q-Learning +2

Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples

no code implementations28 Jun 2020 Zhe Xu, Bo Wu, Aditya Ojha, Daniel Neider, Ufuk Topcu

We compare our algorithm with the state-of-the-art RL algorithms for non-Markovian reward functions, such as Joint Inference of Reward machines and Policies for RL (JIRP), Learning Reward Machine (LRM), and Proximal Policy Optimization (PPO2).

Active Learning Q-Learning +2

Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers

1 code implementation ICLR 2020 Junjie Liu, Zhe Xu, Runbin Shi, Ray C. C. Cheung, Hayden K. -H. So

We present a novel network pruning algorithm called Dynamic Sparse Training that can jointly find the optimal network parameters and sparse network structure in a unified optimization process with trainable pruning thresholds.

Network Pruning

Do Public Datasets Assure Unbiased Comparisons for Registration Evaluation?

no code implementations20 Mar 2020 Jie Luo, Guangshen Ma, Sarah Frisken, Parikshit Juvekar, Nazim Haouchine, Zhe Xu, Yiming Xiao, Alexandra Golby, Patrick Codd, Masashi Sugiyama, William Wells III

In this study, we use the variogram to screen the manually annotated landmarks in two datasets used to benchmark registration in image-guided neurosurgeries.

Image Registration

Adaptive Teaching of Temporal Logic Formulas to Learners with Preferences

no code implementations27 Jan 2020 Zhe Xu, Yuxin Chen, Ufuk Topcu

In the context of teaching temporal logic formulas, an exhaustive search even for a myopic solution takes exponential time (with respect to the time span of the task).

Accurate and Compact Convolutional Neural Networks with Trained Binarization

no code implementations25 Sep 2019 Zhe Xu, Ray C. C. Cheung

Recently, binary convolutional neural networks are explored to help alleviate this issue by quantizing both weights and activations with only 1 single bit.

Binarization

Joint Inference of Reward Machines and Policies for Reinforcement Learning

no code implementations12 Sep 2019 Zhe Xu, Ivan Gavran, Yousef Ahmad, Rupak Majumdar, Daniel Neider, Ufuk Topcu, Bo Wu

The experiments show that learning high-level knowledge in the form of reward machines can lead to fast convergence to optimal policies in RL, while standard RL methods such as q-learning and hierarchical RL methods fail to converge to optimal policies after a substantial number of training steps in many tasks.

Q-Learning reinforcement-learning +1

Transfer of Temporal Logic Formulas in Reinforcement Learning

no code implementations10 Sep 2019 Zhe Xu, Ufuk Topcu

Transferring high-level knowledge from a source task to a target task is an effective way to expedite reinforcement learning (RL).

reinforcement-learning Reinforcement Learning (RL) +1

Deep Learning-Based Least Square Forward-Backward Stochastic Differential Equation Solver for High-Dimensional Derivative Pricing

no code implementations24 Jul 2019 Jian Liang, Zhe Xu, Peter Li

We propose a new forward-backward stochastic differential equation solver for high-dimensional derivatives pricing problems by combining deep learning solver with least square regression technique widely used in the least square Monte Carlo method for the valuation of American options.

regression

A Robust Background Initialization Algorithm with Superpixel Motion Detection

no code implementations17 May 2018 Zhe Xu, Biao Min, Ray C. C. Cheung

Scene background initialization allows the recovery of a clear image without foreground objects from a video sequence, which is generally the first step in many computer vision and video processing applications.

Clustering Motion Detection +1

Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management

1 code implementation18 Feb 2018 Kaixiang Lin, Renyu Zhao, Zhe Xu, Jiayu Zhou

Large-scale online ride-sharing platforms have substantially transformed our lives by reallocating transportation resources to alleviate traffic congestion and promote transportation efficiency.

Management Multi-agent Reinforcement Learning +3

Query-free Clothing Retrieval via Implicit Relevance Feedback

no code implementations1 Nov 2017 Zhuoxiang Chen, Zhe Xu, Ya zhang, Xiao Gu

We model this problem as a new type of image retrieval task in which the target image resides only in the user's mind (called "mental image retrieval" hereafter).

Decision Making Image Retrieval +1

Census Signal Temporal Logic Inference for Multi-Agent Group Behavior Analysis

no code implementations5 Oct 2016 Zhe Xu, Agung Julius

CensusSTL consists of an "inner logic" STL formula and an "outer logic" STL formula.

Part-Stacked CNN for Fine-Grained Visual Categorization

no code implementations CVPR 2016 Shaoli Huang, Zhe Xu, DaCheng Tao, Ya zhang

In the context of fine-grained visual categorization, the ability to interpret models as human-understandable visual manuals is sometimes as important as achieving high classification accuracy.

Classification Fine-Grained Image Classification +3

Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization

no code implementations ICCV 2015 Zhe Xu, Shaoli Huang, Ya zhang, DaCheng Tao

We propose a new method for fine-grained object recognition that employs part-level annotations and deep convolutional neural networks (CNNs) in a unified framework.

Object Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.