Search Results for author: Yan Zheng

Found 79 papers, 12 papers with code

SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models

no code implementations • 6 Mar 2024 • Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao

To bridge the gap with the real-world requirements, we introduce $\textbf{SheetRM}$, a benchmark featuring long-horizon and multi-category tasks with reasoning-dependent manipulation caused by real-life challenges.

Language Modelling Large Language Model

Paper
Add Code

Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models

no code implementations • 22 Feb 2024 • Jinyi Liu, Yifu Yuan, Jianye Hao, Fei Ni, Lingzhi Fu, Yibin Chen, Yan Zheng

Recently, there has been considerable attention towards leveraging large language models (LLMs) to enhance decision-making processes.

Decision Making Robot Manipulation

Paper
Add Code

Random Projection Layers for Multidimensional Time Series Forecasting

no code implementations • 16 Feb 2024 • Chin-Chia Michael Yeh, Yujie Fan, Xin Dai, Vivian Lai, Prince Osei Aboagye, Junpeng Wang, Huiyuan Chen, Yan Zheng, Zhongfang Zhuang, Liang Wang, Wei zhang

All-Multi-Layer Perceptron (all-MLP) mixer models have been shown to be effective for time series forecasting problems.

Time Series Time Series Forecasting

Paper
Add Code

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

no code implementations • 4 Feb 2024 • Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng

It is crucial to consider diverse human feedback types and various learning methods in different environments.

Paper
Add Code

DiffuserLite: Towards Real-time Diffusion Planning

no code implementations • 27 Jan 2024 • Zibin Dong, Jianye Hao, Yifu Yuan, Fei Ni, Yitian Wang, Pengyi Li, Yan Zheng

Diffusion planning has been recognized as an effective decision-making paradigm in various domains.

D4RL Decision Making

Paper
Add Code

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey

1 code implementation • 22 Jan 2024 • Pengyi Li, Jianye Hao, Hongyao Tang, Xian Fu, Yan Zheng, Ke Tang

Specifically, we systematically summarize recent advancements in relevant algorithms and identify three primary research directions: EA-assisted optimization of RL, RL-assisted optimization of EA, and synergistic optimization of EA and RL.

Evolutionary Algorithms reinforcement-learning +1

110

Paper
Code

PUPAE: Intuitive and Actionable Explanations for Time Series Anomalies

no code implementations • 16 Jan 2024 • Audrey Der, Chin-Chia Michael Yeh, Yan Zheng, Junpeng Wang, Zhongfang Zhuang, Liang Wang, Wei zhang, Eamonn J. Keogh

In this work we introduce a domain agnostic counterfactual explanation technique to produce explanations for time series anomalies.

Anomaly Detection counterfactual +3

Paper
Add Code

Has Your Pretrained Model Improved? A Multi-head Posterior Based Approach

no code implementations • 2 Jan 2024 • Prince Aboagye, Yan Zheng, Junpeng Wang, Uday Singh Saini, Xin Dai, Michael Yeh, Yujie Fan, Zhongfang Zhuang, Shubham Jain, Liang Wang, Wei zhang

The emergence of pre-trained models has significantly impacted Natural Language Processing (NLP) and Computer Vision to relational datasets.

Paper
Add Code

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

no code implementations • 19 Dec 2023 • Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun

In reinforcement learning, the optimism in the face of uncertainty (OFU) is a mainstream principle for directing exploration towards less explored areas, characterized by higher uncertainty.

Continuous Control

Paper
Add Code

Sketching Multidimensional Time Series for Fast Discord Mining

no code implementations • 5 Nov 2023 • Chin-Chia Michael Yeh, Yan Zheng, Menghai Pan, Huiyuan Chen, Zhongfang Zhuang, Junpeng Wang, Liang Wang, Wei zhang, Jeff M. Phillips, Eamonn Keogh

In this work, we propose a sketch for discord mining among multi-dimensional time series.

Anomaly Detection Time Series +1

Paper
Add Code

Temporal Treasure Hunt: Content-based Time Series Retrieval System for Discovering Insights

no code implementations • 5 Nov 2023 • Chin-Chia Michael Yeh, Huiyuan Chen, Xin Dai, Yan Zheng, Yujie Fan, Vivian Lai, Junpeng Wang, Audrey Der, Zhongfang Zhuang, Liang Wang, Wei zhang

To facilitate this investigation, we introduce a CTSR benchmark dataset that comprises time series data from a variety of domains, such as motion, power demand, and traffic.

Retrieval Time Series +1

Paper
Add Code

Ego-Network Transformer for Subsequence Classification in Time Series Data

no code implementations • 5 Nov 2023 • Chin-Chia Michael Yeh, Huiyuan Chen, Yujie Fan, Xin Dai, Yan Zheng, Vivian Lai, Junpeng Wang, Zhongfang Zhuang, Liang Wang, Wei zhang, Eamonn Keogh

The ego-networks of all subsequences collectively form a time series subsequence graph, and we introduce an algorithm to efficiently construct this graph.

Time Series Time Series Classification

Paper
Add Code

Time Series Synthesis Using the Matrix Profile for Anonymization

no code implementations • 5 Nov 2023 • Audrey Der, Chin-Chia Michael Yeh, Yan Zheng, Junpeng Wang, Huiyuan Chen, Zhongfang Zhuang, Liang Wang, Wei zhang, Eamonn Keogh

As a result, unmodified data mining tools can obtain near-identical performance on the synthesized time series as on the original time series.

Time Series

Paper
Add Code

Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning

no code implementations • 2 Nov 2023 • Yiran Li, Junpeng Wang, Prince Aboagye, Michael Yeh, Yan Zheng, Liang Wang, Wei zhang, Kwan-Liu Ma

On the one hand, by visually examining the captions automatically generated from language-image models for an image dataset, we gain deeper insights into the semantic underpinnings of the visual contents, unearthing data biases that may be entrenched within the dataset.

Caption Generation Efficient Exploration +1

Paper
Add Code

FATA-Trans: Field And Time-Aware Transformer for Sequential Tabular Data

1 code implementation • 20 Oct 2023 • Dongyu Zhang, Liang Wang, Xin Dai, Shubham Jain, Junpeng Wang, Yujie Fan, Chin-Chia Michael Yeh, Yan Zheng, Zhongfang Zhuang, Wei zhang

FATA-Trans is field- and time-aware for sequential tabular data.

Language Modelling Masked Language Modeling

Paper
Code

Toward a Foundation Model for Time Series Data

no code implementations • 5 Oct 2023 • Chin-Chia Michael Yeh, Xin Dai, Huiyuan Chen, Yan Zheng, Yujie Fan, Audrey Der, Vivian Lai, Zhongfang Zhuang, Junpeng Wang, Liang Wang, Wei zhang

A foundation model is a machine learning model trained on a large and diverse set of data, typically using self-supervised learning-based pre-training techniques, that can be adapted to various downstream tasks.

Self-Supervised Learning Time Series

Paper
Add Code

An Efficient Content-based Time Series Retrieval System

no code implementations • 5 Oct 2023 • Chin-Chia Michael Yeh, Huiyuan Chen, Xin Dai, Yan Zheng, Junpeng Wang, Vivian Lai, Yujie Fan, Audrey Der, Zhongfang Zhuang, Liang Wang, Wei zhang, Jeff M. Phillips

A Content-based Time Series Retrieval (CTSR) system is an information retrieval system for users to interact with time series emerged from multiple domains, such as finance, healthcare, and manufacturing.

Information Retrieval Retrieval +1

Paper
Add Code

Multitask Learning for Time Series Data with 2D Convolution

no code implementations • 5 Oct 2023 • Chin-Chia Michael Yeh, Xin Dai, Yan Zheng, Junpeng Wang, Huiyuan Chen, Yujie Fan, Audrey Der, Zhongfang Zhuang, Liang Wang, Wei zhang

In this paper, we investigate the application of MTL to the time series classification (TSC) problem.

Dynamic Time Warping Recommendation Systems +2

Paper
Add Code

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model

no code implementations • 3 Oct 2023 • Zibin Dong, Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Tangjie Lv, Changjie Fan, Zhipeng Hu

Aligning agent behaviors with diverse human preferences remains a challenging problem in reinforcement learning (RL), owing to the inherent abstractness and mutability of human preferences.

Attribute Reinforcement Learning (RL)

Paper
Add Code

Hessian-aware Quantized Node Embeddings for Recommendation

no code implementations • 2 Sep 2023 • Huiyuan Chen, Kaixiong Zhou, Kwei-Herng Lai, Chin-Chia Michael Yeh, Yan Zheng, Xia Hu, Hao Yang

To address the gradient mismatch problem in STE, we further consider the quantized errors and its second-order derivatives for better stability.

Recommendation Systems Retrieval

Paper
Add Code

Adversarial Collaborative Filtering for Free

no code implementations • 20 Aug 2023 • Huiyuan Chen, Xiaoting Li, Vivian Lai, Chin-Chia Michael Yeh, Yujie Fan, Yan Zheng, Mahashweta Das, Hao Yang

In this paper, we present Sharpness-aware Collaborative Filtering (SharpCF), a simple yet effective method that conducts adversarial training without extra computational cost over the base optimizer.

Collaborative Filtering

Paper
Add Code

EmbeddingTree: Hierarchical Exploration of Entity Features in Embedding

no code implementations • 2 Aug 2023 • Yan Zheng, Junpeng Wang, Chin-Chia Michael Yeh, Yujie Fan, Huiyuan Chen, Liang Wang, Wei zhang

The tool helps users discover nuance features of data entities, perform feature denoising/injecting in embedding training, and generate embeddings for unseen entities.

Denoising

Paper
Add Code

BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization

1 code implementation • 1 Aug 2023 • Junyi Wang, Yuanyang Zhu, Zhi Wang, Yan Zheng, Jianye Hao, Chunlin Chen

Evolutionary reinforcement learning (ERL) algorithms recently raise attention in tackling complex reinforcement learning (RL) problems due to high parallelism, while they are prone to insufficient exploration or model collapse without carefully tuning hyperparameters (aka meta-parameters).

Bilevel Optimization reinforcement-learning +1

Paper
Code

Sharpness-Aware Graph Collaborative Filtering

no code implementations • 18 Jul 2023 • Huiyuan Chen, Chin-Chia Michael Yeh, Yujie Fan, Yan Zheng, Junpeng Wang, Vivian Lai, Mahashweta Das, Hao Yang

Graph Neural Networks (GNNs) have achieved impressive performance in collaborative filtering.

Collaborative Filtering

Paper
Add Code

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning

no code implementations • 27 Jun 2023 • Jinyi Liu, Yi Ma, Jianye Hao, Yujing Hu, Yan Zheng, Tangjie Lv, Changjie Fan

In summary, our research emphasizes the significance of trajectory-based data sampling techniques in enhancing the efficiency and performance of offline RL algorithms.

D4RL Offline RL +2

Paper
Add Code

Improving Offline-to-Online Reinforcement Learning with Q-Ensembles

no code implementations • 12 Jun 2023 • Kai Zhao, Yi Ma, Jianye Hao, Jinyi Liu, Yan Zheng, Zhaopeng Meng

Offline reinforcement learning (RL) is a learning paradigm where an agent learns from a fixed dataset of experience.

Offline RL reinforcement-learning +1

Paper
Add Code

HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach

no code implementations • 10 Jun 2023 • Shixi Lian, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng

Offline reinforcement learning (ORL) has gained attention as a means of training reinforcement learning models using pre-collected static data.

D4RL Data Augmentation +1

Paper
Add Code

Iteratively Refined Behavior Regularization for Offline Reinforcement Learning

no code implementations • 9 Jun 2023 • Xiaohan Hu, Yi Ma, Chenjun Xiao, Yan Zheng, Jianye Hao

One of the fundamental challenges for offline reinforcement learning (RL) is ensuring robustness to data distribution.

D4RL Offline RL +2

Paper
Add Code

Prompt Injection attack against LLM-integrated Applications

no code implementations • 8 Jun 2023 • Yi Liu, Gelei Deng, Yuekang Li, Kailong Wang, ZiHao Wang, XiaoFeng Wang, Tianwei Zhang, Yepang Liu, Haoyu Wang, Yan Zheng, Yang Liu

We deploy HouYi on 36 actual LLM-integrated applications and discern 31 applications susceptible to prompt injection.

Paper
Add Code

PDT: Pretrained Dual Transformers for Time-aware Bipartite Graphs

no code implementations • 2 Jun 2023 • Xin Dai, Yujie Fan, Zhongfang Zhuang, Shubham Jain, Chin-Chia Michael Yeh, Junpeng Wang, Liang Wang, Yan Zheng, Prince Osei Aboagye, Wei zhang

Pre-training on large models is prevalent and emerging with the ever-growing user-generated content in many machine learning application categories.

Contrastive Learning

Paper
Add Code

MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL

no code implementations • 31 May 2023 • Fei Ni, Jianye Hao, Yao Mu, Yifu Yuan, Yan Zheng, Bin Wang, Zhixuan Liang

Recently, diffusion model shines as a promising backbone for the sequence modeling paradigm in offline reinforcement learning(RL).

Reinforcement Learning (RL)

Paper
Add Code

Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs

no code implementations • 24 Apr 2023 • Yinchuan Li, Zhigang Li, Wenqian Li, Yunfeng Shao, Yan Zheng, Jianye Hao

Many score-based active learning methods have been successfully applied to graph-structured data, aiming to reduce the number of labels and achieve better performance of graph neural networks based on predefined score functions.

Active Learning

Paper
Add Code

How Does Attention Work in Vision Transformers? A Visual Analytics Attempt

no code implementations • 24 Mar 2023 • Yiran Li, Junpeng Wang, Xin Dai, Liang Wang, Chin-Chia Michael Yeh, Yan Zheng, Wei zhang, Kwan-Liu Ma

Multi-head self-attentions are then applied to the sequence to learn the attention between patches.

Paper
Add Code

Neural Episodic Control with State Abstraction

no code implementations • 27 Jan 2023 • Zhuo Li, Derui Zhu, Yujing Hu, Xiaofei Xie, Lei Ma, Yan Zheng, Yan Song, Yingfeng Chen, Jianjun Zhao

Generally, episodic control-based approaches are solutions that leverage highly-rewarded past experiences to improve sample efficiency of DRL algorithms.

OpenAI Gym

Paper
Add Code

Matrix Profile XXVII: A Novel Distance Measure for Comparing Long Time Series

no code implementations • 9 Dec 2022 • Audrey Der, Chin-Chia Michael Yeh, Renjie Wu, Junpeng Wang, Yan Zheng, Zhongfang Zhuang, Liang Wang, Wei zhang, Eamonn Keogh

PRCIS is a distance measure for long time series, which exploits recent progress in our ability to summarize time series with dictionaries.

Anomaly Detection Dynamic Time Warping +2

Paper
Add Code

Denoising Self-attentive Sequential Recommendation

no code implementations • 8 Dec 2022 • Huiyuan Chen, Yusan Lin, Menghai Pan, Lan Wang, Chin-Chia Michael Yeh, Xiaoting Li, Yan Zheng, Fei Wang, Hao Yang

Transformer-based sequential recommenders are very powerful for capturing both short-term and long-term sequential item dependencies.

Denoising Sequential Recommendation

Paper
Add Code

TinyKG: Memory-Efficient Training Framework for Knowledge Graph Neural Recommender Systems

no code implementations • 8 Dec 2022 • Huiyuan Chen, Xiaoting Li, Kaixiong Zhou, Xia Hu, Chin-Chia Michael Yeh, Yan Zheng, Hao Yang

We found that our TinyKG with INT2 quantization aggressively reduces the memory footprint of activation maps with $7 \times$, only with $2\%$ loss in accuracy, allowing us to deploy KGNNs on memory-constrained devices.

Knowledge Graphs Quantization +1

Paper
Add Code

Quantized Wasserstein Procrustes Alignment of Word Embedding Spaces

no code implementations • AMTA 2022 • Prince O Aboagye, Yan Zheng, Michael Yeh, Junpeng Wang, Zhongfang Zhuang, Huiyuan Chen, Liang Wang, Wei zhang, Jeff Phillips

Optimal Transport (OT) provides a useful geometric framework to estimate the permutation matrix under unsupervised cross-lingual word embedding (CLWE) models that pose the alignment task as a Wasserstein-Procrustes problem.

Bilingual Lexicon Induction Quantization

Paper
Add Code

ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation

1 code implementation • 26 Oct 2022 • Jianye Hao, Pengyi Li, Hongyao Tang, Yan Zheng, Xian Fu, Zhaopeng Meng

The state representation conveys expressive common features of the environment learned by all the agents collectively; the linear policy representation provides a favorable space for efficient policy optimization, where novel behavior-level crossover and mutation operations can be performed.

Continuous Control Evolutionary Algorithms +2

Paper
Code

Neural Volumetric Mesh Generator

no code implementations • 6 Oct 2022 • Yan Zheng, Lemeng Wu, Xingchao Liu, Zhen Chen, Qiang Liu, QiXing Huang

We first propose a diffusion-based generative model to tackle this problem by generating voxelized shapes with close-to-reality outlines and structures.

Paper
Add Code

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model

no code implementations • 2 Oct 2022 • Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan

Unsupervised reinforcement learning (URL) poses a promising paradigm to learn useful behaviors in a task-agnostic environment without the guidance of extrinsic rewards to facilitate the fast adaptation of various downstream tasks.

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Add Code

Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes

no code implementations • 16 Sep 2022 • Min Zhang, Hongyao Tang, Jianye Hao, Yan Zheng

First, we propose a unified policy abstraction theory, containing three types of policy abstraction associated to policy features at different levels.

Decision Making Metric Learning +2

Paper
Add Code

Embedding Compression with Hashing for Efficient Representation Learning in Large-Scale Graph

no code implementations • 11 Aug 2022 • Chin-Chia Michael Yeh, Mengting Gu, Yan Zheng, Huiyuan Chen, Javid Ebrahimi, Zhongfang Zhuang, Junpeng Wang, Liang Wang, Wei zhang

Graph neural networks (GNNs) are deep learning models designed specifically for graph data, and they typically rely on node features as the input to the first layer.

Representation Learning

Paper
Add Code

HIFI-Net: A Novel Network for Enhancement to Underwater Images

no code implementations • 6 Jun 2022 • Jiajia Zhou, Junbin Zhuang, Yan Zheng, Di wu

As this network make "Haar Images into Fusion Images", it is called HIFI-Net.

Paper
Add Code

GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis

no code implementations • 27 May 2022 • Yushi Cao, Zhiming Li, Tianpei Yang, Hao Zhang, Yan Zheng, Yi Li, Jianye Hao, Yang Liu

In this paper, we combine the above two paradigms together and propose a novel Generalizable Logic Synthesis (GALOIS) framework to synthesize hierarchical and strict cause-effect logic programs.

Decision Making Program Synthesis +2

Paper
Add Code

Relational Representation Learning in Visually-Rich Documents

no code implementations • 5 May 2022 • Xin Li, Yan Zheng, Yiqing Hu, Haoyu Cao, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Bo Ren

To deal with the unpredictable definition of relations, we propose a novel contrastive learning task named Relational Consistency Modeling (RCM), which harnesses the fact that existing relations should be consistent in differently augmented positive views.

Contrastive Learning Key Information Extraction +3

Paper
Add Code

PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations

1 code implementation • 6 Apr 2022 • Tong Sang, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Zhaopeng Meng, Boyan Li, Zhen Wang

In online adaptation phase, with the environment context inferred from few experiences collected in new environments, the policy is optimized by gradient ascent with respect to the PDVF.

Contrastive Learning Decision Making

Paper
Code

Revealing Reliable Signatures by Learning Top-Rank Pairs

no code implementations • 17 Mar 2022 • Xiaotong Ji, Yan Zheng, Daiki Suehiro, Seiichi Uchida

Signature verification, as a crucial practical documentation analysis task, has been continuously studied by researchers in machine learning and pattern recognition fields.

POS

Paper
Add Code

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration

1 code implementation • 16 Mar 2022 • Pengyi Li, Hongyao Tang, Tianpei Yang, Xiaotian Hao, Tong Sang, Yan Zheng, Jianye Hao, Matthew E. Taylor, Wenyuan Tao, Zhen Wang, Fazl Barez

However, we reveal sub-optimal collaborative behaviors also emerge with strong correlations, and simply maximizing the MI can, surprisingly, hinder the learning towards better collaboration.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Code

Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework

no code implementations • 10 Mar 2022 • Xiaotian Hao, Hangyu Mao, Weixun Wang, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang, Jianye Hao

To break this curse, we propose a unified agent permutation framework that exploits the permutation invariance (PI) and permutation equivariance (PE) inductive biases to reduce the multiagent state space.

Data Augmentation Reinforcement Learning (RL) +1

Paper
Add Code

Learning-From-Disagreement: A Model Comparison and Visual Analytics Framework

no code implementations • 19 Jan 2022 • Junpeng Wang, Liang Wang, Yan Zheng, Chin-Chia Michael Yeh, Shubham Jain, Wei zhang

With these metrics, one can easily identify meta-features with the most complementary behaviors in two classifiers, and use them to better ensemble the classifiers.

Binary Classification

Paper
Add Code

Error-bounded Approximate Time Series Joins Using Compact Dictionary Representations of Time Series

no code implementations • 24 Dec 2021 • Chin-Chia Michael Yeh, Yan Zheng, Junpeng Wang, Huiyuan Chen, Zhongfang Zhuang, Wei zhang, Eamonn Keogh

The matrix profile is an effective data mining tool that provides similarity join functionality for time series data.

Time Series Time Series Analysis

Paper
Add Code

Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning

no code implementations • 19 Nov 2021 • Tong Sang, Hongyao Tang, Jianye Hao, Yan Zheng, Zhaopeng Meng

Such a reconstruction exploits the underlying structure of value matrix to improve the value approximation, thus leading to a more efficient learning process of value function.

Continuous Control reinforcement-learning +1

Paper
Add Code

OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning

no code implementations • 29 Sep 2021 • Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Junjie Ye, Chenjia Bai, Pengyi Li

Many exploration strategies are built upon the optimism in the face of the uncertainty (OFU) principle for reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Embedding Compression with Hashing for Efficient Representation Learning in Graph

no code implementations • 29 Sep 2021 • Chin-Chia Michael Yeh, Mengting Gu, Yan Zheng, Huiyuan Chen, Javid Ebrahimi, Zhongfang Zhuang, Junpeng Wang, Liang Wang, Wei zhang

When applying such type of networks on graph without node feature, one can extract simple graph-based node features (e. g., number of degrees) or learn the input node representation (i. e., embeddings) when training the network.

Representation Learning

Paper
Add Code

Online Multi-horizon Transaction Metric Estimation with Multi-modal Learning in Payment Networks

no code implementations • 21 Sep 2021 • Chin-Chia Michael Yeh, Zhongfang Zhuang, Junpeng Wang, Yan Zheng, Javid Ebrahimi, Ryan Mercer, Liang Wang, Wei zhang

In this work, we study the problem of multivariate time series prediction for estimating transaction metrics associated with entities in the payment transaction database.

Time Series Time Series Prediction

Paper
Add Code

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation

1 code implementation • ICLR 2022 • Boyan Li, Hongyao Tang, Yan Zheng, Jianye Hao, Pengyi Li, Zhen Wang, Zhaopeng Meng, Li Wang

Discrete-continuous hybrid action space is a natural setting in many practical problems, such as robot control and game AI.

reinforcement-learning Reinforcement Learning (RL)

2,513

Paper
Code

Towards Book Cover Design via Layout Graphs

1 code implementation • 24 May 2021 • Wensheng Zhang, Yan Zheng, Taiga Miyazono, Seiichi Uchida, Brian Kenji Iwana

Book covers are intentionally designed and provide an introduction to a book.

Paper
Code

Normalization of Language Embeddings for Cross-Lingual Alignment

1 code implementation • NeurIPS 2021 • Prince Osei Aboagye, Jeff Phillips, Yan Zheng, Chin-Chia Michael Yeh, Junpeng Wang, Wei zhang, Liang Wang, Hao Yang

Learning a good transfer function to map the word vectors from two languages into a shared cross-lingual word vector space plays a crucial role in cross-lingual NLP.

Translation

Paper
Code

VERB: Visualizing and Interpreting Bias Mitigation Techniques for Word Representations

1 code implementation • 6 Apr 2021 • Archit Rathore, Sunipa Dev, Jeff M. Phillips, Vivek Srikumar, Yan Zheng, Chin-Chia Michael Yeh, Junpeng Wang, Wei zhang, Bei Wang

To aid this, we present Visualization of Embedding Representations for deBiasing system ("VERB"), an open-source web-based visualization tool that helps the users gain a technical understanding and visual intuition of the inner workings of debiasing techniques, with a focus on their geometric properties.

Decision Making Dimensionality Reduction +3

Paper
Code

MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning

no code implementations • 1 Jan 2021 • Jinyi Liu, Zhi Wang, Jianye Hao, Yan Zheng

Recently, the principle of optimism in the face of (aleatoric and epistemic) uncertainty has been utilized to design efficient exploration strategies for Reinforcement Learning (RL).

Efficient Exploration reinforcement-learning +1

Paper
Add Code

Merchant Category Identification Using Credit Card Transactions

no code implementations • 5 Nov 2020 • Chin-Chia Michael Yeh, Zhongfang Zhuang, Yan Zheng, Liang Wang, Junpeng Wang, Wei zhang

In this work, we approach this problem from a multi-modal learning perspective, where we use not only the merchant time series data but also the information of merchant-merchant relationship (i. e., affinity) to verify the self-reported business type (i. e., merchant category) of a given merchant.

Time Series Time Series Analysis +1

Paper
Add Code

Towards a Flexible Embedding Learning Framework

no code implementations • 23 Sep 2020 • Chin-Chia Michael Yeh, Dhruv Gelda, Zhongfang Zhuang, Yan Zheng, Liang Gou, Wei zhang

Our proposed framework utilizes a set of entity-relation-matrices as the input, which quantifies the affinities among different entities in the database.

Relation Representation Learning

Paper
Add Code

Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning

no code implementations • 14 May 2020 • Jianwen Sun, Tianwei Zhang, Xiaofei Xie, Lei Ma, Yan Zheng, Kangjie Chen, Yang Liu

Adversarial attacks against conventional Deep Learning (DL) systems and algorithms have been widely studied, and various defenses were proposed.

Adversarial Attack reinforcement-learning +1

Paper
Add Code

Continuous Multiagent Control using Collective Behavior Entropy for Large-Scale Home Energy Management

no code implementations • 14 May 2020 • Jianwen Sun, Yan Zheng, Jianye Hao, Zhaopeng Meng, Yang Liu

With the increasing popularity of electric vehicles, distributed energy generation and storage facilities in smart grid systems, an efficient Demand-Side Management (DSM) is urgent for energy savings and peak loads reduction.

energy management Management

Paper
Add Code

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

no code implementations • 18 Feb 2020 • Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng

Our framework consists of a fuzzy rule controller to represent human knowledge and a refine module to fine-tune suboptimal prior knowledge.

Common Sense Reasoning Continuous Control +2

Paper
Add Code

Diverse Behavior Is What Game AI Needs: Generating Varied Human-Like Playing Styles Using Evolutionary Multi-Objective Deep Reinforcement Learning

no code implementations • 20 Oct 2019 • Ruimin Shen, Yan Zheng, Jianye Hao, Yinfeng Chen, Changjie Fan

this paper has been withdrawn

Paper
Add Code

Constrained Non-Affine Alignment of Embeddings

no code implementations • 13 Oct 2019 • Yuwei Wang, Yan Zheng, Yanqing Peng, Chin-Chia Michael Yeh, Zhongfang Zhuang, Das Mahashweta, Bendre Mangesh, Feifei Li, Wei zhang, Jeff M. Phillips

Embeddings are already essential tools for large language models and image analysis, and their use is being extended to many other research domains.

Paper
Add Code

Gradient Boosting Survival Tree with Applications in Credit Scoring

1 code implementation • 9 Aug 2019 • Miaojun Bai, Yan Zheng, Yun Shen

In order to deal with highly heterogeneous industrial data collected in Chinese market of consumer finance, we propose a nonparametric ensemble tree model called gradient boosting survival tree (GBST) that extends the survival tree models with a gradient boosting algorithm.

Survival Analysis

Paper
Code

A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents

no code implementations • NeurIPS 2018 • Yan Zheng, Zhaopeng Meng, Jianye Hao, Zongzhang Zhang, Tianpei Yang, Changjie Fan

In multiagent domains, coping with non-stationary agents that change behaviors from time to time is a challenging problem, where an agent is usually required to be able to quickly detect the other agent's policy during online interaction, and then adapt its own policy accordingly.

Paper
Add Code

Semantic Part Detection via Matching: Learning to Generalize to Novel Viewpoints from Limited Training Data

1 code implementation • ICCV 2019 • Yutong Bai, Qing Liu, Lingxi Xie, Weichao Qiu, Yan Zheng, Alan Yuille

In particular, this enables images in the training dataset to be matched to a virtual 3D model of the object (for simplicity, we assume that the object viewpoint can be estimated by standard techniques).

Clustering Object +1

Paper
Code

Hierarchical Deep Multiagent Reinforcement Learning with Temporal Abstraction

no code implementations • 25 Sep 2018 • Hongyao Tang, Jianye Hao, Tangjie Lv, Yingfeng Chen, Zongzhang Zhang, Hangtian Jia, Chunxu Ren, Yan Zheng, Zhaopeng Meng, Changjie Fan, Li Wang

Besides, we propose a new experience replay mechanism to alleviate the issue of the sparse transitions at the high level of abstraction and the non-stationarity of multiagent learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Towards Efficient Detection and Optimal Response against Sophisticated Opponents

no code implementations • 12 Sep 2018 • Tianpei Yang, Zhaopeng Meng, Jianye Hao, Chongjie Zhang, Yan Zheng, Ze Zheng

This paper proposes a novel approach called Bayes-ToMoP which can efficiently detect the strategy of opponents using either stationary or higher-level reasoning strategies.

Multiagent Systems

Paper
Add Code

End-to-End Neural Ranking for eCommerce Product Search: an application of task models and textual embeddings

no code implementations • 19 Jun 2018 • Brenner Eliot, Zhao Jun, Kutiyanawala Aliasgar, Yan Zheng

The different types of relevance models developed for IR have complementary advantages and disadvantages when applied to eCommerce product search.

Benchmarking

Paper
Add Code

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments

no code implementations • 23 Feb 2018 • Yan Zheng, Jianye Hao, Zongzhang Zhang

Recently, multiagent deep reinforcement learning (DRL) has received increasingly wide attention.

Q-Learning reinforcement-learning +1

Paper
Add Code

Interactive Deep Colorization With Simultaneous Global and Local Inputs

no code implementations • 27 Jan 2018 • Yi Xiao, Peiyao Zhou, Yan Zheng

To solve this problem, we present a novel deep colorization method, which allows simultaneous global and local inputs to better control the output colorized images.

Colorization

Paper
Add Code

Joint convolutional neural pyramid for depth map super-resolution

no code implementations • 3 Jan 2018 • Yi Xiao, Xiang Cao, Xianyi Zhu, Renzhi Yang, Yan Zheng

The convolutional neural pyramids extract information from large receptive fields of the depth map and guidance map, while the convolutional neural network effectively transfers useful structures of the guidance image to the depth image.

Depth Map Super-Resolution

Paper
Add Code

Coresets for Kernel Regression

no code implementations • 13 Feb 2017 • Yan Zheng, Jeff M. Phillips

Kernel regression is an essential and ubiquitous tool for non-parametric data analysis, particularly popular among time series and spatial data.

regression Time Series +1

Paper
Add Code

Subsampling in Smoothed Range Spaces

no code implementations • 30 Oct 2015 • Jeff M. Phillips, Yan Zheng

We consider smoothed versions of geometric range spaces, so an element of the ground set (e. g. a point) can be contained in a range with a non-binary value in $[0, 1]$.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.