Search Results for author: Yong Yu

Found 141 papers, 73 papers with code

Nested Named Entity Recognition with Span-level Graphs

no code implementations ACL 2022 Juncheng Wan, Dongyu Ru, Weinan Zhang, Yong Yu

In this work, we try to improve the span representation by utilizing retrieval-based span-level graphs, connecting spans and entities in the training data based on n-gram features.

named-entity-recognition Named Entity Recognition +3

DRepMRec: A Dual Representation Learning Framework for Multimodal Recommendation

no code implementations17 Apr 2024 Kangning Zhang, Yingjie Qin, Ruilong Su, Yifan Liu, Jiarui Jin, Weinan Zhang, Yong Yu

After obtaining separate behavior and modal representations, we design a Behavior-Modal Alignment Module (BMA) to align and fuse the dual representations to solve the misalignment problem.

Multimodal Recommendation Representation Learning

Recall-Augmented Ranking: Enhancing Click-Through Rate Prediction Accuracy with Cross-Stage Data

no code implementations15 Apr 2024 JunJie Huang, Guohao Cai, Jieming Zhu, Zhenhua Dong, Ruiming Tang, Weinan Zhang, Yong Yu

RAR consists of two key sub-modules, which synergistically gather information from a vast pool of look-alike users and recall items, resulting in enriched user representations.

Click-Through Rate Prediction

Emerging Platforms Meet Emerging LLMs: A Year-Long Journey of Top-Down Development

no code implementations14 Apr 2024 Siyuan Feng, Jiawei Liu, Ruihang Lai, Charlie F. Ruan, Yong Yu, Lingming Zhang, Tianqi Chen

While a traditional bottom-up development pipeline fails to close the gap timely, we introduce TapML, a top-down approach and tooling designed to streamline the deployment of ML systems on diverse platforms, optimized for developer productivity.

M-scan: A Multi-Scenario Causal-driven Adaptive Network for Recommendation

no code implementations11 Apr 2024 Jiachen Zhu, Yichao Wang, Jianghao Lin, Jiarui Qin, Ruiming Tang, Weinan Zhang, Yong Yu

Furthermore, through causal graph analysis, we have discovered that the scenario itself directly influences click behavior, yet existing approaches directly incorporate data from other scenarios during the training of the current scenario, leading to prediction biases when they directly utilize click behaviors from other scenarios to train models.

counterfactual Counterfactual Inference

Play to Your Strengths: Collaborative Intelligence of Conventional Recommender Models and Large Language Models

no code implementations25 Mar 2024 Yunjia Xi, Weiwen Liu, Jianghao Lin, Chuhan Wu, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu

The rise of large language models (LLMs) has opened new opportunities in Recommender Systems (RSs) by enhancing user behavior modeling and content understanding.

Language Modelling Large Language Model +1

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

1 code implementation10 Mar 2024 Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang

Among these works, many of them utilize in-context examples to achieve generalization without the need for fine-tuning, while few of them have considered the problem of how to select and effectively utilize these examples.

Language Modelling Large Language Model +1

Looking Ahead to Avoid Being Late: Solving Hard-Constrained Traveling Salesman Problem

no code implementations8 Mar 2024 Jingxiao Chen, Ziqin Gong, Minghuan Liu, Jun Wang, Yong Yu, Weinan Zhang

To overcome this problem and to have an effective solution against hard constraints, we proposed a novel learning-based method that uses looking-ahead information as the feature to improve the legality of TSP with Time Windows (TSPTW) solutions.

Traveling Salesman Problem

Towards Efficient and Effective Unlearning of Large Language Models for Recommendation

1 code implementation6 Mar 2024 Hangyu Wang, Jianghao Lin, Bo Chen, Yang Yang, Ruiming Tang, Weinan Zhang, Yong Yu

However, in order to protect user privacy and optimize utility, it is also crucial for LLMRec to intentionally forget specific user data, which is generally referred to as recommendation unlearning.

World Knowledge

Offline Fictitious Self-Play for Competitive Games

no code implementations29 Feb 2024 Jingxiao Chen, Weiji Xie, Weinan Zhang, Yong Yu, Ying Wen

Firstly, unaware of the game structure, it is impossible to interact with the opponents and conduct a major learning paradigm, self-play, for competitive games.

Offline RL Reinforcement Learning (RL)

InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization

no code implementations23 Jan 2024 Jiarui Jin, Zexue He, Mengyue Yang, Weinan Zhang, Yong Yu, Jun Wang, Julian McAuley

Subsequently, we minimize the mutual information between the observation estimation and the relevance estimation conditioned on the input features.

Learning-To-Rank Recommendation Systems

D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems

no code implementations21 Jan 2024 Jiarui Qin, Weiwen Liu, Ruiming Tang, Weinan Zhang, Yong Yu

A personalized knowledge adaptation unit is devised to effectively exploit the information from the knowledge base by adapting the retrieved knowledge to the target samples.

Recommendation Systems

Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios

1 code implementation29 Dec 2023 Xinyuan Wu, Wentao Dong, Hang Lai, Yong Yu, Ying Wen

Quadruped robots have strong adaptability to extreme environments but may also experience faults.

Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges

no code implementations27 Dec 2023 Qingyao Li, Lingyue Fu, Weiming Zhang, Xianyu Chen, Jingwei Yu, Wei Xia, Weinan Zhang, Ruiming Tang, Yong Yu

Online education platforms, leveraging the internet to distribute education resources, seek to provide convenient education but often fall short in real-time communication with students.

Question Answering

FLIP: Towards Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction

no code implementations30 Oct 2023 Hangyu Wang, Jianghao Lin, Xiangyang Li, Bo Chen, Chenxu Zhu, Ruiming Tang, Weinan Zhang, Yong Yu

Specifically, the masked data of one modality (i. e., tokens or features) has to be recovered with the help of the other modality, which establishes the feature-level interaction and alignment via sufficient mutual information extraction between dual modalities.

Click-Through Rate Prediction Contrastive Learning

ROMO: Retrieval-enhanced Offline Model-based Optimization

1 code implementation11 Oct 2023 Mingcheng Chen, Haoran Zhao, Yuxiang Zhao, Hulei Fan, Hongqiao Gao, Yong Yu, Zheng Tian

Data-driven black-box model-based optimization (MBO) problems arise in a great number of practical application scenarios, where the goal is to find a design over the whole space maximizing a black-box target function based on a static offline dataset.

Retrieval

CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market

1 code implementation8 Sep 2023 JinYuan Wang, Hai Zhao, Zhong Wang, Zeyang Zhu, Jinhao Xie, Yong Yu, Yongjian Fei, Yue Huang, Dawei Cheng

In recent years, great advances in pre-trained language models (PLMs) have sparked considerable research focus and achieved promising performance on the approach of dense passage retrieval, which aims at retrieving relative passages from massive corpus with given questions.

Passage Retrieval Retrieval

CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models

1 code implementation5 Sep 2023 Lingyue Fu, Huacan Chai, Shuang Luo, Kounianhua Du, Weiming Zhang, Longteng Fan, Jiayi Lei, Renting Rui, Jianghao Lin, Yuchen Fang, Yifan Liu, Jingkuan Wang, Siyuan Qi, Kangning Zhang, Weinan Zhang, Yong Yu

With the emergence of Large Language Models (LLMs), there has been a significant improvement in the programming capabilities of models, attracting growing attention from researchers.

Code Generation Multiple-choice

ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation

1 code implementation22 Aug 2023 Jianghao Lin, Rong Shan, Chenxu Zhu, Kounianhua Du, Bo Chen, Shigang Quan, Ruiming Tang, Yong Yu, Weinan Zhang

With large language models (LLMs) achieving remarkable breakthroughs in natural language processing (NLP) domains, LLM-enhanced recommender systems have received much attention and have been actively explored currently.

Data Augmentation Language Modelling +3

Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank

no code implementations5 Aug 2023 Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang

Notice that these ranking metrics do not consider the effects of the contextual dependence among the items in the list, we design a new family of simulation-based ranking metrics, where existing metrics can be regarded as special cases.

Learning-To-Rank

MAP: A Model-agnostic Pretraining Framework for Click-through Rate Prediction

1 code implementation3 Aug 2023 Jianghao Lin, Yanru Qu, Wei Guo, Xinyi Dai, Ruiming Tang, Yong Yu, Weinan Zhang

The large capacity of neural models helps digest such massive amounts of data under the supervised learning paradigm, yet they fail to utilize the substantial data to its full potential, since the 1-bit click signal is not sufficient to guide the model to learn capable representations of features and instances.

Binary Classification Click-Through Rate Prediction +1

Learning Multi-Agent Intention-Aware Communication for Optimal Multi-Order Execution in Finance

no code implementations6 Jul 2023 Yuchen Fang, Zhenggang Tang, Kan Ren, Weiqing Liu, Li Zhao, Jiang Bian, Dongsheng Li, Weinan Zhang, Yong Yu, Tie-Yan Liu

Order execution is a fundamental task in quantitative finance, aiming at finishing acquisition or liquidation for a number of trading orders of the specific assets.

Reinforcement Learning (RL)

Towards Open-World Recommendation with Knowledge Augmentation from Large Language Models

1 code implementation19 Jun 2023 Yunjia Xi, Weiwen Liu, Jianghao Lin, Xiaoling Cai, Hong Zhu, Jieming Zhu, Bo Chen, Ruiming Tang, Weinan Zhang, Rui Zhang, Yong Yu

In this work, we propose an Open-World Knowledge Augmented Recommendation Framework with Large Language Models, dubbed KAR, to acquire two types of external knowledge from LLMs -- the reasoning knowledge on user preferences and the factual knowledge on items.

Music Recommendation Recommendation Systems +1

How Can Recommender Systems Benefit from Large Language Models: A Survey

1 code implementation9 Jun 2023 Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang

In this paper, we conduct a comprehensive survey on this research direction from the perspective of the whole pipeline in real-world recommender systems.

Ethics Feature Engineering +5

Set-to-Sequence Ranking-based Concept-aware Learning Path Recommendation

no code implementations7 Jun 2023 Xianyu Chen, Jian Shen, Wei Xia, Jiarui Jin, Yakun Song, Weinan Zhang, Weiwen Liu, Menghui Zhu, Ruiming Tang, Kai Dong, Dingyin Xia, Yong Yu

Noticing that existing approaches fail to consider the correlations of concepts in the path, we propose a novel framework named Set-to-Sequence Ranking-based Concept-aware Learning Path Recommendation (SRC), which formulates the recommendation task under a set-to-sequence paradigm.

Knowledge Tracing Recommendation Systems

MADiff: Offline Multi-agent Learning with Diffusion Models

1 code implementation27 May 2023 Zhengbang Zhu, Minghuan Liu, Liyuan Mao, Bingyi Kang, Minkai Xu, Yong Yu, Stefano Ermon, Weinan Zhang

To the best of our knowledge, MADiff is the first diffusion-based multi-agent offline RL framework, which behaves as both a decentralized policy and a centralized controller.

Offline RL Trajectory Prediction

Refined Edge Usage of Graph Neural Networks for Edge Prediction

no code implementations25 Dec 2022 Jiarui Jin, Yangkun Wang, Weinan Zhang, Quan Gan, Xiang Song, Yong Yu, Zheng Zhang, David Wipf

However, existing methods lack elaborate design regarding the distinctions between two tasks that have been frequently overlooked: (i) edges only constitute the topology in the node classification task but can be used as both the topology and the supervisions (i. e., labels) in the edge prediction task; (ii) the node classification makes prediction over each individual node, while the edge prediction is determinated by each pair of nodes.

Link Prediction Node Classification

Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer

no code implementations15 Dec 2022 Hang Lai, Weinan Zhang, Xialin He, Chen Yu, Zheng Tian, Yong Yu, Jun Wang

Deep reinforcement learning has recently emerged as an appealing alternative for legged locomotion over multiple terrains by training a policy in physical simulation and then transferring it to the real world (i. e., sim-to-real transfer).

Decision Making

A Bird's-eye View of Reranking: from List Level to Page Level

1 code implementation17 Nov 2022 Yunjia Xi, Jianghao Lin, Weiwen Liu, Xinyi Dai, Weinan Zhang, Rui Zhang, Ruiming Tang, Yong Yu

Moreover, simply applying a shared network for all the lists fails to capture the commonalities and distinctions in user behaviors on different lists.

Recommendation Systems

Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems

no code implementations11 Oct 2022 Zhengbang Zhu, Rongjun Qin, JunJie Huang, Xinyi Dai, Yang Yu, Yong Yu, Weinan Zhang

The increase in the measured performance, however, can have two possible attributions: a better understanding of user preferences, and a more proactive ability to utilize human bounded rationality to seduce user over-consumption.

Benchmarking Sequential Recommendation

Multi-Scale User Behavior Network for Entire Space Multi-Task Learning

no code implementations3 Aug 2022 Jiarui Jin, Xianyu Chen, Weinan Zhang, Yuanbo Chen, Zaifan Jiang, Zekun Zhu, Zhewen Su, Yong Yu

Modelling the user's multiple behaviors is an essential part of modern e-commerce, whose widely adopted application is to jointly optimize click-through rate (CTR) and conversion rate (CVR) predictions.

Multi-Task Learning Survival Analysis

TensorIR: An Abstraction for Automatic Tensorized Program Optimization

2 code implementations9 Jul 2022 Siyuan Feng, Bohan Hou, Hongyi Jin, Wuwei Lin, Junru Shao, Ruihang Lai, Zihao Ye, Lianmin Zheng, Cody Hao Yu, Yong Yu, Tianqi Chen

Finally, we build an end-to-end framework on top of our abstraction to automatically optimize deep learning models for given tensor computation primitives.

BIG-bench Machine Learning

An F-shape Click Model for Information Retrieval on Multi-block Mobile Pages

1 code implementation17 Jun 2022 Lingyue Fu, Jianghao Lin, Weiwen Liu, Ruiming Tang, Weinan Zhang, Rui Zhang, Yong Yu

However, with the development of user interface (UI) design, the layout of displayed items on a result page tends to be multi-block (i. e., multi-list) style instead of a single list, which requires different assumptions to model user behaviors more accurately.

Information Retrieval Retrieval

Multi-Level Interaction Reranking with User Behavior History

1 code implementation20 Apr 2022 Yunjia Xi, Weiwen Liu, Jieming Zhu, Xilong Zhao, Xinyi Dai, Ruiming Tang, Weinan Zhang, Rui Zhang, Yong Yu

MIR combines low-level cross-item interaction and high-level set-to-list interaction, where we view the candidate items to be reranked as a set and the users' behavior history in chronological order as a list.

Recommendation Systems

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization

2 code implementations4 Mar 2022 Minghuan Liu, Zhengbang Zhu, Yuzheng Zhuang, Weinan Zhang, Jianye Hao, Yong Yu, Jun Wang

Recent progress in state-only imitation learning extends the scope of applicability of imitation learning to real-world settings by relieving the need for observing expert actions.

Imitation Learning Transfer Learning

Multi-View Graph Representation for Programming Language Processing: An Investigation into Algorithm Detection

1 code implementation25 Feb 2022 Ting Long, Yutong Xie, Xianyu Chen, Weinan Zhang, Qinxiang Cao, Yong Yu

We thoroughly evaluate our proposed MVG approach in the context of algorithm detection, an important and challenging subfield of PLP.

Who to Watch Next: Two-side Interactive Networks for Live Broadcast Recommendation

no code implementations9 Feb 2022 Jiarui Jin, Xianyu Chen, Yuanbo Chen, Weinan Zhang, Renting Rui, Zaifan Jiang, Zhewen Su, Yong Yu

With the prevalence of live broadcast business nowadays, a new type of recommendation service, called live broadcast recommendation, is widely used in many mobile e-commerce Apps.

Retrieval

Learn over Past, Evolve for Future: Search-based Time-aware Recommendation with Sequential Behavior Data

no code implementations7 Feb 2022 Jiarui Jin, Xianyu Chen, Weinan Zhang, JunJie Huang, Ziming Feng, Yong Yu

More concretely, we first design a search-based module to retrieve a user's relevant historical behaviors, which are then mixed up with her recent records to be fed into a time-aware sequential network for capturing her time-sensitive demands.

Click-Through Rate Prediction

Efficient Policy Space Response Oracles

no code implementations28 Jan 2022 Ming Zhou, Jingxiao Chen, Ying Wen, Weinan Zhang, Yaodong Yang, Yong Yu, Jun Wang

Policy Space Response Oracle methods (PSRO) provide a general solution to learn Nash equilibrium in two-player zero-sum games but suffer from two drawbacks: (1) the computation inefficiency due to the need for consistent meta-game evaluation via simulations, and (2) the exploration inefficiency due to finding the best response against a fixed meta-strategy at every epoch.

Efficient Exploration

Generative Adversarial Exploration for Reinforcement Learning

no code implementations27 Jan 2022 Weijun Hong, Menghui Zhu, Minghuan Liu, Weinan Zhang, Ming Zhou, Yong Yu, Peng Sun

Exploration is crucial for training the optimal reinforcement learning (RL) policy, where the key is to discriminate whether a state visiting is novel.

Generative Adversarial Network Montezuma's Revenge +2

PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation

no code implementations COLING 2022 Juncheng Wan, Jian Yang, Shuming Ma, Dongdong Zhang, Weinan Zhang, Yong Yu, Zhoujun Li

While end-to-end neural machine translation (NMT) has achieved impressive progress, noisy input usually leads models to become fragile and unstable.

Machine Translation NMT +1

QA4PRF: A Question Answering based Framework for Pseudo Relevance Feedback

no code implementations16 Nov 2021 Handong Ma, Jiawei Hou, Chenxu Zhu, Weinan Zhang, Ruiming Tang, Jincai Lai, Jieming Zhu, Xiuqiang He, Yong Yu

Pseudo relevance feedback (PRF) automatically performs query expansion based on top-retrieved documents to better represent the user's information need so as to improve the search results.

Question Answering Semantic Similarity +1

AIM: Automatic Interaction Machine for Click-Through Rate Prediction

1 code implementation5 Nov 2021 Chenxu Zhu, Bo Chen, Weinan Zhang, Jincai Lai, Ruiming Tang, Xiuqiang He, Zhenguo Li, Yong Yu

To address these three issues mentioned above, we propose Automatic Interaction Machine (AIM) with three core components, namely, Feature Interaction Search (FIS), Interaction Function Search (IFS) and Embedding Dimension Search (EDS), to select significant feature interactions, appropriate interaction functions and necessary embedding dimensions automatically in a unified framework.

Click-Through Rate Prediction

Context-aware Reranking with Utility Maximization for Recommendation

no code implementations18 Oct 2021 Yunjia Xi, Weiwen Liu, Xinyi Dai, Ruiming Tang, Weinan Zhang, Qing Liu, Xiuqiang He, Yong Yu

As a critical task for large-scale commercial recommender systems, reranking has shown the potential of improving recommendation results by uncovering mutual influence among items.

counterfactual Graph Attention +2

Why Propagate Alone? Parallel Use of Labels and Features on Graphs

no code implementations ICLR 2022 Yangkun Wang, Jiarui Jin, Weinan Zhang, Yongyi Yang, Jiuhai Chen, Quan Gan, Yong Yu, Zheng Zhang, Zengfeng Huang, David Wipf

In this regard, it has recently been proposed to use a randomly-selected portion of the training labels as GNN inputs, concatenated with the original node features for making predictions on the remaining labels.

Node Property Prediction Property Prediction

Inductive Relation Prediction Using Analogy Subgraph Embeddings

no code implementations ICLR 2022 Jiarui Jin, Yangkun Wang, Kounianhua Du, Weinan Zhang, Zheng Zhang, David Wipf, Yong Yu, Quan Gan

Prevailing methods for relation prediction in heterogeneous graphs aim at learning latent representations (i. e., embeddings) of observed nodes and relations, and thus are limited to the transductive setting where the relation types must be known during training.

Inductive Bias Inductive Relation Prediction +1

Graph-Enhanced Exploration for Goal-oriented Reinforcement Learning

no code implementations ICLR 2022 Jiarui Jin, Sijin Zhou, Weinan Zhang, Tong He, Yong Yu, Rasool Fakoor

Goal-oriented Reinforcement Learning (GoRL) is a promising approach for scaling up RL techniques on sparse reward environments requiring long horizon planning.

Continuous Control graph construction +2

Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

1 code implementation16 Aug 2021 Mingcheng Chen, Zhenghui Wang, Zhiyun Zhao, Weinan Zhang, Xiawei Guo, Jian Shen, Yanru Qu, Jieli Lu, Min Xu, Yu Xu, Tiange Wang, Mian Li, Wei-Wei Tu, Yong Yu, Yufang Bi, Weiqing Wang, Guang Ning

To tackle the above challenges, we employ gradient boosting decision trees (GBDT) to handle data heterogeneity and introduce multi-task learning (MTL) to solve data insufficiency.

Diabetes Prediction Multi-Task Learning

Retrieval & Interaction Machine for Tabular Data Prediction

1 code implementation11 Aug 2021 Jiarui Qin, Weinan Zhang, Rong Su, Zhirong Liu, Weiwen Liu, Ruiming Tang, Xiuqiang He, Yong Yu

Prediction over tabular data is an essential task in many data science applications such as recommender systems, online advertising, medical treatment, etc.

Attribute Click-Through Rate Prediction +2

Learning to Select Cuts for Efficient Mixed-Integer Programming

no code implementations28 May 2021 Zeren Huang, Kerong Wang, Furui Liu, Hui-Ling Zhen, Weinan Zhang, Mingxuan Yuan, Jianye Hao, Yong Yu, Jun Wang

In the online A/B testing of the product planning problems with more than $10^7$ variables and constraints daily, Cut Ranking has achieved the average speedup ratio of 12. 42% over the production solver without any accuracy loss of solution.

Multiple Instance Learning

MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks

1 code implementation13 May 2021 Menghui Zhu, Minghuan Liu, Jian Shen, Zhicheng Zhang, Sheng Chen, Weinan Zhang, Deheng Ye, Yong Yu, Qiang Fu, Wei Yang

In Goal-oriented Reinforcement learning, relabeling the raw goals in past experience to provide agents with hindsight ability is a major solution to the reward sparsity problem.

An Adversarial Imitation Click Model for Information Retrieval

1 code implementation13 Apr 2021 Xinyi Dai, Jianghao Lin, Weinan Zhang, Shuai Li, Weiwen Liu, Ruiming Tang, Xiuqiang He, Jianye Hao, Jun Wang, Yong Yu

Modern information retrieval systems, including web search, ads placement, and recommender systems, typically rely on learning from user feedback.

Imitation Learning Information Retrieval +2

Bag of Tricks for Node Classification with Graph Neural Networks

2 code implementations24 Mar 2021 Yangkun Wang, Jiarui Jin, Weinan Zhang, Yong Yu, Zheng Zhang, David Wipf

Over the past few years, graph neural networks (GNN) and label propagation-based methods have made significant progress in addressing node classification tasks on graphs.

Classification General Classification +2

Universal Trading for Order Execution with Oracle Policy Distillation

no code implementations28 Jan 2021 Yuchen Fang, Kan Ren, Weiqing Liu, Dong Zhou, Weinan Zhang, Jiang Bian, Yong Yu, Tie-Yan Liu

As a fundamental problem in algorithmic trading, order execution aims at fulfilling a specific trading order, either liquidation or acquirement, for a given instrument.

Algorithmic Trading reinforcement-learning +1

Explore with Dynamic Map: Graph Structured Reinforcement Learning

no code implementations1 Jan 2021 Jiarui Jin, Sijin Zhou, Weinan Zhang, Rasool Fakoor, David Wipf, Tong He, Yong Yu, Zheng Zhang, Alex Smola

In reinforcement learning, a map with states and transitions built based on historical trajectories is often helpful in exploration and exploitation.

reinforcement-learning Reinforcement Learning (RL)

Regioned Episodic Reinforcement Learning

no code implementations1 Jan 2021 Jiarui Jin, Cong Chen, Ming Zhou, Weinan Zhang, Rasool Fakoor, David Wipf, Yong Yu, Jun Wang, Alex Smola

Goal-oriented reinforcement learning algorithms are often good at exploration, not exploitation, while episodic algorithms excel at exploitation, not exploration.

reinforcement-learning Reinforcement Learning (RL)

Non-iterative Parallel Text Generation via Glancing Transformer

no code implementations1 Jan 2021 Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Weinan Zhang, Yong Yu, Lei LI

Although non-autoregressive models with one-iteration generation achieves remarkable inference speed-up, they still falls behind their autoregressive counterparts inprediction accuracy.

Language Modelling Text Generation

Improving Knowledge Tracing via Pre-training Question Embeddings

1 code implementation9 Dec 2020 Yunfei Liu, Yang Yang, Xianyu Chen, Jian Shen, Haifeng Zhang, Yong Yu

Knowledge tracing (KT) defines the task of predicting whether students can correctly answer questions based on their historical response.

Knowledge Tracing

Towards Generalized Implementation of Wasserstein Distance in GANs

1 code implementation7 Dec 2020 Minkai Xu, Zhiming Zhou, Guansong Lu, Jian Tang, Weinan Zhang, Yong Yu

Wasserstein GANs (WGANs), built upon the Kantorovich-Rubinstein (KR) duality of Wasserstein distance, is one of the most theoretically sound GAN models.

U-rank: Utility-oriented Learning to Rank with Implicit Feedback

no code implementations1 Nov 2020 Xinyi Dai, Jiawei Hou, Qing Liu, Yunjia Xi, Ruiming Tang, Weinan Zhang, Xiuqiang He, Jun Wang, Yong Yu

To this end, we propose a novel ranking framework called U-rank that directly optimizes the expected utility of the ranking list.

Click-Through Rate Prediction Learning-To-Rank +2

Efficient Projection-Free Algorithms for Saddle Point Problems

no code implementations NeurIPS 2020 Cheng Chen, Luo Luo, Weinan Zhang, Yong Yu

The Frank-Wolfe algorithm is a classic method for constrained optimization problems.

Model-based Policy Optimization with Unsupervised Model Adaptation

1 code implementation NeurIPS 2020 Jian Shen, Han Zhao, Weinan Zhang, Yong Yu

However, due to the potential distribution mismatch between simulated data and real data, this could lead to degraded performance.

Continuous Control Model-based Reinforcement Learning +2

AI Chiller: An Open IoT Cloud Based Machine Learning Framework for the Energy Saving of Building HVAC System via Big Data Analytics on the Fusion of BMS and Environmental Data

no code implementations9 Oct 2020 Yong Yu

Although many research works and projects turn to this direction for energy saving, the application into the optimization problem remains a challenging task.

GIKT: A Graph-based Interaction Model for Knowledge Tracing

3 code implementations13 Sep 2020 Yang Yang, Jian Shen, Yanru Qu, Yunfei Liu, Kerong Wang, Yaoming Zhu, Wei-Nan Zhang, Yong Yu

With the rapid development in online education, knowledge tracing (KT) has become a fundamental problem which traces students' knowledge status and predicts their performance on new questions.

Knowledge Tracing

Bidirectional Model-based Policy Optimization

1 code implementation ICML 2020 Hang Lai, Jian Shen, Wei-Nan Zhang, Yong Yu

Model-based reinforcement learning approaches leverage a forward dynamics model to support planning and decision making, which, however, may fail catastrophically if the model is inaccurate.

Decision Making Model-based Reinforcement Learning +1

An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph

1 code implementation1 Jul 2020 Jiarui Jin, Jiarui Qin, Yuchen Fang, Kounianhua Du, Wei-Nan Zhang, Yong Yu, Zheng Zhang, Alexander J. Smola

To the best of our knowledge, this is the first work providing an efficient neighborhood-based interaction model in the HIN-based recommendations.

Recommendation Systems

Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning

no code implementations18 Jun 2020 Sijin Zhou, Xinyi Dai, Haokun Chen, Wei-Nan Zhang, Kan Ren, Ruiming Tang, Xiuqiang He, Yong Yu

Interactive recommender system (IRS) has drawn huge attention because of its flexible recommendation strategy and the consideration of optimal long-term user experiences.

Decision Making Recommendation Systems +3

User Behavior Retrieval for Click-Through Rate Prediction

1 code implementation28 May 2020 Jiarui Qin, Wei-Nan Zhang, Xin Wu, Jiarui Jin, Yuchen Fang, Yong Yu

These retrieved behaviors are then fed into a deep model to make the final prediction instead of simply using the most recent ones.

Click-Through Rate Prediction Retrieval

A Deep Recurrent Survival Model for Unbiased Ranking

1 code implementation30 Apr 2020 Jiarui Jin, Yuchen Fang, Wei-Nan Zhang, Kan Ren, Guorui Zhou, Jian Xu, Yong Yu, Jun Wang, Xiaoqiang Zhu, Kun Gai

Position bias is a critical problem in information retrieval when dealing with implicit yet biased user feedback data.

Information Retrieval Position +2

Infomax Neural Joint Source-Channel Coding via Adversarial Bit Flip

1 code implementation3 Apr 2020 Yuxuan Song, Minkai Xu, Lantao Yu, Hao Zhou, Shuo Shao, Yong Yu

In this paper, motivated by the inherent connections between neural joint source-channel coding and discrete representation learning, we propose a novel regularization method called Infomax Adversarial-Bit-Flip (IABF) to improve the stability and robustness of the neural joint source-channel coding scheme.

Representation Learning

AutoFIS: Automatic Feature Interaction Selection in Factorization Models for Click-Through Rate Prediction

4 code implementations25 Mar 2020 Bin Liu, Chenxu Zhu, Guilin Li, Wei-Nan Zhang, Jincai Lai, Ruiming Tang, Xiuqiang He, Zhenguo Li, Yong Yu

By implementing a regularized optimizer over the architecture parameters, the model can automatically identify and remove the redundant feature interactions during the training process of the model.

Click-Through Rate Prediction Recommendation Systems

Multi-Agent Interactions Modeling with Correlated Policies

1 code implementation ICLR 2020 Minghuan Liu, Ming Zhou, Wei-Nan Zhang, Yuzheng Zhuang, Jun Wang, Wulong Liu, Yong Yu

In this paper, we cast the multi-agent interactions modeling problem into a multi-agent imitation learning framework with explicit modeling of correlated policies by approximating opponents' policies, which can recover agents' policies that can regenerate similar interactions.

Imitation Learning

Improving Unsupervised Domain Adaptation with Variational Information Bottleneck

no code implementations21 Nov 2019 Yuxuan Song, Lantao Yu, Zhangjie Cao, Zhiming Zhou, Jian Shen, Shuo Shao, Wei-Nan Zhang, Yong Yu

Domain adaptation aims to leverage the supervision signal of source domain to obtain an accurate model for target domain, where the labels are not available.

Unsupervised Domain Adaptation

Exploring Diverse Expressions for Paraphrase Generation

no code implementations IJCNLP 2019 Lihua Qian, Lin Qiu, Wei-Nan Zhang, Xin Jiang, Yong Yu

Paraphrasing plays an important role in various natural language processing (NLP) tasks, such as question answering, information retrieval and sentence simplification.

Information Retrieval Paraphrase Generation +4

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning

no code implementations10 Sep 2019 Liheng Chen, Hongyi Guo, Yali Du, Fei Fang, Haifeng Zhang, Yaoming Zhu, Ming Zhou, Wei-Nan Zhang, Qing Wang, Yong Yu

Although existing works formulate this problem into a centralized learning with decentralized execution framework, which avoids the non-stationary problem in training, their decentralized execution paradigm limits the agents' capability to coordinate.

Multi-agent Reinforcement Learning reinforcement-learning +1

Towards Making the Most of BERT in Neural Machine Translation

2 code implementations15 Aug 2019 Jiacheng Yang, Mingxuan Wang, Hao Zhou, Chengqi Zhao, Yong Yu, Wei-Nan Zhang, Lei LI

Our experiments in machine translation show CTNMT gains of up to 3 BLEU score on the WMT14 English-German language pair which even surpasses the previous state-of-the-art pre-training aided NMT by 1. 4 BLEU score.

Machine Translation NMT +2

Urban Traffic Prediction from Spatio-Temporal Data Using Deep Meta Learning

1 code implementation KDD '19 2019 Zheyi Pan, Yuxuan Liang, Weifeng Wang, Yong Yu, Yu Zheng, Junbo Zhang

Predicting urban traffic is of great importance to intelligent transportation systems and public safety, yet is very challenging because of two aspects: 1) complex spatio-temporal correlations of urban traffic, including spatial correlations between locations along with temporal correlations among timestamps; 2) diversity of such spatiotemporal correlations, which vary from location to location and depend on the surrounding geographical information, e. g., points of interests and road networks.

Graph Attention Meta-Learning +3

Dynamically Fused Graph Network for Multi-hop Reasoning

1 code implementation ACL 2019 Yunxuan Xiao, Yanru Qu, Lin Qiu, Hao Zhou, Lei LI, Wei-Nan Zhang, Yong Yu

However, many difficult questions require multiple supporting evidence from scattered text among two or more documents.

Question Answering

CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

1 code implementation13 May 2019 Huichu Zhang, Siyuan Feng, Chang Liu, Yaoyao Ding, Yichen Zhu, Zihan Zhou, Wei-Nan Zhang, Yong Yu, Haiming Jin, Zhenhui Li

The most commonly used open-source traffic simulator SUMO is, however, not scalable to large road network and large traffic flow, which hinders the study of reinforcement learning on traffic scenarios.

Multi-agent Reinforcement Learning reinforcement-learning +1

Deep Landscape Forecasting for Real-time Bidding Advertising

2 code implementations7 May 2019 Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Wei-Nan Zhang, Yong Yu

The problem is formulated as to forecast the probability distribution of market price for each ad auction.

Survival Analysis

Lifelong Sequential Modeling with Personalized Memorization for User Response Prediction

1 code implementation2 May 2019 Kan Ren, Jiarui Qin, Yuchen Fang, Wei-Nan Zhang, Lei Zheng, Weijie Bian, Guorui Zhou, Jian Xu, Yong Yu, Xiaoqiang Zhu, Kun Gai

In order to tackle these challenges, in this paper, we propose a Hierarchical Periodic Memory Network for lifelong sequential modeling with personalized memorization of sequential patterns for each user.

Memorization

Towards Efficient and Unbiased Implementation of Lipschitz Continuity in GANs

1 code implementation2 Apr 2019 Zhiming Zhou, Jian Shen, Yuxuan Song, Wei-Nan Zhang, Yong Yu

Lipschitz continuity recently becomes popular in generative adversarial networks (GANs).

Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space

1 code implementation4 Mar 2019 Zhou Fan, Rui Su, Wei-Nan Zhang, Yong Yu

In this paper we propose a hybrid architecture of actor-critic algorithms for reinforcement learning in parameterized action space, which consists of multiple parallel sub-actor networks to decompose the structured action space into simpler action spaces along with a critic network to guide the training of all sub-actor networks.

reinforcement-learning Reinforcement Learning (RL)

Lipschitz Generative Adversarial Nets

1 code implementation15 Feb 2019 Zhiming Zhou, Jiadong Liang, Yuxuan Song, Lantao Yu, Hongwei Wang, Wei-Nan Zhang, Yong Yu, Zhihua Zhang

By contrast, Wasserstein GAN (WGAN), where the discriminative function is restricted to 1-Lipschitz, does not suffer from such a gradient uninformativeness problem.

Informativeness

Guiding the One-to-one Mapping in CycleGAN via Optimal Transport

no code implementations15 Nov 2018 Guansong Lu, Zhiming Zhou, Yuxuan Song, Kan Ren, Yong Yu

CycleGAN is capable of learning a one-to-one mapping between two data distributions without paired examples, achieving the task of unsupervised data translation.

Translation

Layout Design for Intelligent Warehouse by Evolution with Fitness Approximation

no code implementations14 Nov 2018 Haifeng Zhang, Zilong Guo, Han Cai, Chris Wang, Wei-Nan Zhang, Yong Yu, Wenxin Li, Jun Wang

With the rapid growth of the express industry, intelligent warehouses that employ autonomous robots for carrying parcels have been widely used to handle the vast express volume.

Layout Design

Large-scale Interactive Recommendation with Tree-structured Policy Gradient

no code implementations14 Nov 2018 Haokun Chen, Xinyi Dai, Han Cai, Wei-Nan Zhang, Xuejian Wang, Ruiming Tang, Yuzhou Zhang, Yong Yu

Reinforcement learning (RL) has recently been introduced to interactive recommender systems (IRS) because of its nature of learning from dynamic interactions and planning for long-run performance.

Clustering Recommendation Systems +1

HyperST-Net: Hypernetworks for Spatio-Temporal Forecasting

no code implementations28 Sep 2018 Zheyi Pan, Yuxuan Liang, Junbo Zhang, Xiuwen Yi, Yong Yu, Yu Zheng

In this paper, we propose a general framework (HyperST-Net) based on hypernetworks for deep ST models.

Spatio-Temporal Forecasting Time Series +1

Sampled in Pairs and Driven by Text: A New Graph Embedding Framework

no code implementations12 Sep 2018 Liheng Chen, Yanru Qu, Zhenghui Wang, Lin Qiu, Wei-Nan Zhang, Ken Chen, Shaodian Zhang, Yong Yu

TGE-PS uses Pairs Sampling (PS) to improve the sampling strategy of RW, being able to reduce ~99% training samples while preserving competitive performance.

Graph Embedding Link Prediction

Deep Recurrent Survival Analysis

1 code implementation7 Sep 2018 Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Wei-Nan Zhang, Lin Qiu, Yong Yu

By capturing the time dependency through modeling the conditional probability of the event for each sample, our method predicts the likelihood of the true event occurrence and estimates the survival rate over time, i. e., the probability of the non-occurrence of the event, for the censored data.

Survival Analysis

Learning Multi-touch Conversion Attribution with Dual-attention Mechanisms for Online Advertising

1 code implementation11 Aug 2018 Kan Ren, Yuchen Fang, Wei-Nan Zhang, Shuhao Liu, Jiajun Li, Ya zhang, Yong Yu, Jun Wang

To achieve this, we utilize sequence-to-sequence prediction for user clicks, and combine both post-view and post-click attribution patterns together for the final conversion estimation.

Understanding the Effectiveness of Lipschitz-Continuity in Generative Adversarial Nets

1 code implementation2 Jul 2018 Zhiming Zhou, Yuxuan Song, Lantao Yu, Hongwei Wang, Jiadong Liang, Wei-Nan Zhang, Zhihua Zhang, Yong Yu

In this paper, we investigate the underlying factor that leads to failure and success in the training of GANs.

valid

Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data

8 code implementations1 Jul 2018 Yanru Qu, Bohui Fang, Wei-Nan Zhang, Ruiming Tang, Minzhe Niu, Huifeng Guo, Yong Yu, Xiuqiang He

User response prediction is a crucial component for personalized information retrieval and filtering scenarios, such as recommender system and web search.

Click-Through Rate Prediction Feature Engineering +3

QA4IE: A Question Answering based Framework for Information Extraction

1 code implementation10 Apr 2018 Lin Qiu, Hao Zhou, Yanru Qu, Wei-Nan Zhang, Suoheng Li, Shu Rong, Dongyu Ru, Lihua Qian, Kewei Tu, Yong Yu

Information Extraction (IE) refers to automatically extracting structured relation tuples from unstructured texts.

Question Answering Relation +2

Neural Text Generation: Past, Present and Beyond

no code implementations15 Mar 2018 Sidi Lu, Yaoming Zhu, Wei-Nan Zhang, Jun Wang, Yong Yu

This paper presents a systematic survey on recent development of neural text generation models.

Benchmarking reinforcement-learning +2

Bidding Machine: Learning to Bid for Directly Optimizing Profits in Display Advertising

no code implementations1 Mar 2018 Kan Ren, Wei-Nan Zhang, Ke Chang, Yifei Rong, Yong Yu, Jun Wang

From the learning perspective, we show that the bidding machine can be updated smoothly with both offline periodical batch or online sequential training schemes.

BIG-bench Machine Learning

Unsupervised Deep Domain Adaptation for Pedestrian Detection

no code implementations9 Feb 2018 Lihang Liu, Weiyao Lin, Lisheng Wu, Yong Yu, Michael Ying Yang

This paper addresses the problem of unsupervised domain adaptation on the task of pedestrian detection in crowded scenes.

Pedestrian Detection Unsupervised Domain Adaptation

Texygen: A Benchmarking Platform for Text Generation Models

1 code implementation6 Feb 2018 Yaoming Zhu, Sidi Lu, Lei Zheng, Jiaxian Guo, Wei-Nan Zhang, Jun Wang, Yong Yu

We introduce Texygen, a benchmarking platform to support research on open-domain text generation models.

Benchmarking Text Generation

Supervised Hashing based on Energy Minimization

no code implementations2 Dec 2017 Zihao Hu, Xiyi Luo, Hongtao Lu, Yong Yu

Recently, supervised hashing methods have attracted much attention since they can optimize retrieval speed and storage cost while preserving semantic information.

Retrieval

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence

3 code implementations2 Dec 2017 Lianmin Zheng, Jiacheng Yang, Han Cai, Wei-Nan Zhang, Jun Wang, Yong Yu

Unlike previous research platforms on single or multi-agent reinforcement learning, MAgent focuses on supporting the tasks and the applications that require hundreds to millions of agents.

Multi-agent Reinforcement Learning reinforcement-learning +1

Face Transfer with Generative Adversarial Network

no code implementations17 Oct 2017 Runze Xu, Zhiming Zhou, Wei-Nan Zhang, Yong Yu

Face transfer animates the facial performances of the character in the target video by a source actor.

Face Transfer Generative Adversarial Network

Long Text Generation via Adversarial Training with Leaked Information

6 code implementations24 Sep 2017 Jiaxian Guo, Sidi Lu, Han Cai, Wei-Nan Zhang, Yong Yu, Jun Wang

Automatically generating coherent and semantically meaningful text has many applications in machine translation, dialogue systems, image captioning, etc.

Sentence Text Generation

A Study of AI Population Dynamics with Million-agent Reinforcement Learning

no code implementations13 Sep 2017 Yaodong Yang, Lantao Yu, Yiwei Bai, Jun Wang, Wei-Nan Zhang, Ying Wen, Yong Yu

We conduct an empirical study on discovering the ordered collective dynamics obtained by a population of intelligence agents, driven by million-agent reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Efficient Architecture Search by Network Transformation

3 code implementations16 Jul 2017 Han Cai, Tianyao Chen, Wei-Nan Zhang, Yong Yu, Jun Wang

Techniques for automatically designing deep neural network architectures such as reinforcement learning based approaches have recently shown promising results.

Image Classification Neural Architecture Search +2

Learning to Design Games: Strategic Environments in Reinforcement Learning

no code implementations5 Jul 2017 Haifeng Zhang, Jun Wang, Zhiming Zhou, Wei-Nan Zhang, Ying Wen, Yong Yu, Wenxin Li

In typical reinforcement learning (RL), the environment is assumed given and the goal of the learning is to identify an optimal policy for the agent taking actions through its interactions with the environment.

reinforcement-learning Reinforcement Learning (RL)

Wasserstein Distance Guided Representation Learning for Domain Adaptation

8 code implementations5 Jul 2017 Jian Shen, Yanru Qu, Wei-Nan Zhang, Yong Yu

Inspired by Wasserstein GAN, in this paper we propose a novel approach to learn domain invariant feature representations, namely Wasserstein Distance Guided Representation Learning (WDGRL).

Domain Adaptation General Classification +2

Real-Time Bidding by Reinforcement Learning in Display Advertising

1 code implementation10 Jan 2017 Han Cai, Kan Ren, Wei-Nan Zhang, Kleanthis Malialis, Jun Wang, Yong Yu, Defeng Guo

In this paper, we formulate the bid decision process as a reinforcement learning problem, where the state space is represented by the auction information and the campaign's real-time parameters, while an action is the bid price to set.

reinforcement-learning Reinforcement Learning (RL)

Product-based Neural Networks for User Response Prediction

11 code implementations1 Nov 2016 Yanru Qu, Han Cai, Kan Ren, Wei-Nan Zhang, Yong Yu, Ying Wen, Jun Wang

Predicting user responses, such as clicks and conversions, is of great importance and has found its usage in many Web applications including recommender systems, web search and online advertising.

Click-Through Rate Prediction Recommendation Systems

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

23 code implementations18 Sep 2016 Lantao Yu, Wei-Nan Zhang, Jun Wang, Yong Yu

As a new way of training generative models, Generative Adversarial Nets (GAN) that uses a discriminative model to guide the training of the generative model has enjoyed considerable success in generating real-valued data.

Reinforcement Learning (RL) Text Generation

A Graph Traversal Based Approach to Answer Non-Aggregation Questions Over DBpedia

no code implementations16 Oct 2015 Chenhao Zhu, Kan Ren, Xuan Liu, Haofen Wang, Yiding Tian, Yong Yu

We present a question answering system over DBpedia, filling the gap between user information needs expressed in natural language and a structured query interface expressed in SPARQL over the underlying knowledge base (KB).

Question Answering

A Latent Clothing Attribute Approach for Human Pose Estimation

no code implementations16 Nov 2014 Weipeng Zhang, Jie Shen, Guangcan Liu, Yong Yu

Unlike previous approaches, our approach models the clothing attributes as latent variables and thus requires no explicit labeling for the clothing attributes.

Action Recognition Attribute +3

A Parallel and Efficient Algorithm for Learning to Match

no code implementations22 Oct 2014 Jingbo Shang, Tianqi Chen, Hang Li, Zhengdong Lu, Yong Yu

In this paper, we tackle this challenge with a novel parallel and efficient algorithm for feature-based matrix factorization.

Collaborative Filtering Link Prediction

Unified Structured Learning for Simultaneous Human Pose Estimation and Garment Attribute Classification

no code implementations19 Apr 2014 Jie Shen, Guangcan Liu, Jia Chen, Yuqiang Fang, Jianbin Xie, Yong Yu, Shuicheng Yan

In this paper, we utilize structured learning to simultaneously address two intertwined problems: human pose estimation (HPE) and garment attribute classification (GAC), which are valuable for a variety of computer vision and multimedia applications.

Attribute General Classification +1

Feature-Based Matrix Factorization

no code implementations11 Sep 2011 Tianqi Chen, Zhao Zheng, Qiuxia Lu, Weinan Zhang, Yong Yu

Recommender system has been more and more popular and widely used in many applications recently.

Recommendation Systems

Cannot find the paper you are looking for? You can Submit a new open access paper.