Search Results for author: Yong Yu

Found 141 papers, 73 papers with code

Nested Named Entity Recognition with Span-level Graphs

no code implementations • ACL 2022 • Juncheng Wan, Dongyu Ru, Weinan Zhang, Yong Yu

In this work, we try to improve the span representation by utilizing retrieval-based span-level graphs, connecting spans and entities in the training data based on n-gram features.

Named Entity Recognition +3

DRepMRec: A Dual Representation Learning Framework for Multimodal Recommendation

no code implementations • 17 Apr 2024 • Kangning Zhang, Yingjie Qin, Ruilong Su, Yifan Liu, Jiarui Jin, Weinan Zhang, Yong Yu

After obtaining separate behavior and modal representations, we design a Behavior-Modal Alignment Module (BMA) to align and fuse the dual representations to solve the misalignment problem.

Multimodal Recommendation Representation Learning

Recall-Augmented Ranking: Enhancing Click-Through Rate Prediction Accuracy with Cross-Stage Data

no code implementations • 15 Apr 2024 • JunJie Huang, Guohao Cai, Jieming Zhu, Zhenhua Dong, Ruiming Tang, Weinan Zhang, Yong Yu

RAR consists of two key sub-modules, which synergistically gather information from a vast pool of look-alike users and recall items, resulting in enriched user representations.

Click-Through Rate Prediction

Emerging Platforms Meet Emerging LLMs: A Year-Long Journey of Top-Down Development

no code implementations • 14 Apr 2024 • Siyuan Feng, Jiawei Liu, Ruihang Lai, Charlie F. Ruan, Yong Yu, Lingming Zhang, Tianqi Chen

While a traditional bottom-up development pipeline fails to close the gap in a timely manner, we introduce TapML, a top-down approach and tooling designed to streamline the deployment of ML systems on diverse platforms, optimized for developer productivity.

M-scan: A Multi-Scenario Causal-driven Adaptive Network for Recommendation

no code implementations • 11 Apr 2024 • Jiachen Zhu, Yichao Wang, Jianghao Lin, Jiarui Qin, Ruiming Tang, Weinan Zhang, Yong Yu

Furthermore, through causal graph analysis, we have discovered that the scenario itself directly influences click behavior, yet existing approaches directly incorporate click behaviors from other scenarios when training the model for the current scenario, leading to prediction biases.

Counterfactual Inference

Play to Your Strengths: Collaborative Intelligence of Conventional Recommender Models and Large Language Models

no code implementations • 25 Mar 2024 • Yunjia Xi, Weiwen Liu, Jianghao Lin, Chuhan Wu, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu

The rise of large language models (LLMs) has opened new opportunities in Recommender Systems (RSs) by enhancing user behavior modeling and content understanding.

Language Modelling Large Language Model +1

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

1 code implementation • 10 Mar 2024 • Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang

Among these works, many of them utilize in-context examples to achieve generalization without the need for fine-tuning, while few of them have considered the problem of how to select and effectively utilize these examples.

Language Modelling Large Language Model +1

Looking Ahead to Avoid Being Late: Solving Hard-Constrained Traveling Salesman Problem

no code implementations • 8 Mar 2024 • Jingxiao Chen, Ziqin Gong, Minghuan Liu, Jun Wang, Yong Yu, Weinan Zhang

To overcome this problem and handle hard constraints effectively, we propose a novel learning-based method that uses looking-ahead information as a feature to improve the legality of TSP with Time Windows (TSPTW) solutions.

Traveling Salesman Problem

Towards Efficient and Effective Unlearning of Large Language Models for Recommendation

1 code implementation • 6 Mar 2024 • Hangyu Wang, Jianghao Lin, Bo Chen, Yang Yang, Ruiming Tang, Weinan Zhang, Yong Yu

However, in order to protect user privacy and optimize utility, it is also crucial for LLMRec to intentionally forget specific user data, which is generally referred to as recommendation unlearning.

World Knowledge

Offline Fictitious Self-Play for Competitive Games

no code implementations • 29 Feb 2024 • Jingxiao Chen, Weiji Xie, Weinan Zhang, Yong Yu, Ying Wen

First, without knowledge of the game structure, it is impossible to interact with the opponents and conduct self-play, a major learning paradigm for competitive games.

Offline RL Reinforcement Learning (RL)

InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization

no code implementations • 23 Jan 2024 • Jiarui Jin, Zexue He, Mengyue Yang, Weinan Zhang, Yong Yu, Jun Wang, Julian McAuley

Subsequently, we minimize the mutual information between the observation estimation and the relevance estimation conditioned on the input features.

Learning-To-Rank Recommendation Systems

D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems

no code implementations • 21 Jan 2024 • Jiarui Qin, Weiwen Liu, Ruiming Tang, Weinan Zhang, Yong Yu

A personalized knowledge adaptation unit is devised to effectively exploit the information from the knowledge base by adapting the retrieved knowledge to the target samples.

Recommendation Systems

Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios

1 code implementation • 29 Dec 2023 • Xinyuan Wu, Wentao Dong, Hang Lai, Yong Yu, Ying Wen

Quadruped robots have strong adaptability to extreme environments but may also experience faults.

Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges

no code implementations • 27 Dec 2023 • Qingyao Li, Lingyue Fu, Weiming Zhang, Xianyu Chen, Jingwei Yu, Wei Xia, Weinan Zhang, Ruiming Tang, Yong Yu

Online education platforms, leveraging the internet to distribute education resources, seek to provide convenient education but often fall short in real-time communication with students.

Question Answering

FLIP: Towards Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction

no code implementations • 30 Oct 2023 • Hangyu Wang, Jianghao Lin, Xiangyang Li, Bo Chen, Chenxu Zhu, Ruiming Tang, Weinan Zhang, Yong Yu

Specifically, the masked data of one modality (i.e., tokens or features) has to be recovered with the help of the other modality, which establishes the feature-level interaction and alignment via sufficient mutual information extraction between dual modalities.

Click-Through Rate Prediction Contrastive Learning

ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction

no code implementations • 13 Oct 2023 • Jianghao Lin, Bo Chen, Hangyu Wang, Yunjia Xi, Yanru Qu, Xinyi Dai, Kangning Zhang, Ruiming Tang, Yong Yu, Weinan Zhang

Traditional CTR models convert the multi-field categorical data into ID features via one-hot encoding, and extract the collaborative signals among features.

Click-Through Rate Prediction Language Modelling +1

GMOCAT: A Graph-Enhanced Multi-Objective Method for Computerized Adaptive Testing

1 code implementation • 11 Oct 2023 • Hangyu Wang, Ting Long, Liang Yin, Weinan Zhang, Wei Xia, Qichen Hong, Dingyin Xia, Ruiming Tang, Yong Yu

Besides, the students' response records contain valuable relational information between questions and knowledge concepts.

Multi-Objective Reinforcement Learning

ROMO: Retrieval-enhanced Offline Model-based Optimization

1 code implementation • 11 Oct 2023 • Mingcheng Chen, Haoran Zhao, Yuxiang Zhao, Hulei Fan, Hongqiao Gao, Yong Yu, Zheng Tian

Data-driven black-box model-based optimization (MBO) problems arise in a great number of practical application scenarios, where the goal is to find a design over the whole space maximizing a black-box target function based on a static offline dataset.

Retrieval

CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market

1 code implementation • 8 Sep 2023 • JinYuan Wang, Hai Zhao, Zhong Wang, Zeyang Zhu, Jinhao Xie, Yong Yu, Yongjian Fei, Yue Huang, Dawei Cheng

In recent years, great advances in pre-trained language models (PLMs) have sparked considerable research interest and achieved promising performance in dense passage retrieval, which aims to retrieve relevant passages from a massive corpus for given questions.

Passage Retrieval Retrieval

CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models

1 code implementation • 5 Sep 2023 • Lingyue Fu, Huacan Chai, Shuang Luo, Kounianhua Du, Weiming Zhang, Longteng Fan, Jiayi Lei, Renting Rui, Jianghao Lin, Yuchen Fang, Yifan Liu, Jingkuan Wang, Siyuan Qi, Kangning Zhang, Weinan Zhang, Yong Yu

With the emergence of Large Language Models (LLMs), there has been a significant improvement in the programming capabilities of models, attracting growing attention from researchers.

Code Generation Multiple-choice

ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation

1 code implementation • 22 Aug 2023 • Jianghao Lin, Rong Shan, Chenxu Zhu, Kounianhua Du, Bo Chen, Shigang Quan, Ruiming Tang, Yong Yu, Weinan Zhang

With large language models (LLMs) achieving remarkable breakthroughs in natural language processing (NLP) domains, LLM-enhanced recommender systems have received much attention and are being actively explored.

Data Augmentation Language Modelling +3

Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank

no code implementations • 5 Aug 2023 • Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang

Noticing that these ranking metrics do not consider the effects of the contextual dependence among the items in the list, we design a new family of simulation-based ranking metrics, where existing metrics can be regarded as special cases.

Learning-To-Rank

MAP: A Model-agnostic Pretraining Framework for Click-through Rate Prediction

1 code implementation • 3 Aug 2023 • Jianghao Lin, Yanru Qu, Wei Guo, Xinyi Dai, Ruiming Tang, Yong Yu, Weinan Zhang

The large capacity of neural models helps digest such massive amounts of data under the supervised learning paradigm, yet they fail to utilize the substantial data to its full potential, since the 1-bit click signal is not sufficient to guide the model to learn capable representations of features and instances.

Binary Classification Click-Through Rate Prediction +1

Learning Multi-Agent Intention-Aware Communication for Optimal Multi-Order Execution in Finance

no code implementations • 6 Jul 2023 • Yuchen Fang, Zhenggang Tang, Kan Ren, Weiqing Liu, Li Zhao, Jiang Bian, Dongsheng Li, Weinan Zhang, Yong Yu, Tie-Yan Liu

Order execution is a fundamental task in quantitative finance, aiming at finishing acquisition or liquidation for a number of trading orders of specific assets.

Reinforcement Learning (RL)

Towards Open-World Recommendation with Knowledge Augmentation from Large Language Models

1 code implementation • 19 Jun 2023 • Yunjia Xi, Weiwen Liu, Jianghao Lin, Xiaoling Cai, Hong Zhu, Jieming Zhu, Bo Chen, Ruiming Tang, Weinan Zhang, Rui Zhang, Yong Yu

In this work, we propose an Open-World Knowledge Augmented Recommendation Framework with Large Language Models, dubbed KAR, to acquire two types of external knowledge from LLMs -- the reasoning knowledge on user preferences and the factual knowledge on items.

Music Recommendation Recommendation Systems +1

How Can Recommender Systems Benefit from Large Language Models: A Survey

1 code implementation • 9 Jun 2023 • Jianghao Lin, Xinyi Dai, Yunjia Xi, Weiwen Liu, Bo Chen, Hao Zhang, Yong liu, Chuhan Wu, Xiangyang Li, Chenxu Zhu, Huifeng Guo, Yong Yu, Ruiming Tang, Weinan Zhang

In this paper, we conduct a comprehensive survey on this research direction from the perspective of the whole pipeline in real-world recommender systems.

Ethics Feature Engineering +5

Set-to-Sequence Ranking-based Concept-aware Learning Path Recommendation

no code implementations • 7 Jun 2023 • Xianyu Chen, Jian Shen, Wei Xia, Jiarui Jin, Yakun Song, Weinan Zhang, Weiwen Liu, Menghui Zhu, Ruiming Tang, Kai Dong, Dingyin Xia, Yong Yu

Noticing that existing approaches fail to consider the correlations of concepts in the path, we propose a novel framework named Set-to-Sequence Ranking-based Concept-aware Learning Path Recommendation (SRC), which formulates the recommendation task under a set-to-sequence paradigm.

Knowledge Tracing Recommendation Systems

MADiff: Offline Multi-agent Learning with Diffusion Models

1 code implementation • 27 May 2023 • Zhengbang Zhu, Minghuan Liu, Liyuan Mao, Bingyi Kang, Minkai Xu, Yong Yu, Stefano Ermon, Weinan Zhang

To the best of our knowledge, MADiff is the first diffusion-based multi-agent offline RL framework, which behaves as both a decentralized policy and a centralized controller.

Offline RL Trajectory Prediction

Refined Edge Usage of Graph Neural Networks for Edge Prediction

no code implementations • 25 Dec 2022 • Jiarui Jin, Yangkun Wang, Weinan Zhang, Quan Gan, Xiang Song, Yong Yu, Zheng Zhang, David Wipf

However, existing methods lack elaborate design regarding the distinctions between two tasks that have been frequently overlooked: (i) edges only constitute the topology in the node classification task but can be used as both the topology and the supervisions (i.e., labels) in the edge prediction task; (ii) the node classification makes prediction over each individual node, while the edge prediction is determined by each pair of nodes.

Link Prediction Node Classification

Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer

no code implementations • 15 Dec 2022 • Hang Lai, Weinan Zhang, Xialin He, Chen Yu, Zheng Tian, Yong Yu, Jun Wang

Deep reinforcement learning has recently emerged as an appealing alternative for legged locomotion over multiple terrains by training a policy in physical simulation and then transferring it to the real world (i.e., sim-to-real transfer).

Decision Making

A Bird's-eye View of Reranking: from List Level to Page Level

1 code implementation • 17 Nov 2022 • Yunjia Xi, Jianghao Lin, Weiwen Liu, Xinyi Dai, Weinan Zhang, Rui Zhang, Ruiming Tang, Yong Yu

Moreover, simply applying a shared network for all the lists fails to capture the commonalities and distinctions in user behaviors on different lists.

Recommendation Systems

RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow

no code implementations • 7 Nov 2022 • Zhengbang Zhu, Shenyu Zhang, Yuzheng Zhuang, Yuecheng Liu, Minghuan Liu, Liyuan Mao, Ziqin Gong, Shixiong Kai, Qiang Gu, Bin Wang, Siyuan Cheng, Xinyu Wang, Jianye Hao, Yong Yu

High-quality traffic flow generation is the core module in building simulators for autonomous driving.

Autonomous Driving

Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems

no code implementations • 11 Oct 2022 • Zhengbang Zhu, Rongjun Qin, JunJie Huang, Xinyi Dai, Yang Yu, Yong Yu, Weinan Zhang

The increase in the measured performance, however, can have two possible attributions: a better understanding of user preferences, and a more proactive ability to utilize human bounded rationality to seduce user over-consumption.

Benchmarking Sequential Recommendation

Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning

1 code implementation • 18 Sep 2022 • Hua Wei, Jingxiao Chen, Xiyang Ji, Hongyang Qin, Minwen Deng, Siqin Li, Liang Wang, Weinan Zhang, Yong Yu, Lin Liu, Lanxiao Huang, Deheng Ye, Qiang Fu, Wei Yang

Compared to other environments studied in most previous work, ours presents new generalization challenges for competitive reinforcement learning.

Reinforcement Learning (RL)

Multi-Scale User Behavior Network for Entire Space Multi-Task Learning

no code implementations • 3 Aug 2022 • Jiarui Jin, Xianyu Chen, Weinan Zhang, Yuanbo Chen, Zaifan Jiang, Zekun Zhu, Zhewen Su, Yong Yu

Modelling the user's multiple behaviors is an essential part of modern e-commerce, whose widely adopted application is to jointly optimize click-through rate (CTR) and conversion rate (CVR) predictions.

Multi-Task Learning Survival Analysis

Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning

no code implementations • 26 Jul 2022 • Zeren Huang, WenHao Chen, Weinan Zhang, Chuhan Shi, Furui Liu, Hui-Ling Zhen, Mingxuan Yuan, Jianye Hao, Yong Yu, Jun Wang

Deriving a good variable selection strategy in branch-and-bound is essential for the efficiency of modern mixed-integer programming (MIP) solvers.

Decision Making Reinforcement Learning (RL) +1

TensorIR: An Abstraction for Automatic Tensorized Program Optimization

2 code implementations • 9 Jul 2022 • Siyuan Feng, Bohan Hou, Hongyi Jin, Wuwei Lin, Junru Shao, Ruihang Lai, Zihao Ye, Lianmin Zheng, Cody Hao Yu, Yong Yu, Tianqi Chen

Finally, we build an end-to-end framework on top of our abstraction to automatically optimize deep learning models for given tensor computation primitives.

BIG-bench Machine Learning

An F-shape Click Model for Information Retrieval on Multi-block Mobile Pages

1 code implementation • 17 Jun 2022 • Lingyue Fu, Jianghao Lin, Weiwen Liu, Ruiming Tang, Weinan Zhang, Rui Zhang, Yong Yu

However, with the development of user interface (UI) design, the layout of displayed items on a result page tends to be multi-block (i.e., multi-list) style instead of a single list, which requires different assumptions to model user behaviors more accurately.

Information Retrieval Retrieval

A Graph-Enhanced Click Model for Web Search

1 code implementation • Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval 2021 • Jianghao Lin, Weiwen Liu, Xinyi Dai, Weinan Zhang, Shuai Li, Ruiming Tang, Xiuqiang He, Jianye Hao, Yong Yu

To better exploit search logs and model users' behavior patterns, numerous click models are proposed to extract users' implicit interaction feedback.

graph construction

Multi-Level Interaction Reranking with User Behavior History

1 code implementation • 20 Apr 2022 • Yunjia Xi, Weiwen Liu, Jieming Zhu, Xilong Zhao, Xinyi Dai, Ruiming Tang, Weinan Zhang, Rui Zhang, Yong Yu

MIR combines low-level cross-item interaction and high-level set-to-list interaction, where we view the candidate items to be reranked as a set and the users' behavior history in chronological order as a list.

Recommendation Systems

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization

2 code implementations • 4 Mar 2022 • Minghuan Liu, Zhengbang Zhu, Yuzheng Zhuang, Weinan Zhang, Jianye Hao, Yong Yu, Jun Wang

Recent progress in state-only imitation learning extends the scope of applicability of imitation learning to real-world settings by relieving the need for observing expert actions.

Imitation Learning Transfer Learning

Multi-View Graph Representation for Programming Language Processing: An Investigation into Algorithm Detection

1 code implementation • 25 Feb 2022 • Ting Long, Yutong Xie, Xianyu Chen, Weinan Zhang, Qinxiang Cao, Yong Yu

We thoroughly evaluate our proposed MVG approach in the context of algorithm detection, an important and challenging subfield of PLP.

Who to Watch Next: Two-side Interactive Networks for Live Broadcast Recommendation

no code implementations • 9 Feb 2022 • Jiarui Jin, Xianyu Chen, Yuanbo Chen, Weinan Zhang, Renting Rui, Zaifan Jiang, Zhewen Su, Yong Yu

With the prevalence of live broadcast business nowadays, a new type of recommendation service, called live broadcast recommendation, is widely used in many mobile e-commerce Apps.

Retrieval

Learn over Past, Evolve for Future: Search-based Time-aware Recommendation with Sequential Behavior Data

no code implementations • 7 Feb 2022 • Jiarui Jin, Xianyu Chen, Weinan Zhang, JunJie Huang, Ziming Feng, Yong Yu

More concretely, we first design a search-based module to retrieve a user's relevant historical behaviors, which are then mixed up with her recent records to be fed into a time-aware sequential network for capturing her time-sensitive demands.

Click-Through Rate Prediction

Efficient Policy Space Response Oracles

no code implementations • 28 Jan 2022 • Ming Zhou, Jingxiao Chen, Ying Wen, Weinan Zhang, Yaodong Yang, Yong Yu, Jun Wang

Policy Space Response Oracle methods (PSRO) provide a general solution to learn Nash equilibrium in two-player zero-sum games but suffer from two drawbacks: (1) the computation inefficiency due to the need for consistent meta-game evaluation via simulations, and (2) the exploration inefficiency due to finding the best response against a fixed meta-strategy at every epoch.

Efficient Exploration

Generative Adversarial Exploration for Reinforcement Learning

no code implementations • 27 Jan 2022 • Weijun Hong, Menghui Zhu, Minghuan Liu, Weinan Zhang, Ming Zhou, Yong Yu, Peng Sun

Exploration is crucial for training the optimal reinforcement learning (RL) policy, where the key is to discriminate whether a visited state is novel.

Generative Adversarial Network Montezuma's Revenge +2

DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

1 code implementation • 27 Jan 2022 • Weijun Hong, Guilin Li, Weinan Zhang, Ruiming Tang, Yunhe Wang, Zhenguo Li, Yong Yu

Neural architecture search (NAS) has shown encouraging results in automating the architecture design.

Neural Architecture Search

PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation

no code implementations • COLING 2022 • Juncheng Wan, Jian Yang, Shuming Ma, Dongdong Zhang, Weinan Zhang, Yong Yu, Zhoujun Li

While end-to-end neural machine translation (NMT) has achieved impressive progress, noisy input usually leads models to become fragile and unstable.

Machine Translation NMT +1

QA4PRF: A Question Answering based Framework for Pseudo Relevance Feedback

no code implementations • 16 Nov 2021 • Handong Ma, Jiawei Hou, Chenxu Zhu, Weinan Zhang, Ruiming Tang, Jincai Lai, Jieming Zhu, Xiuqiang He, Yong Yu

Pseudo relevance feedback (PRF) automatically performs query expansion based on top-retrieved documents to better represent the user's information need so as to improve the search results.

Question Answering Semantic Similarity +1

On Effective Scheduling of Model-based Reinforcement Learning

1 code implementation • NeurIPS 2021 • Hang Lai, Jian Shen, Weinan Zhang, Yimin Huang, Xing Zhang, Ruiming Tang, Yong Yu, Zhenguo Li

Model-based reinforcement learning has attracted wide attention due to its superior sample efficiency.

Continuous Control Model-based Reinforcement Learning +3

Learning Logic Rules for Document-level Relation Extraction

1 code implementation • EMNLP 2021 • Dongyu Ru, Changzhi Sun, Jiangtao Feng, Lin Qiu, Hao Zhou, Weinan Zhang, Yong Yu, Lei LI

LogiRE treats logic rules as latent variables and consists of two modules: a rule generator and a relation extractor.

Ranked #21 on Relation Extraction on DocRED

Document-level Relation Extraction Relation

AIM: Automatic Interaction Machine for Click-Through Rate Prediction

1 code implementation • 5 Nov 2021 • Chenxu Zhu, Bo Chen, Weinan Zhang, Jincai Lai, Ruiming Tang, Xiuqiang He, Zhenguo Li, Yong Yu

To address these three issues mentioned above, we propose Automatic Interaction Machine (AIM) with three core components, namely, Feature Interaction Search (FIS), Interaction Function Search (IFS) and Embedding Dimension Search (EDS), to select significant feature interactions, appropriate interaction functions and necessary embedding dimensions automatically in a unified framework.

Click-Through Rate Prediction

Context-aware Reranking with Utility Maximization for Recommendation

no code implementations • 18 Oct 2021 • Yunjia Xi, Weiwen Liu, Xinyi Dai, Ruiming Tang, Weinan Zhang, Qing Liu, Xiuqiang He, Yong Yu

As a critical task for large-scale commercial recommender systems, reranking has shown the potential of improving recommendation results by uncovering mutual influence among items.

counterfactual Graph Attention +2

Why Propagate Alone? Parallel Use of Labels and Features on Graphs

no code implementations • ICLR 2022 • Yangkun Wang, Jiarui Jin, Weinan Zhang, Yongyi Yang, Jiuhai Chen, Quan Gan, Yong Yu, Zheng Zhang, Zengfeng Huang, David Wipf

In this regard, it has recently been proposed to use a randomly-selected portion of the training labels as GNN inputs, concatenated with the original node features for making predictions on the remaining labels.

Node Property Prediction Property Prediction

Inductive Relation Prediction Using Analogy Subgraph Embeddings

no code implementations • ICLR 2022 • Jiarui Jin, Yangkun Wang, Kounianhua Du, Weinan Zhang, Zheng Zhang, David Wipf, Yong Yu, Quan Gan

Prevailing methods for relation prediction in heterogeneous graphs aim at learning latent representations (i.e., embeddings) of observed nodes and relations, and thus are limited to the transductive setting where the relation types must be known during training.

Inductive Bias Inductive Relation Prediction +1

Graph-Enhanced Exploration for Goal-oriented Reinforcement Learning

no code implementations • ICLR 2022 • Jiarui Jin, Sijin Zhou, Weinan Zhang, Tong He, Yong Yu, Rasool Fakoor

Goal-oriented Reinforcement Learning (GoRL) is a promising approach for scaling up RL techniques on sparse reward environments requiring long horizon planning.

Continuous Control graph construction +2

Plan Your Target and Learn Your Skills: State-Only Imitation Learning via Decoupled Policy Optimization

no code implementations • NeurIPS 2021 • Minghuan Liu, Zhengbang Zhu, Yuzheng Zhuang, Weinan Zhang, Jian Shen, Jianye Hao, Yong Yu, Jun Wang

State-only imitation learning (SOIL) enables agents to learn from massive demonstrations without explicit action or reward information.

Imitation Learning Reinforcement Learning (RL)

Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

1 code implementation • 16 Aug 2021 • Mingcheng Chen, Zhenghui Wang, Zhiyun Zhao, Weinan Zhang, Xiawei Guo, Jian Shen, Yanru Qu, Jieli Lu, Min Xu, Yu Xu, Tiange Wang, Mian Li, Wei-Wei Tu, Yong Yu, Yufang Bi, Weiqing Wang, Guang Ning

To tackle the above challenges, we employ gradient boosting decision trees (GBDT) to handle data heterogeneity and introduce multi-task learning (MTL) to solve data insufficiency.

Diabetes Prediction Multi-Task Learning

Retrieval & Interaction Machine for Tabular Data Prediction

1 code implementation • 11 Aug 2021 • Jiarui Qin, Weinan Zhang, Rong Su, Zhirong Liu, Weiwen Liu, Ruiming Tang, Xiuqiang He, Yong Yu

Prediction over tabular data is an essential task in many data science applications such as recommender systems, online advertising, medical treatment, etc.

Attribute Click-Through Rate Prediction +2

Learning to Select Cuts for Efficient Mixed-Integer Programming

no code implementations • 28 May 2021 • Zeren Huang, Kerong Wang, Furui Liu, Hui-Ling Zhen, Weinan Zhang, Mingxuan Yuan, Jianye Hao, Yong Yu, Jun Wang

In online A/B testing on product planning problems with more than $10^7$ variables and constraints daily, Cut Ranking achieved an average speedup ratio of 12.42% over the production solver without any loss of solution accuracy.

Multiple Instance Learning

MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks

1 code implementation • 13 May 2021 • Menghui Zhu, Minghuan Liu, Jian Shen, Zhicheng Zhang, Sheng Chen, Weinan Zhang, Deheng Ye, Yong Yu, Qiang Fu, Wei Yang

In Goal-oriented Reinforcement learning, relabeling the raw goals in past experience to provide agents with hindsight ability is a major solution to the reward sparsity problem.

An Adversarial Imitation Click Model for Information Retrieval

1 code implementation • 13 Apr 2021 • Xinyi Dai, Jianghao Lin, Weinan Zhang, Shuai Li, Weiwen Liu, Ruiming Tang, Xiuqiang He, Jianye Hao, Jun Wang, Yong Yu

Modern information retrieval systems, including web search, ads placement, and recommender systems, typically rely on learning from user feedback.

Imitation Learning Information Retrieval +2

Bag of Tricks for Node Classification with Graph Neural Networks

2 code implementations • 24 Mar 2021 • Yangkun Wang, Jiarui Jin, Weinan Zhang, Yong Yu, Zheng Zhang, David Wipf

Over the past few years, graph neural networks (GNN) and label propagation-based methods have made significant progress in addressing node classification tasks on graphs.

Ranked #1 on Node Property Prediction on ogbn-proteins

Classification General Classification +2

MARS: Markov Molecular Sampling for Multi-objective Drug Discovery

1 code implementation • ICLR 2021 • Yutong Xie, Chence Shi, Hao Zhou, Yuwei Yang, Weinan Zhang, Yong Yu, Lei LI

Searching for novel molecules with desired chemical properties is crucial in drug discovery.

Drug Discovery Molecular Graph Generation

Universal Trading for Order Execution with Oracle Policy Distillation

no code implementations • 28 Jan 2021 • Yuchen Fang, Kan Ren, Weiqing Liu, Dong Zhou, Weinan Zhang, Jiang Bian, Yong Yu, Tie-Yan Liu

As a fundamental problem in algorithmic trading, order execution aims at fulfilling a specific trading order, either liquidation or acquirement, for a given instrument.

Algorithmic Trading reinforcement-learning +1

Explore with Dynamic Map: Graph Structured Reinforcement Learning

no code implementations • 1 Jan 2021 • Jiarui Jin, Sijin Zhou, Weinan Zhang, Rasool Fakoor, David Wipf, Tong He, Yong Yu, Zheng Zhang, Alex Smola

In reinforcement learning, a map with states and transitions built based on historical trajectories is often helpful in exploration and exploitation.

Reinforcement Learning (RL)

Regioned Episodic Reinforcement Learning

no code implementations • 1 Jan 2021 • Jiarui Jin, Cong Chen, Ming Zhou, Weinan Zhang, Rasool Fakoor, David Wipf, Yong Yu, Jun Wang, Alex Smola

Goal-oriented reinforcement learning algorithms are often good at exploration, not exploitation, while episodic algorithms excel at exploitation, not exploration.

Reinforcement Learning (RL)

Non-iterative Parallel Text Generation via Glancing Transformer

no code implementations • 1 Jan 2021 • Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Weinan Zhang, Yong Yu, Lei LI

Although non-autoregressive models with one-iteration generation achieve remarkable inference speed-up, they still fall behind their autoregressive counterparts in prediction accuracy.

Language Modelling Text Generation

Improving Knowledge Tracing via Pre-training Question Embeddings

1 code implementation • 9 Dec 2020 • Yunfei Liu, Yang Yang, Xianyu Chen, Jian Shen, Haifeng Zhang, Yong Yu

Knowledge tracing (KT) defines the task of predicting whether students can correctly answer questions based on their historical response.

Ranked #3 on Knowledge Tracing on EdNet

Knowledge Tracing

Paper
Code

Towards Generalized Implementation of Wasserstein Distance in GANs

1 code implementation • 7 Dec 2020 • Minkai Xu, Zhiming Zhou, Guansong Lu, Jian Tang, Weinan Zhang, Yong Yu

Wasserstein GANs (WGANs), built upon the Kantorovich-Rubinstein (KR) duality of Wasserstein distance, are among the most theoretically sound GAN models.

Paper
Code

GraphHINGE: Learning Interaction Models of Structured Neighborhood on Heterogeneous Information Network

1 code implementation • 25 Nov 2020 • Jiarui Jin, Kounianhua Du, Weinan Zhang, Jiarui Qin, Yuchen Fang, Yong Yu, Zheng Zhang, Alexander J. Smola

Heterogeneous information network (HIN) has been widely used to characterize entities of various types and their complex relations.

Click-Through Rate Prediction

Paper
Code

U-rank: Utility-oriented Learning to Rank with Implicit Feedback

no code implementations • 1 Nov 2020 • Xinyi Dai, Jiawei Hou, Qing Liu, Yunjia Xi, Ruiming Tang, Weinan Zhang, Xiuqiang He, Jun Wang, Yong Yu

To this end, we propose a novel ranking framework called U-rank that directly optimizes the expected utility of the ranking list.

Click-Through Rate Prediction Learning-To-Rank +2

Paper
Add Code

Efficient Projection-Free Algorithms for Saddle Point Problems

no code implementations • NeurIPS 2020 • Cheng Chen, Luo Luo, Weinan Zhang, Yong Yu

The Frank-Wolfe algorithm is a classic method for constrained optimization problems.

Paper
Add Code

Model-based Policy Optimization with Unsupervised Model Adaptation

1 code implementation • NeurIPS 2020 • Jian Shen, Han Zhao, Weinan Zhang, Yong Yu

However, due to the potential distribution mismatch between simulated data and real data, this could lead to degraded performance.

Continuous Control Model-based Reinforcement Learning +2

Paper
Code

AI Chiller: An Open IoT Cloud Based Machine Learning Framework for the Energy Saving of Building HVAC System via Big Data Analytics on the Fusion of BMS and Environmental Data

no code implementations • 9 Oct 2020 • Yong Yu

Although many research works and projects turn to this direction for energy saving, the application into the optimization problem remains a challenging task.

Paper
Add Code

GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning

no code implementations • 17 Sep 2020 • Chang Liu, Huichu Zhang, Wei-Nan Zhang, Guanjie Zheng, Yong Yu

The heavy traffic congestion problem has always been a concern for modern cities.

Clustering Generative Adversarial Network +4

Paper
Add Code

GIKT: A Graph-based Interaction Model for Knowledge Tracing

3 code implementations • 13 Sep 2020 • Yang Yang, Jian Shen, Yanru Qu, Yunfei Liu, Kerong Wang, Yaoming Zhu, Wei-Nan Zhang, Yong Yu

With the rapid development in online education, knowledge tracing (KT) has become a fundamental problem which traces students' knowledge status and predicts their performance on new questions.

Ranked #7 on Knowledge Tracing on EdNet

Knowledge Tracing

Paper
Code

Glancing Transformer for Non-Autoregressive Neural Machine Translation

1 code implementation • ACL 2021 • Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Wei-Nan Zhang, Yong Yu, Lei LI

With GLM, we develop Glancing Transformer (GLAT) for machine translation.

Ranked #69 on Machine Translation on WMT2014 English-German

Language Modelling Machine Translation +1

131

Paper
Code

Bidirectional Model-based Policy Optimization

1 code implementation • ICML 2020 • Hang Lai, Jian Shen, Wei-Nan Zhang, Yong Yu

Model-based reinforcement learning approaches leverage a forward dynamics model to support planning and decision making, which, however, may fail catastrophically if the model is inaccurate.

Decision Making Model-based Reinforcement Learning +1

Paper
Code

An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph

1 code implementation • 1 Jul 2020 • Jiarui Jin, Jiarui Qin, Yuchen Fang, Kounianhua Du, Wei-Nan Zhang, Yong Yu, Zheng Zhang, Alexander J. Smola

To the best of our knowledge, this is the first work providing an efficient neighborhood-based interaction model in the HIN-based recommendations.

Recommendation Systems

Paper
Code

Interactive Recommender System via Knowledge Graph-enhanced Reinforcement Learning

no code implementations • 18 Jun 2020 • Sijin Zhou, Xinyi Dai, Haokun Chen, Wei-Nan Zhang, Kan Ren, Ruiming Tang, Xiuqiang He, Yong Yu

Interactive recommender system (IRS) has drawn huge attention because of its flexible recommendation strategy and the consideration of optimal long-term user experiences.

Decision Making Recommendation Systems +3

Paper
Add Code

User Behavior Retrieval for Click-Through Rate Prediction

1 code implementation • 28 May 2020 • Jiarui Qin, Wei-Nan Zhang, Xin Wu, Jiarui Jin, Yuchen Fang, Yong Yu

These retrieved behaviors are then fed into a deep model to make the final prediction instead of simply using the most recent ones.

Click-Through Rate Prediction Retrieval

Paper
Code

A Deep Recurrent Survival Model for Unbiased Ranking

1 code implementation • 30 Apr 2020 • Jiarui Jin, Yuchen Fang, Wei-Nan Zhang, Kan Ren, Guorui Zhou, Jian Xu, Yong Yu, Jun Wang, Xiaoqiang Zhu, Kun Gai

Position bias is a critical problem in information retrieval when dealing with implicit yet biased user feedback data.

Information Retrieval Position +2

Paper
Code

Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space

no code implementations • Findings of the Association for Computational Linguistics 2020 • Dongyu Ru, Jiangtao Feng, Lin Qiu, Hao Zhou, Mingxuan Wang, Wei-Nan Zhang, Yong Yu, Lei LI

We propose adversarial uncertainty sampling in discrete space (AUSDS) to retrieve informative unlabeled samples more efficiently.

Active Learning Adversarial Attack +2

Paper
Add Code

Infomax Neural Joint Source-Channel Coding via Adversarial Bit Flip

1 code implementation • 3 Apr 2020 • Yuxuan Song, Minkai Xu, Lantao Yu, Hao Zhou, Shuo Shao, Yong Yu

In this paper, motivated by the inherent connections between neural joint source-channel coding and discrete representation learning, we propose a novel regularization method called Infomax Adversarial-Bit-Flip (IABF) to improve the stability and robustness of the neural joint source-channel coding scheme.

Representation Learning

Paper
Code

AutoFIS: Automatic Feature Interaction Selection in Factorization Models for Click-Through Rate Prediction

4 code implementations • 25 Mar 2020 • Bin Liu, Chenxu Zhu, Guilin Li, Wei-Nan Zhang, Jincai Lai, Ruiming Tang, Xiuqiang He, Zhenguo Li, Yong Yu

By implementing a regularized optimizer over the architecture parameters, the model can automatically identify and remove the redundant feature interactions during the training process of the model.

Ranked #29 on Click-Through Rate Prediction on Criteo

Click-Through Rate Prediction Recommendation Systems

4,094

Paper
Code

Large-Scale Optimal Transport via Adversarial Training with Cycle-Consistency

no code implementations • 14 Mar 2020 • Guansong Lu, Zhiming Zhou, Jian Shen, Cheng Chen, Wei-Nan Zhang, Yong Yu

Recent advances in large-scale optimal transport have greatly extended its application scenarios in machine learning.

Domain Adaptation Image-to-Image Translation +1

Paper
Add Code

Multi-Agent Interactions Modeling with Correlated Policies

1 code implementation • ICLR 2020 • Minghuan Liu, Ming Zhou, Wei-Nan Zhang, Yuzheng Zhuang, Jun Wang, Wulong Liu, Yong Yu

In this paper, we cast the multi-agent interactions modeling problem into a multi-agent imitation learning framework with explicit modeling of correlated policies by approximating opponents' policies, which can recover agents' policies that can regenerate similar interactions.

Imitation Learning

Paper
Code

Improving Unsupervised Domain Adaptation with Variational Information Bottleneck

no code implementations • 21 Nov 2019 • Yuxuan Song, Lantao Yu, Zhangjie Cao, Zhiming Zhou, Jian Shen, Shuo Shao, Wei-Nan Zhang, Yong Yu

Domain adaptation aims to leverage the supervision signal of source domain to obtain an accurate model for target domain, where the labels are not available.

Unsupervised Domain Adaptation

Paper
Add Code

Sequential Recommendation with Dual Side Neighbor-based Collaborative Relation Modeling

1 code implementation • 10 Nov 2019 • Jiarui Qin, Kan Ren, Yuchen Fang, Wei-Nan Zhang, Yong Yu

Various sequential recommendation methods are proposed to model the dynamic user behaviors.

Relation Sequential Recommendation

Paper
Code

Exploring Diverse Expressions for Paraphrase Generation

no code implementations • IJCNLP 2019 • Lihua Qian, Lin Qiu, Wei-Nan Zhang, Xin Jiang, Yong Yu

Paraphrasing plays an important role in various natural language processing (NLP) tasks, such as question answering, information retrieval and sentence simplification.

Information Retrieval Paraphrase Generation +4

Paper
Add Code

Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching

no code implementations • 7 Oct 2019 • Ming Zhou, Jiarui Jin, Wei-Nan Zhang, Zhiwei Qin, Yan Jiao, Chenxi Wang, Guobin Wu, Yong Yu, Jieping Ye

Improving the efficiency of dispatching orders to vehicles is a research hotspot in online ride-hailing systems.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning

no code implementations • 10 Sep 2019 • Liheng Chen, Hongyi Guo, Yali Du, Fei Fang, Haifeng Zhang, Yaoming Zhu, Ming Zhou, Wei-Nan Zhang, Qing Wang, Yong Yu

Although existing works formulate this problem into a centralized learning with decentralized execution framework, which avoids the non-stationary problem in training, their decentralized execution paradigm limits the agents' capability to coordinate.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Towards Making the Most of BERT in Neural Machine Translation

2 code implementations • 15 Aug 2019 • Jiacheng Yang, Mingxuan Wang, Hao Zhou, Chengqi Zhao, Yong Yu, Wei-Nan Zhang, Lei LI

Our experiments in machine translation show CTNMT gains of up to 3 BLEU score on the WMT14 English-German language pair which even surpasses the previous state-of-the-art pre-training aided NMT by 1.4 BLEU score.

Machine Translation NMT +2

293

Paper
Code

Urban Traffic Prediction from Spatio-Temporal Data Using Deep Meta Learning

1 code implementation • KDD '19 2019 • Zheyi Pan, Yuxuan Liang, Weifeng Wang, Yong Yu, Yu Zheng, Junbo Zhang

Predicting urban traffic is of great importance to intelligent transportation systems and public safety, yet is very challenging because of two aspects: 1) complex spatio-temporal correlations of urban traffic, including spatial correlations between locations along with temporal correlations among timestamps; 2) diversity of such spatio-temporal correlations, which vary from location to location and depend on the surrounding geographical information, e.g., points of interest and road networks.

Graph Attention Meta-Learning +3

194

Paper
Code

Triple-to-Text: Converting RDF Triples into High-Quality Natural Languages via Optimizing an Inverse KL Divergence

1 code implementation • 25 May 2019 • Yaoming Zhu, Juncheng Wan, Zhiming Zhou, Liheng Chen, Lin Qiu, Wei-Nan Zhang, Xin Jiang, Yong Yu

Knowledge base is one of the main forms to represent information in a structured way.

Text Generation

Paper
Code

Dynamically Fused Graph Network for Multi-hop Reasoning

1 code implementation • ACL 2019 • Yunxuan Xiao, Yanru Qu, Lin Qiu, Hao Zhou, Lei LI, Wei-Nan Zhang, Yong Yu

However, many difficult questions require multiple pieces of supporting evidence from scattered text among two or more documents.

Ranked #33 on Question Answering on HotpotQA

Question Answering

190

Paper
Code

CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario

1 code implementation • 13 May 2019 • Huichu Zhang, Siyuan Feng, Chang Liu, Yaoyao Ding, Yichen Zhu, Zihan Zhou, Wei-Nan Zhang, Yong Yu, Haiming Jin, Zhenhui Li

The most commonly used open-source traffic simulator SUMO is, however, not scalable to large road networks and large traffic flows, which hinders the study of reinforcement learning on traffic scenarios.

Multi-agent Reinforcement Learning reinforcement-learning +1

735

Paper
Code

Deep Landscape Forecasting for Real-time Bidding Advertising

2 code implementations • 7 May 2019 • Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Wei-Nan Zhang, Yong Yu

The problem is formulated as forecasting the probability distribution of the market price for each ad auction.

Survival Analysis

Paper
Code

Lifelong Sequential Modeling with Personalized Memorization for User Response Prediction

1 code implementation • 2 May 2019 • Kan Ren, Jiarui Qin, Yuchen Fang, Wei-Nan Zhang, Lei Zheng, Weijie Bian, Guorui Zhou, Jian Xu, Yong Yu, Xiaoqiang Zhu, Kun Gai

In order to tackle these challenges, in this paper, we propose a Hierarchical Periodic Memory Network for lifelong sequential modeling with personalized memorization of sequential patterns for each user.

Memorization

101

Paper
Code

Towards Efficient and Unbiased Implementation of Lipschitz Continuity in GANs

1 code implementation • 2 Apr 2019 • Zhiming Zhou, Jian Shen, Yuxuan Song, Wei-Nan Zhang, Yong Yu

Lipschitz continuity has recently become popular in generative adversarial networks (GANs).

Paper
Code

Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space

1 code implementation • 4 Mar 2019 • Zhou Fan, Rui Su, Wei-Nan Zhang, Yong Yu

In this paper we propose a hybrid architecture of actor-critic algorithms for reinforcement learning in parameterized action space, which consists of multiple parallel sub-actor networks to decompose the structured action space into simpler action spaces along with a critic network to guide the training of all sub-actor networks.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Lipschitz Generative Adversarial Nets

1 code implementation • 15 Feb 2019 • Zhiming Zhou, Jiadong Liang, Yuxuan Song, Lantao Yu, Hongwei Wang, Wei-Nan Zhang, Yong Yu, Zhihua Zhang

By contrast, Wasserstein GAN (WGAN), where the discriminative function is restricted to 1-Lipschitz, does not suffer from such a gradient uninformativeness problem.

Informativeness

Paper
Code

Guiding the One-to-one Mapping in CycleGAN via Optimal Transport

no code implementations • 15 Nov 2018 • Guansong Lu, Zhiming Zhou, Yuxuan Song, Kan Ren, Yong Yu

CycleGAN is capable of learning a one-to-one mapping between two data distributions without paired examples, achieving the task of unsupervised data translation.

Translation

Paper
Add Code

Layout Design for Intelligent Warehouse by Evolution with Fitness Approximation

no code implementations • 14 Nov 2018 • Haifeng Zhang, Zilong Guo, Han Cai, Chris Wang, Wei-Nan Zhang, Yong Yu, Wenxin Li, Jun Wang

With the rapid growth of the express industry, intelligent warehouses that employ autonomous robots for carrying parcels have been widely used to handle the vast express volume.

Layout Design

Paper
Add Code

Large-scale Interactive Recommendation with Tree-structured Policy Gradient

no code implementations • 14 Nov 2018 • Haokun Chen, Xinyi Dai, Han Cai, Wei-Nan Zhang, Xuejian Wang, Ruiming Tang, Yuzhou Zhang, Yong Yu

Reinforcement learning (RL) has recently been introduced to interactive recommender systems (IRS) because of its nature of learning from dynamic interactions and planning for long-run performance.

Clustering Recommendation Systems +1

Paper
Add Code

AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods

3 code implementations • ICLR 2019 • Zhiming Zhou, Qingru Zhang, Guansong Lu, Hongwei Wang, Wei-Nan Zhang, Yong Yu

Adam has been shown to be unable to converge to the optimal solution in certain cases.

Paper
Code

HyperST-Net: Hypernetworks for Spatio-Temporal Forecasting

no code implementations • 28 Sep 2018 • Zheyi Pan, Yuxuan Liang, Junbo Zhang, Xiuwen Yi, Yong Yu, Yu Zheng

In this paper, we propose a general framework (HyperST-Net) based on hypernetworks for deep ST models.

Spatio-Temporal Forecasting Time Series +1

Paper
Add Code

Sampled in Pairs and Driven by Text: A New Graph Embedding Framework

no code implementations • 12 Sep 2018 • Liheng Chen, Yanru Qu, Zhenghui Wang, Lin Qiu, Wei-Nan Zhang, Ken Chen, Shaodian Zhang, Yong Yu

TGE-PS uses Pairs Sampling (PS) to improve the sampling strategy of RW, being able to reduce ~99% training samples while preserving competitive performance.

Graph Embedding Link Prediction

Paper
Add Code

Deep Recurrent Survival Analysis

1 code implementation • 7 Sep 2018 • Kan Ren, Jiarui Qin, Lei Zheng, Zhengyu Yang, Wei-Nan Zhang, Lin Qiu, Yong Yu

By capturing the time dependency through modeling the conditional probability of the event for each sample, our method predicts the likelihood of the true event occurrence and estimates the survival rate over time, i.e., the probability of the non-occurrence of the event, for the censored data.

Survival Analysis

134

Paper
Code

Learning Multi-touch Conversion Attribution with Dual-attention Mechanisms for Online Advertising

1 code implementation • 11 Aug 2018 • Kan Ren, Yuchen Fang, Wei-Nan Zhang, Shuhao Liu, Jiajun Li, Ya Zhang, Yong Yu, Jun Wang

To achieve this, we utilize sequence-to-sequence prediction for user clicks, and combine both post-view and post-click attribution patterns together for the final conversion estimation.

Paper
Code

Understanding the Effectiveness of Lipschitz-Continuity in Generative Adversarial Nets

1 code implementation • 2 Jul 2018 • Zhiming Zhou, Yuxuan Song, Lantao Yu, Hongwei Wang, Jiadong Liang, Wei-Nan Zhang, Zhihua Zhang, Yong Yu

In this paper, we investigate the underlying factor that leads to failure and success in the training of GANs.

valid

Paper
Code

Product-based Neural Networks for User Response Prediction over Multi-field Categorical Data

8 code implementations • 1 Jul 2018 • Yanru Qu, Bohui Fang, Wei-Nan Zhang, Ruiming Tang, Minzhe Niu, Huifeng Guo, Yong Yu, Xiuqiang He

User response prediction is a crucial component for personalized information retrieval and filtering scenarios, such as recommender system and web search.

Click-Through Rate Prediction Feature Engineering +3

7,342

Paper
Code

Path-Level Network Transformation for Efficient Architecture Search

3 code implementations • ICML 2018 • Han Cai, Jiacheng Yang, Wei-Nan Zhang, Song Han, Yong Yu

We introduce a new function-preserving transformation for efficient neural architecture search.

Ranked #9 on Neural Architecture Search on CIFAR-10 Image Classification

Image Classification Neural Architecture Search

170

Paper
Code

Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition

no code implementations • NAACL 2018 • Zhenghui Wang, Yanru Qu, Li-Heng Chen, Jian Shen, Wei-Nan Zhang, Shaodian Zhang, Yimei Gao, Gen Gu, Ken Chen, Yong Yu

We study the problem of named entity recognition (NER) from electronic medical records, which is one of the most fundamental and critical problems for medical text mining.

Medical Named Entity Recognition named-entity-recognition +3

Paper
Add Code

CoT: Cooperative Training for Generative Modeling of Discrete Data

2 code implementations • ICLR 2019 • Sidi Lu, Lantao Yu, Siyuan Feng, Yaoming Zhu, Wei-Nan Zhang, Yong Yu

In this paper, we study the generative models of sequential discrete data.

Paper
Code

QA4IE: A Question Answering based Framework for Information Extraction

1 code implementation • 10 Apr 2018 • Lin Qiu, Hao Zhou, Yanru Qu, Wei-Nan Zhang, Suoheng Li, Shu Rong, Dongyu Ru, Lihua Qian, Kewei Tu, Yong Yu

Information Extraction (IE) refers to automatically extracting structured relation tuples from unstructured texts.

Question Answering Relation +2

Paper
Code

Neural Text Generation: Past, Present and Beyond

no code implementations • 15 Mar 2018 • Sidi Lu, Yaoming Zhu, Wei-Nan Zhang, Jun Wang, Yong Yu

This paper presents a systematic survey on recent development of neural text generation models.

Benchmarking reinforcement-learning +2

Paper
Add Code

Bidding Machine: Learning to Bid for Directly Optimizing Profits in Display Advertising

no code implementations • 1 Mar 2018 • Kan Ren, Wei-Nan Zhang, Ke Chang, Yifei Rong, Yong Yu, Jun Wang

From the learning perspective, we show that the bidding machine can be updated smoothly with either offline periodical batch or online sequential training schemes.

BIG-bench Machine Learning

Paper
Add Code

Unsupervised Deep Domain Adaptation for Pedestrian Detection

no code implementations • 9 Feb 2018 • Lihang Liu, Weiyao Lin, Lisheng Wu, Yong Yu, Michael Ying Yang

This paper addresses the problem of unsupervised domain adaptation on the task of pedestrian detection in crowded scenes.

Pedestrian Detection Unsupervised Domain Adaptation

Paper
Add Code

Texygen: A Benchmarking Platform for Text Generation Models

1 code implementation • 6 Feb 2018 • Yaoming Zhu, Sidi Lu, Lei Zheng, Jiaxian Guo, Wei-Nan Zhang, Jun Wang, Yong Yu

We introduce Texygen, a benchmarking platform to support research on open-domain text generation models.

Benchmarking Text Generation

859

Paper
Code

Supervised Hashing based on Energy Minimization

no code implementations • 2 Dec 2017 • Zihao Hu, Xiyi Luo, Hongtao Lu, Yong Yu

Recently, supervised hashing methods have attracted much attention since they can optimize retrieval speed and storage cost while preserving semantic information.

Retrieval

Paper
Add Code

MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence

3 code implementations • 2 Dec 2017 • Lianmin Zheng, Jiacheng Yang, Han Cai, Wei-Nan Zhang, Jun Wang, Yong Yu

Unlike previous research platforms on single or multi-agent reinforcement learning, MAgent focuses on supporting the tasks and the applications that require hundreds to millions of agents.

Multi-agent Reinforcement Learning reinforcement-learning +1

1,667

Paper
Code

Face Transfer with Generative Adversarial Network

no code implementations • 17 Oct 2017 • Runze Xu, Zhiming Zhou, Wei-Nan Zhang, Yong Yu

Face transfer animates the facial performances of the character in the target video by a source actor.

Face Transfer Generative Adversarial Network

Paper
Add Code

Long Text Generation via Adversarial Training with Leaked Information

6 code implementations • 24 Sep 2017 • Jiaxian Guo, Sidi Lu, Han Cai, Wei-Nan Zhang, Yong Yu, Jun Wang

Automatically generating coherent and semantically meaningful text has many applications in machine translation, dialogue systems, image captioning, etc.

Ranked #1 on Text Generation on COCO Captions

Sentence Text Generation

575

Paper
Code

A Study of AI Population Dynamics with Million-agent Reinforcement Learning

no code implementations • 13 Sep 2017 • Yaodong Yang, Lantao Yu, Yiwei Bai, Jun Wang, Wei-Nan Zhang, Ying Wen, Yong Yu

We conduct an empirical study on discovering the ordered collective dynamics obtained by a population of intelligence agents, driven by million-agent reinforcement learning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Efficient Architecture Search by Network Transformation

3 code implementations • 16 Jul 2017 • Han Cai, Tianyao Chen, Wei-Nan Zhang, Yong Yu, Jun Wang

Techniques for automatically designing deep neural network architectures such as reinforcement learning based approaches have recently shown promising results.

Ranked #140 on Image Classification on CIFAR-10

Image Classification Neural Architecture Search +2

170

Paper
Code

Learning to Design Games: Strategic Environments in Reinforcement Learning

no code implementations • 5 Jul 2017 • Haifeng Zhang, Jun Wang, Zhiming Zhou, Wei-Nan Zhang, Ying Wen, Yong Yu, Wenxin Li

In typical reinforcement learning (RL), the environment is assumed given and the goal of the learning is to identify an optimal policy for the agent taking actions through its interactions with the environment.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Wasserstein Distance Guided Representation Learning for Domain Adaptation

8 code implementations • 5 Jul 2017 • Jian Shen, Yanru Qu, Wei-Nan Zhang, Yong Yu

Inspired by Wasserstein GAN, in this paper we propose a novel approach to learn domain invariant feature representations, namely Wasserstein Distance Guided Representation Learning (WDGRL).

Domain Adaptation General Classification +2

586

Paper
Code

Activation Maximization Generative Adversarial Nets

2 code implementations • ICLR 2018 • Zhiming Zhou, Han Cai, Shu Rong, Yuxuan Song, Kan Ren, Wei-Nan Zhang, Yong Yu, Jun Wang

Our proposed model also outperforms the baseline methods in the new metric.

Paper
Code

Unsupervised Diverse Colorization via Generative Adversarial Networks

1 code implementation • 22 Feb 2017 • Yun Cao, Zhiming Zhou, Wei-Nan Zhang, Yong Yu

Colorization of grayscale images has been a hot topic in computer vision.

Colorization

Paper
Code

Real-Time Bidding by Reinforcement Learning in Display Advertising

1 code implementation • 10 Jan 2017 • Han Cai, Kan Ren, Wei-Nan Zhang, Kleanthis Malialis, Jun Wang, Yong Yu, Defeng Guo

In this paper, we formulate the bid decision process as a reinforcement learning problem, where the state space is represented by the auction information and the campaign's real-time parameters, while an action is the bid price to set.

reinforcement-learning Reinforcement Learning (RL)

173

Paper
Code

Product-based Neural Networks for User Response Prediction

11 code implementations • 1 Nov 2016 • Yanru Qu, Han Cai, Kan Ren, Wei-Nan Zhang, Yong Yu, Ying Wen, Jun Wang

Predicting user responses, such as clicks and conversions, is of great importance and has found its usage in many Web applications including recommender systems, web search and online advertising.

Ranked #1 on Click-Through Rate Prediction on iPinYou

Click-Through Rate Prediction Recommendation Systems

7,342

Paper
Code

Context-Dependent Sense Embedding

no code implementations • EMNLP 2016 • Lin Qiu, Kewei Tu, Yong Yu

Clustering Word Embeddings +1

Paper
Add Code

SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

23 code implementations • 18 Sep 2016 • Lantao Yu, Wei-Nan Zhang, Jun Wang, Yong Yu

As a new way of training generative models, Generative Adversarial Nets (GAN) that uses a discriminative model to guide the training of the generative model has enjoyed considerable success in generating real-valued data.

Ranked #2 on Text Generation on Chinese Poems

Reinforcement Learning (RL) Text Generation

2,071

Paper
Code

A Graph Traversal Based Approach to Answer Non-Aggregation Questions Over DBpedia

no code implementations • 16 Oct 2015 • Chenhao Zhu, Kan Ren, Xuan Liu, Haofen Wang, Yiding Tian, Yong Yu

We present a question answering system over DBpedia, filling the gap between user information needs expressed in natural language and a structured query interface expressed in SPARQL over the underlying knowledge base (KB).

Question Answering

Paper
Add Code

A Latent Clothing Attribute Approach for Human Pose Estimation

no code implementations • 16 Nov 2014 • Weipeng Zhang, Jie Shen, Guangcan Liu, Yong Yu

Unlike previous approaches, our approach models the clothing attributes as latent variables and thus requires no explicit labeling for the clothing attributes.

Action Recognition Attribute +3

Paper
Add Code

A Parallel and Efficient Algorithm for Learning to Match

no code implementations • 22 Oct 2014 • Jingbo Shang, Tianqi Chen, Hang Li, Zhengdong Lu, Yong Yu

In this paper, we tackle this challenge with a novel parallel and efficient algorithm for feature-based matrix factorization.

Collaborative Filtering Link Prediction

Paper
Add Code

Unified Structured Learning for Simultaneous Human Pose Estimation and Garment Attribute Classification

no code implementations • 19 Apr 2014 • Jie Shen, Guangcan Liu, Jia Chen, Yuqiang Fang, Jianbin Xie, Yong Yu, Shuicheng Yan

In this paper, we utilize structured learning to simultaneously address two intertwined problems: human pose estimation (HPE) and garment attribute classification (GAC), which are valuable for a variety of computer vision and multimedia applications.

Attribute General Classification +1

Paper
Add Code

Feature-Based Matrix Factorization

no code implementations • 11 Sep 2011 • Tianqi Chen, Zhao Zheng, Qiuxia Lu, Weinan Zhang, Yong Yu

Recommender systems have recently become increasingly popular and are widely used in many applications.

Recommendation Systems

Paper
Add Code

Robust Recovery of Subspace Structures by Low-Rank Representation

1 code implementation • 14 Oct 2010 • Guangcan Liu, Zhouchen Lin, Shuicheng Yan, Ju Sun, Yong Yu, Yi Ma

In this work we address the subspace recovery problem.

Paper
Code
