Search Results for author: Tiancheng Jin

Found 7 papers, 1 paper with code

Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition

no code implementations • ICML 2020 • Chi Jin, Tiancheng Jin, Haipeng Luo, Suvrit Sra, Tiancheng Yu

We consider the task of learning in episodic finite-horizon Markov decision processes with an unknown transition function, bandit feedback, and adversarial losses.


Heterogeneous Directed Hypergraph Neural Network over abstract syntax tree (AST) for Code Classification

1 code implementation • 7 May 2023 • Guang Yang, Tiancheng Jin, Liang Dou

In this study, we propose to represent the AST as a heterogeneous directed hypergraph (HDHG) and process the graph with a heterogeneous directed hypergraph neural network (HDHGN) for code classification.

Code Classification


Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback

no code implementations • 31 Jan 2022 • Tiancheng Jin, Tal Lancewicki, Haipeng Luo, Yishay Mansour, Aviv Rosenberg

The standard assumption in reinforcement learning (RL) is that agents observe feedback for their actions immediately.

reinforcement-learning, Reinforcement Learning (RL)


The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition

no code implementations • NeurIPS 2021 • Tiancheng Jin, Longbo Huang, Haipeng Luo

We consider the best-of-both-worlds problem for learning an episodic Markov Decision Process through $T$ episodes, with the goal of achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ regret when the losses are adversarial and simultaneously $\mathcal{O}(\text{polylog}(T))$ regret when the losses are (almost) stochastic.

Open-Ended Question Answering


Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition

no code implementations • NeurIPS 2020 • Tiancheng Jin, Haipeng Luo

This work studies the problem of learning episodic Markov Decision Processes with known transition and bandit feedback.

Multi-Armed Bandits


Learning Adversarial MDPs with Bandit Feedback and Unknown Transition

no code implementations • 3 Dec 2019 • Chi Jin, Tiancheng Jin, Haipeng Luo, Suvrit Sra, Tiancheng Yu

We consider the problem of learning in episodic finite-horizon Markov decision processes with an unknown transition function, bandit feedback, and adversarial losses.


Deep Reinforcement Learning for Multi-Driver Vehicle Dispatching and Repositioning Problem

no code implementations • 25 Nov 2019 • John Holler, Risto Vuorio, Zhiwei Qin, Xiaocheng Tang, Yan Jiao, Tiancheng Jin, Satinder Singh, Chenxi Wang, Jieping Ye

Order dispatching and driver repositioning (also known as fleet management) in the face of spatially and temporally varying supply and demand are central to a ride-sharing platform marketplace.

BIG-bench Machine Learning, Decision Making (+3 more)

