Search Results for author: Ran Tian

Found 28 papers, 4 papers with code

MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention

no code implementations24 Jun 2024 Yuxin Chen, Chen Tang, Chenran Li, Ran Tian, Peter Stone, Masayoshi Tomizuka, Wei Zhan

Instead of inferring the complete human behavior characteristics, MEReQ infers a residual reward function that captures the discrepancy between the human expert's and the prior policy's underlying reward functions.

Imitation Learning Q-Learning

Quantifying Agent Interaction in Multi-agent Reinforcement Learning for Cost-efficient Generalization

no code implementations11 Oct 2023 Yuxin Chen, Chen Tang, Ran Tian, Chenran Li, Jinning Li, Masayoshi Tomizuka, Wei Zhan

We observe that, generally, a more diverse set of co-play agents during training enhances the generalization performance of the ego agent; however, this improvement varies across distinct scenarios and environments.

Multi-agent Reinforcement Learning

Towards Modeling and Influencing the Dynamics of Human Learning

no code implementations2 Jan 2023 Ran Tian, Masayoshi Tomizuka, Anca Dragan, Andrea Bajcsy

Interestingly, robot actions influence what this experience is, and therefore influence how people's internal models change.

Amos: An Adam-style Optimizer with Adaptive Weight Decay towards Model-Oriented Scale

1 code implementation21 Oct 2022 Ran Tian, Ankur P. Parikh

We present Amos, a stochastic gradient-based optimizer designed for training deep neural networks.

Simple Recurrence Improves Masked Language Models

no code implementations23 May 2022 Tao Lei, Ran Tian, Jasmijn Bastings, Ankur P. Parikh

In this work, we explore whether modeling recurrence into the Transformer architecture can both be beneficial and efficient, by building an extremely simple recurrent module into the Transformer.

Shatter: An Efficient Transformer Encoder with Single-Headed Self-Attention and Relative Sequence Partitioning

no code implementations30 Aug 2021 Ran Tian, Joshua Maynez, Ankur P. Parikh

The highly popular Transformer architecture, based on self-attention, is the foundation of large pretrained models such as BERT, that have become an enduring paradigm in NLP.

Learning Human Rewards by Inferring Their Latent Intelligence Levels in Multi-Agent Games: A Theory-of-Mind Approach with Application to Driving Data

no code implementations7 Mar 2021 Ran Tian, Masayoshi Tomizuka, Liting Sun

In this work, we advocate that humans are bounded rational and have different intelligence levels when reasoning about others' decision-making process, and such an inherent and latent characteristic should be accounted for in reward learning algorithms.

Decision Making reinforcement-learning +1

Local Additivity Based Data Augmentation for Semi-supervised NER

1 code implementation EMNLP 2020 Jiaao Chen, Zhenghui Wang, Ran Tian, Zichao Yang, Diyi Yang

Named Entity Recognition (NER) is one of the first stages in deep language understanding yet current NER models heavily rely on human-annotated data.

Data Augmentation named-entity-recognition +3

Bounded Risk-Sensitive Markov Games: Forward Policy Design and Inverse Reward Learning with Iterative Reasoning and Cumulative Prospect Theory

no code implementations3 Sep 2020 Ran Tian, Liting Sun, Masayoshi Tomizuka

Classical game-theoretic approaches for multi-agent systems in both the forward policy design problem and the inverse reward learning problem often make strong rationality assumptions: agents perfectly maximize expected utilities under uncertainties.

Sticking to the Facts: Confident Decoding for Faithful Data-to-Text Generation

no code implementations19 Oct 2019 Ran Tian, Shashi Narayan, Thibault Sellam, Ankur P. Parikh

We address the issue of hallucination in data-to-text generation, i. e., reducing the generation of text that is unsupported by the source.

Data-to-Text Generation Decoder +1

Game-theoretic Modeling of Traffic in Unsignalized Intersection Network for Autonomous Vehicle Control Verification and Validation

no code implementations16 Oct 2019 Ran Tian, Nan Li, Ilya Kolmanovsky, Yildiray Yildiz, Anouck Girard

For a foreseeable future, autonomous vehicles (AVs) will operate in traffic together with human-driven vehicles.

Robotics Systems and Control Systems and Control

Beating humans in a penny-matching game by leveraging cognitive hierarchy theory and Bayesian learning

no code implementations27 Sep 2019 Ran Tian, Nan Li, Ilya Kolmanovsky, Anouck Girard

It is a long-standing goal of artificial intelligence (AI) to be superior to human beings in decision making.

Decision Making

Adaptive Game-Theoretic Decision Making for Autonomous Vehicle Control at Roundabouts

no code implementations1 Oct 2018 Ran Tian, Sisi Li, Nan Li, Ilya Kolmanovsky, Anouck Girard, Yildiray Yildiz

In this paper, we propose a decision making algorithm for autonomous vehicle control at a roundabout intersection.

Decision Making

Semi-supervised Learning with Multi-Domain Sentiment Word Embeddings

no code implementations27 Sep 2018 Ran Tian, Yash Agrawal, Kento Watanabe, Hiroya Takamura

Word embeddings are known to boost performance of many NLP tasks such as text classification, meanwhile they can be enhanced by labels at the document level to capture nuanced meaning such as sentiment and topic.

Domain Adaptation text-classification +2

Question-Answering with Logic Specific to Video Games

no code implementations LREC 2016 Corentin Dumont, Ran Tian, Kentaro Inui

We chose a popular game called {`}Minecraft{'}, and created a QA corpus with a knowledge database related to this game and the ontology of a meaning representation that will be used to structure this database.

Clustering Question Answering

The Mechanism of Additive Composition

no code implementations26 Nov 2015 Ran Tian, Naoaki Okazaki, Kentaro Inui

Additive composition (Foltz et al, 1998; Landauer and Dumais, 1997; Mitchell and Lapata, 2010) is a widely used method for computing meanings of phrases, which takes the average of vector representations of the constituent words.

Cannot find the paper you are looking for? You can Submit a new open access paper.