Search Results for author: Jens Tuyls

Found 8 papers, 4 papers with code

Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning

1 code implementation11 Dec 2024 Chongyi Zheng, Jens Tuyls, Joanne Peng, Benjamin Eysenbach

Self-supervised learning has the potential of lifting several of the key challenges in reinforcement learning today, such as exploration, representation learning, and reward design.

Representation Learning Self-Supervised Learning

Language-Guided World Models: A Model-Based Approach to AI Control

no code implementations24 Jan 2024 Alex Zhang, Khanh Nguyen, Jens Tuyls, Albert Lin, Karthik Narasimhan

In this work, we take initial steps in developing robust LWMs that can generalize to compositionally novel language descriptions.

Semantic Parsing

Scaling Laws for Imitation Learning in Single-Agent Games

1 code implementation18 Jul 2023 Jens Tuyls, Dhruv Madeka, Kari Torkkola, Dean Foster, Karthik Narasimhan, Sham Kakade

Inspired by recent work in Natural Language Processing (NLP) where "scaling up" has resulted in increasingly more capable LLMs, we investigate whether carefully scaling up model and data size can bring similar improvements in the imitation learning setting for single-agent games.

Atari Games Imitation Learning +1

Multi-Stage Episodic Control for Strategic Exploration in Text Games

1 code implementation ICLR 2022 Jens Tuyls, Shunyu Yao, Sham Kakade, Karthik Narasimhan

Text adventure games present unique challenges to reinforcement learning methods due to their combinatorially large action spaces and sparse rewards.

Gradient-based Analysis of NLP Models is Manipulable

no code implementations Findings of the Association for Computational Linguistics 2020 Junlin Wang, Jens Tuyls, Eric Wallace, Sameer Singh

Gradient-based analysis methods, such as saliency map visualizations and adversarial input perturbations, have found widespread use in interpreting neural NLP models due to their simplicity, flexibility, and most importantly, their faithfulness.

text-classification Text Classification

Generative Modeling for Atmospheric Convection

no code implementations3 Jul 2020 Griffin Mooers, Jens Tuyls, Stephan Mandt, Michael Pritchard, Tom Beucler

While cloud-resolving models can explicitly simulate the details of small-scale storm formation and morphology, these details are often ignored by climate models for lack of computational resources.

Clustering Dimensionality Reduction +1

AllenNLP Interpret: A Framework for Explaining Predictions of NLP Models

1 code implementation IJCNLP 2019 Eric Wallace, Jens Tuyls, Junlin Wang, Sanjay Subramanian, Matt Gardner, Sameer Singh

Neural NLP models are increasingly accurate but are imperfect and opaque---they break in counterintuitive ways and leave end users puzzled at their behavior.

Language Modeling Language Modelling +2

Cannot find the paper you are looking for? You can Submit a new open access paper.