Search Results for author: Eugene Ie

Found 24 papers, 11 papers with code

Multi-Level Gazetteer-Free Geocoding

no code implementations • ACL (splurobonlp) 2021 • Sayali Kulkarni, Shailee Jain, Mohammad Javad Hosseini, Jason Baldridge, Eugene Ie, Li Zhang

We present a multi-level geocoding model (MLG) that learns to associate texts to geographic coordinates.

Toponym Resolution

Paper
Add Code

Retouchdown: Releasing Touchdown on StreetLearn as a Public Resource for Language Grounding Tasks in Street View

no code implementations • EMNLP (SpLU) 2020 • Harsh Mehta, Yoav Artzi, Jason Baldridge, Eugene Ie, Piotr Mirowski

These have been added to the StreetLearn dataset and can be obtained via the same process as used previously for StreetLearn.

Vision and Language Navigation

Paper
Add Code

Pedestrian Crossing Action Recognition and Trajectory Prediction with 3D Human Keypoints

no code implementations • 1 Jun 2023 • Jiachen Li, Xinwei Shi, Feiyu Chen, Jonathan Stroud, Zhishuai Zhang, Tian Lan, Junhua Mao, Jeonhyung Kang, Khaled S. Refaat, Weilong Yang, Eugene Ie, CongCong Li

Accurate understanding and prediction of human behaviors are critical prerequisites for autonomous vehicles, especially in highly dynamic and interactive scenarios such as intersections in dense urban areas.

Action Recognition Autonomous Vehicles +3

Paper
Add Code

RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems

1 code implementation • 14 Mar 2021 • Martin Mladenov, Chih-Wei Hsu, Vihan Jain, Eugene Ie, Christopher Colby, Nicolas Mayoraz, Hubert Pham, Dustin Tran, Ivan Vendrov, Craig Boutilier

The development of recommender systems that optimize multi-turn interaction with users, and model the interactions of different agents (e. g., users, content providers, vendors) in the recommender ecosystem have drawn increasing attention in recent years.

counterfactual Probabilistic Programming +1

115

Paper
Code

On the Evaluation of Vision-and-Language Navigation Instructions

no code implementations • EACL 2021 • Ming Zhao, Peter Anderson, Vihan Jain, Su Wang, Alexander Ku, Jason Baldridge, Eugene Ie

Vision-and-Language Navigation wayfinding agents can be enhanced by exploiting automatically generated navigation instructions.

Vision and Language Navigation

Paper
Add Code

A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus

no code implementations • 18 Nov 2020 • BoWen Zhang, Hexiang Hu, Joonseok Lee, Ming Zhao, Sheide Chammas, Vihan Jain, Eugene Ie, Fei Sha

Identifying a short segment in a long video that semantically matches a text query is a challenging task that has important application potentials in language-based video search, browsing, and navigation.

Language Modelling Masked Language Modeling +3

Paper
Add Code

AQuaMuSe: Automatically Generating Datasets for Query-Based Multi-Document Summarization

1 code implementation • 23 Oct 2020 • Sayali Kulkarni, Sheide Chammas, Wan Zhu, Fei Sha, Eugene Ie

Summarization is the task of compressing source document(s) into coherent and succinct passages.

Document Summarization Multi-Document Summarization +1

Paper
Code

Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding

3 code implementations • EMNLP 2020 • Alexander Ku, Peter Anderson, Roma Patel, Eugene Ie, Jason Baldridge

We introduce Room-Across-Room (RxR), a new Vision-and-Language Navigation (VLN) dataset.

Ranked #5 on Vision and Language Navigation on RxR

Vision and Language Navigation

216

Paper
Code

Learning to Represent Image and Text with Denotation Graph

no code implementations • EMNLP 2020 • BoWen Zhang, Hexiang Hu, Vihan Jain, Eugene Ie, Fei Sha

Recent progresses have leveraged the ideas of pre-training (from language modeling) and attention layers in Transformers to learn representation from datasets containing images aligned with linguistic expressions that describe the images.

Attribute Image Retrieval +4

Paper
Add Code

Spatial Language Representation with Multi-Level Geocoding

1 code implementation • 21 Aug 2020 • Sayali Kulkarni, Shailee Jain, Mohammad Javad Hosseini, Jason Baldridge, Eugene Ie, Li Zhang

We present a multi-level geocoding model (MLG) that learns to associate texts to geographic locations.

Toponym Resolution

Paper
Code

Mean-Field Approximation to Gaussian-Softmax Integral with Application to Uncertainty Estimation

no code implementations • 13 Jun 2020 • Zhiyun Lu, Eugene Ie, Fei Sha

Many methods have been proposed to quantify the predictive uncertainty associated with the outputs of deep neural networks.

Out-of-Distribution Detection

Paper
Add Code

BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps

1 code implementation • ACL 2020 • Wang Zhu, Hexiang Hu, Jiacheng Chen, Zhiwei Deng, Vihan Jain, Eugene Ie, Fei Sha

To this end, we propose BabyWalk, a new VLN agent that is learned to navigate by decomposing long instructions into shorter ones (BabySteps) and completing them sequentially.

Ranked #7 on Visual Navigation on Cooperative Vision-and-Dialogue Navigation

Imitation Learning Navigate +1

Paper
Code

Environment-agnostic Multitask Learning for Natural Language Grounded Navigation

1 code implementation • ECCV 2020 • Xin Eric Wang, Vihan Jain, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi

Recent research efforts enable study for natural language grounded navigation in photo-realistic environments, e. g., following natural language instructions or dialog.

Ranked #8 on Visual Navigation on Cooperative Vision-and-Dialogue Navigation

Vision-Language Navigation

Paper
Code

Retouchdown: Adding Touchdown to StreetLearn as a Shareable Resource for Language Grounding Tasks in Street View

4 code implementations • 10 Jan 2020 • Harsh Mehta, Yoav Artzi, Jason Baldridge, Eugene Ie, Piotr Mirowski

These have been added to the StreetLearn dataset and can be obtained via the same process as used previously for StreetLearn.

Ranked #7 on Vision and Language Navigation on Touchdown Dataset

Vision and Language Navigation

Paper
Code

VALAN: Vision and Language Agent Navigation

1 code implementation • 6 Dec 2019 • Larry Lansing, Vihan Jain, Harsh Mehta, Haoshuo Huang, Eugene Ie

VALAN is a lightweight and scalable software framework for deep reinforcement learning based on the SEED RL architecture.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

Generalized Natural Language Grounded Navigation via Environment-agnostic Multitask Learning

no code implementations • 25 Sep 2019 • Xin Wang, Vihan Jain, Eugene Ie, William Wang, Zornitsa Kozareva, Sujith Ravi

Recent research efforts enable study for natural language grounded navigation in photo-realistic environments, e. g., following natural language instructions or dialog.

Vision-Language Navigation

Paper
Add Code

Learning Dense Representations for Entity Retrieval

no code implementations • CONLL 2019 • Daniel Gillick, Sayali Kulkarni, Larry Lansing, Alessandro Presta, Jason Baldridge, Eugene Ie, Diego Garcia-Olano

We show that it is feasible to perform entity linking by training a dual encoder (two-tower) model that encodes mentions and entities in the same dense vector space, where candidate entities are retrieved by approximate nearest neighbor search.

Entity Linking Entity Retrieval +1

Paper
Add Code

RecSim: A Configurable Simulation Platform for Recommender Systems

1 code implementation • 11 Sep 2019 • Eugene Ie, Chih-Wei Hsu, Martin Mladenov, Vihan Jain, Sanmit Narvekar, Jing Wang, Rui Wu, Craig Boutilier

We propose RecSim, a configurable platform for authoring simulation environments for recommender systems (RSs) that naturally supports sequential interaction with users.

Recommendation Systems reinforcement-learning +1

725

Paper
Code

Transferable Representation Learning in Vision-and-Language Navigation

no code implementations • ICCV 2019 • Haoshuo Huang, Vihan Jain, Harsh Mehta, Alexander Ku, Gabriel Magalhaes, Jason Baldridge, Eugene Ie

Vision-and-Language Navigation (VLN) tasks such as Room-to-Room (R2R) require machine agents to interpret natural language instructions and learn to act in visually realistic environments to achieve navigation goals.

Ranked #115 on Vision and Language Navigation on VLN Challenge

Representation Learning Vision and Language Navigation

Paper
Add Code

General Evaluation for Instruction Conditioned Navigation using Dynamic Time Warping

1 code implementation • 11 Jul 2019 • Gabriel Ilharco, Vihan Jain, Alexander Ku, Eugene Ie, Jason Baldridge

We address fundamental flaws in previously used metrics and show how Dynamic Time Warping (DTW), a long known method of measuring similarity between two time series, can be used for evaluation of navigation agents.

Dynamic Time Warping Navigate +2

104

Paper
Code

Multi-modal Discriminative Model for Vision-and-Language Navigation

no code implementations • WS 2019 • Haoshuo Huang, Vihan Jain, Harsh Mehta, Jason Baldridge, Eugene Ie

Vision-and-Language Navigation (VLN) is a natural language grounding task where agents have to interpret natural language instructions in the context of visual scenes in a dynamic environment to achieve prescribed navigation goals.

Vision and Language Navigation

Paper
Add Code

Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation

no code implementations • ACL 2019 • Vihan Jain, Gabriel Magalhaes, Alexander Ku, Ashish Vaswani, Eugene Ie, Jason Baldridge

We also show that the existing paths in the dataset are not ideal for evaluating instruction following because they are direct-to-goal shortest paths.

Instruction Following Vision and Language Navigation

Paper
Add Code

Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology

3 code implementations • 29 May 2019 • Eugene Ie, Vihan Jain, Jing Wang, Sanmit Narvekar, Ritesh Agarwal, Rui Wu, Heng-Tze Cheng, Morgane Lustman, Vince Gatto, Paul Covington, Jim McFadden, Tushar Chandra, Craig Boutilier

(i) We develop SLATEQ, a decomposition of value-based temporal-difference and Q-learning that renders RL tractable with slates.

Q-Learning Recommendation Systems +2

31,490

Paper
Code

Using Web Co-occurrence Statistics for Improving Image Categorization

no code implementations • 19 Dec 2013 • Samy Bengio, Jeff Dean, Dumitru Erhan, Eugene Ie, Quoc Le, Andrew Rabinovich, Jonathon Shlens, Yoram Singer

Albeit the simplicity of the resulting optimization problem, it is effective in improving both recognition and localization accuracy.

Common Sense Reasoning Image Categorization +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.