Search Results for author: John Yang

Found 15 papers, 5 papers with code

Broadcasting Convolutional Network for Visual Relational Reasoning

no code implementations ECCV 2018 Simyung Chang, John Yang, SeongUk Park, Nojun Kwak

In this paper, we propose the Broadcasting Convolutional Network (BCN) that extracts key object features from the global field of an entire input image and recognizes their relationship with local features.

Relation Relational Reasoning +1

Towards Governing Agent's Efficacy: Action-Conditional $β$-VAE for Deep Transparent Reinforcement Learning

no code implementations11 Nov 2018 John Yang, Gyujeong Lee, Minsung Hyun, Simyung Chang, Nojun Kwak

We tackle the blackbox issue of deep neural networks in the settings of reinforcement learning (RL) where neural agents learn towards maximizing reward gains in an uncontrollable way.

reinforcement-learning Reinforcement Learning (RL) +1

Genetic-Gated Networks for Deep Reinforcement

no code implementations26 Nov 2018 Simyung Chang, John Yang, Jae-Seok Choi, Nojun Kwak

We introduce the Genetic-Gated Networks (G2Ns), simple neural networks that combine a gate vector composed of binary genetic genes in the hidden layer(s) of networks.

reinforcement-learning Reinforcement Learning (RL)

Sym-parameterized Dynamic Inference for Mixed-Domain Image Translation

1 code implementation ICCV 2019 Simyung Chang, SeongUk Park, John Yang, Nojun Kwak

Recent advances in image-to-image translation have led to some ways to generate multiple domain images through a single network.

Image-to-Image Translation Translation

Genetic-Gated Networks for Deep Reinforcement Learning

no code implementations NeurIPS 2018 Simyung Chang, John Yang, Jaeseok Choi, Nojun Kwak

We introduce the Genetic-Gated Networks (G2Ns), simple neural networks that combine a gate vector composed of binary genetic genes in the hidden layer(s) of networks.

reinforcement-learning Reinforcement Learning (RL)

SeqHAND:RGB-Sequence-Based 3D Hand Pose and Shape Estimation

no code implementations10 Jul 2020 John Yang, Hyung Jin Chang, Seungeui Lee, Nojun Kwak

In this paper, we attempt to not only consider the appearance of a hand but incorporate the temporal movement information of a hand in motion into the learning framework for better 3D hand pose estimation performance, which leads to the necessity of a large scale dataset with sequential RGB hand images.

3D Hand Pose Estimation

Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation

no code implementations11 Nov 2021 John Yang, Yash Bhalgat, Simyung Chang, Fatih Porikli, Nojun Kwak

While hand pose estimation is a critical component of most interactive extended reality and gesture recognition systems, contemporary approaches are not optimized for computational and memory efficiency.

3D Hand Pose Estimation Gesture Recognition

Depth Estimation with Simplified Transformer

no code implementations28 Apr 2022 John Yang, Le An, Anurag Dixit, Jinkyu Koo, Su Inn Park

Transformer and its variants have shown state-of-the-art results in many vision tasks recently, ranging from image classification to dense prediction.

Autonomous Driving Image Classification +1

WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

1 code implementation4 Jul 2022 Shunyu Yao, Howard Chen, John Yang, Karthik Narasimhan

Existing benchmarks for grounding language in interactive environments either lack real-world linguistic elements, or prove difficult to scale up due to substantial human involvement in the collection of data or feedback signals.

Imitation Learning Navigate

Referral Augmentation for Zero-Shot Information Retrieval

1 code implementation24 May 2023 Michael Tang, Shunyu Yao, John Yang, Karthik Narasimhan

We propose Referral-Augmented Retrieval (RAR), a simple technique that concatenates document indices with referrals, i. e. text from other documents that cite or link to the given document, to provide significant performance gains for zero-shot information retrieval.

Information Retrieval Retrieval

Swin-Free: Achieving Better Cross-Window Attention and Efficiency with Size-varying Window

no code implementations23 Jun 2023 Jinkyu Koo, John Yang, Le An, Gwenaelle Cunha Sergio, Su Inn Park

To mitigate this issue, we propose Swin-Free in which we apply size-varying windows across stages, instead of shifting windows, to achieve cross-connection among local windows.

InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback

2 code implementations NeurIPS 2023 John Yang, Akshara Prabhakar, Karthik Narasimhan, Shunyu Yao

Our framework is language and platform agnostic, uses self-contained Docker environments to provide safe and reproducible execution, and is compatible out-of-the-box with traditional seq2seq coding methods, while enabling the development of new methods for interactive code generation.

Benchmarking Code Generation +1

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

no code implementations10 Oct 2023 Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan

We find real-world software engineering to be a rich, sustainable, and challenging testbed for evaluating the next generation of language models.

Bug fixing Code Generation +1

SeqHAND: RGB-Sequence-Based 3D Hand Pose and Shape Estimation

no code implementations ECCV 2020 John Yang, Hyung Jin Chang, Seungeui Lee, Nojun Kwak

In this paper, we attempt to not only consider the appearance of a hand but incorporate the temporal movement information of a hand in motion into the learning framework for better 3D hand pose estimation performance, which leads to the necessity of a large scale dataset with sequential RGB hand images.

3D Hand Pose Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.