Search Results for author: Yuning Chai

Found 23 papers, 7 papers with code

VLMine: Long-Tail Data Mining with Vision Language Models

no code implementations23 Sep 2024 Mao Ye, Gregory P. Meyer, Zaiwei Zhang, Dennis Park, Siva Karthik Mustikovela, Yuning Chai, Eric M Wolff

We propose a simple and scalable data mining approach that leverages the knowledge contained within a large vision language model (VLM).

3D Object Detection Autonomous Driving +3

ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

3 code implementations CVPR 2024 Mu Cai, Haotian Liu, Dennis Park, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Yong Jae Lee

Furthermore, we present ViP-Bench, a comprehensive benchmark to assess the capability of models in understanding visual prompts across multiple dimensions, enabling future research in this domain.

Visual Commonsense Reasoning Visual Prompting

SHIFT3D: Synthesizing Hard Inputs For Tricking 3D Detectors

no code implementations ICCV 2023 Hongge Chen, Zhao Chen, Gregory P. Meyer, Dennis Park, Carl Vondrick, Ashish Shrivastava, Yuning Chai

We present SHIFT3D, a differentiable pipeline for generating 3D shapes that are structurally plausible yet challenging to 3D object detectors.

Autonomous Driving Object

NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects

1 code implementation24 Aug 2023 Dakshit Agrawal, Jiajie Xu, Siva Karthik Mustikovela, Ioannis Gkioulekas, Ashish Shrivastava, Yuning Chai

We propose a novel-view augmentation (NOVA) strategy to train NeRFs for photo-realistic 3D composition of dynamic objects in a static scene.

Optical Flow Estimation

Efficient Transformer-based 3D Object Detection with Dynamic Token Halting

no code implementations ICCV 2023 Mao Ye, Gregory P. Meyer, Yuning Chai, Qiang Liu

Although halting a token is a non-differentiable operation, our method allows for differentiable end-to-end learning by leveraging an equivalent differentiable forward-pass.

3D Object Detection Autonomous Vehicles +1

Occupancy Flow Fields for Motion Forecasting in Autonomous Driving

no code implementations8 Mar 2022 Reza Mahjourian, Jinkyu Kim, Yuning Chai, Mingxing Tan, Ben Sapp, Dragomir Anguelov

We propose Occupancy Flow Fields, a new representation for motion forecasting of multiple agents, an important task in autonomous driving.

Motion Estimation Motion Forecasting

Pseudo-labeling for Scalable 3D Object Detection

no code implementations2 Mar 2021 Benjamin Caine, Rebecca Roelofs, Vijay Vasudevan, Jiquan Ngiam, Yuning Chai, Zhifeng Chen, Jonathon Shlens

To safely deploy autonomous vehicles, onboard perception systems must work reliably at high accuracy across a diverse set of environments and geographies.

3D Object Detection Autonomous Vehicles +5

Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout

2 code implementations NeurIPS 2020 Zhao Chen, Jiquan Ngiam, Yanping Huang, Thang Luong, Henrik Kretzschmar, Yuning Chai, Dragomir Anguelov

The vast majority of deep models use multiple gradient signals, typically corresponding to a sum of multiple loss terms, to update a shared set of trainable weights.

Transfer Learning

StarNet: Targeted Computation for Object Detection in Point Clouds

no code implementations29 Aug 2019 Jiquan Ngiam, Benjamin Caine, Wei Han, Brandon Yang, Yuning Chai, Pei Sun, Yin Zhou, Xi Yi, Ouais Alsharif, Patrick Nguyen, Zhifeng Chen, Jonathon Shlens, Vijay Vasudevan

We show how our redesign---namely using only local information and using sampling instead of learned proposals---leads to a significantly more flexible and adaptable system: we demonstrate how we can vary the computational cost of a single trained StarNet without retraining, and how we can target proposals towards areas of interest with priors and heuristics.

3D Object Detection Object +3

FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation

3 code implementations CVPR 2019 Paul Voigtlaender, Yuning Chai, Florian Schroff, Hartwig Adam, Bastian Leibe, Liang-Chieh Chen

Many of the recent successful methods for video object segmentation (VOS) are overly complicated, heavily rely on fine-tuning on the first frame, and/or are slow, and are hence of limited practical use.

Object Segmentation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.