Search Results for author: Jiquan Ngiam

Found 22 papers, 6 papers with code

DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection

1 code implementation • CVPR 2022 • Yingwei Li, Adams Wei Yu, Tianjian Meng, Ben Caine, Jiquan Ngiam, Daiyi Peng, Junyang Shen, Bo Wu, Yifeng Lu, Denny Zhou, Quoc V. Le, Alan Yuille, Mingxing Tan

In this paper, we propose two novel techniques: InverseAug that inverses geometric-related augmentations, e. g., rotation, to enable accurate geometric alignment between lidar points and image pixels, and LearnableAlign that leverages cross-attention to dynamically capture the correlations between image and lidar features during fusion.

3D Object Detection Autonomous Driving +2

2,781

Paper
Code

Scene Transformer: A unified architecture for predicting future trajectories of multiple agents

no code implementations • ICLR 2022 • Jiquan Ngiam, Vijay Vasudevan, Benjamin Caine, Zhengdong Zhang, Hao-Tien Lewis Chiang, Jeffrey Ling, Rebecca Roelofs, Alex Bewley, Chenxi Liu, Ashish Venugopal, David J Weiss, Ben Sapp, Zhifeng Chen, Jonathon Shlens

In this work, we formulate a model for predicting the behavior of all agents jointly, producing consistent futures that account for interactions between agents.

Autonomous Driving Language Modelling +1

Paper
Add Code

To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels

no code implementations • CVPR 2021 • Yuning Chai, Pei Sun, Jiquan Ngiam, Weiyue Wang, Benjamin Caine, Vijay Vasudevan, Xiao Zhang, Dragomir Anguelov

3D object detection is vital for many robotics applications.

3D Object Detection object-detection +1

Paper
Add Code

Scene Transformer: A unified architecture for predicting multiple agent trajectories

3 code implementations • 15 Jun 2021 • Jiquan Ngiam, Benjamin Caine, Vijay Vasudevan, Zhengdong Zhang, Hao-Tien Lewis Chiang, Jeffrey Ling, Rebecca Roelofs, Alex Bewley, Chenxi Liu, Ashish Venugopal, David Weiss, Ben Sapp, Zhifeng Chen, Jonathon Shlens

In this work, we formulate a model for predicting the behavior of all agents jointly, producing consistent futures that account for interactions between agents.

Autonomous Driving Language Modelling +1

119

Paper
Code

Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset

no code implementations • 20 Apr 2021 • Scott Ettinger, Shuyang Cheng, Benjamin Caine, Chenxi Liu, Hang Zhao, Sabeek Pradhan, Yuning Chai, Ben Sapp, Charles Qi, Yin Zhou, Zoey Yang, Aurelien Chouard, Pei Sun, Jiquan Ngiam, Vijay Vasudevan, Alexander McCauley, Jonathon Shlens, Dragomir Anguelov

Furthermore, we introduce a new set of metrics that provides a comprehensive evaluation of both single agent and joint agent interaction motion forecasting models.

Motion Forecasting Motion Planning

Paper
Add Code

3D-MAN: 3D Multi-frame Attention Network for Object Detection

no code implementations • CVPR 2021 • Zetong Yang, Yin Zhou, Zhifeng Chen, Jiquan Ngiam

In this paper, we present 3D-MAN: a 3D multi-frame attention network that effectively aggregates features from multiple perspectives and achieves state-of-the-art performance on Waymo Open Dataset.

3D Object Detection Autonomous Driving +1

Paper
Add Code

Pseudo-labeling for Scalable 3D Object Detection

no code implementations • 2 Mar 2021 • Benjamin Caine, Rebecca Roelofs, Vijay Vasudevan, Jiquan Ngiam, Yuning Chai, Zhifeng Chen, Jonathon Shlens

To safely deploy autonomous vehicles, onboard perception systems must work reliably at high accuracy across a diverse set of environments and geographies.

3D Object Detection Autonomous Vehicles +5

Paper
Add Code

Large Scale Interactive Motion Forecasting for Autonomous Driving: The Waymo Open Motion Dataset

no code implementations • ICCV 2021 • Scott Ettinger, Shuyang Cheng, Benjamin Caine, Chenxi Liu, Hang Zhao, Sabeek Pradhan, Yuning Chai, Ben Sapp, Charles R. Qi, Yin Zhou, Zoey Yang, Aurelien Chouard, Pei Sun, Jiquan Ngiam, Vijay Vasudevan, Alexander McCauley, Jonathon Shlens, Dragomir Anguelov

Furthermore, we introduce a new set of metrics that provides a comprehensive evaluation of both single agent and joint agent interaction motion forecasting models.

Motion Forecasting Motion Planning

Paper
Add Code

Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout

1 code implementation • NeurIPS 2020 • Zhao Chen, Jiquan Ngiam, Yanping Huang, Thang Luong, Henrik Kretzschmar, Yuning Chai, Dragomir Anguelov

The vast majority of deep models use multiple gradient signals, typically corresponding to a sum of multiple loss terms, to update a shared set of trainable weights.

Transfer Learning

2,781

Paper
Code

Streaming Object Detection for 3-D Point Clouds

no code implementations • ECCV 2020 • Wei Han, Zhengdong Zhang, Benjamin Caine, Brandon Yang, Christoph Sprunk, Ouais Alsharif, Jiquan Ngiam, Vijay Vasudevan, Jonathon Shlens, Zhifeng Chen

This built-in data capture latency is artificial, and based on treating the point cloud as a camera image in order to leverage camera-inspired architectures.

Action Recognition Autonomous Vehicles +4

Paper
Add Code

Improving 3D Object Detection through Progressive Population Based Augmentation

no code implementations • ECCV 2020 • Shuyang Cheng, Zhaoqi Leng, Ekin Dogus Cubuk, Barret Zoph, Chunyan Bai, Jiquan Ngiam, Yang song, Benjamin Caine, Vijay Vasudevan, Cong-Cong Li, Quoc V. Le, Jonathon Shlens, Dragomir Anguelov

Data augmentation has been widely adopted for object detection in 3D point clouds.

3D Object Detection Data Augmentation +2

Paper
Add Code

Scalability in Perception for Autonomous Driving: Waymo Open Dataset

8 code implementations • CVPR 2020 • Pei Sun, Henrik Kretzschmar, Xerxes Dotiwalla, Aurelien Chouard, Vijaysai Patnaik, Paul Tsui, James Guo, Yin Zhou, Yuning Chai, Benjamin Caine, Vijay Vasudevan, Wei Han, Jiquan Ngiam, Hang Zhao, Aleksei Timofeev, Scott Ettinger, Maxim Krivokon, Amy Gao, Aditya Joshi, Sheng Zhao, Shuyang Cheng, Yu Zhang, Jonathon Shlens, Zhifeng Chen, Dragomir Anguelov

In an effort to help align the research community's contributions with real-world self-driving problems, we introduce a new large scale, high quality, diverse dataset.

Autonomous Driving

4,774

Paper
Code

End-to-End Multi-View Fusion for 3D Object Detection in LiDAR Point Clouds

no code implementations • 15 Oct 2019 • Yin Zhou, Pei Sun, Yu Zhang, Dragomir Anguelov, Jiyang Gao, Tom Ouyang, James Guo, Jiquan Ngiam, Vijay Vasudevan

In this paper, we aim to synergize the birds-eye view and the perspective view and propose a novel end-to-end multi-view fusion (MVF) algorithm, which can effectively learn to utilize the complementary information from both.

3D Object Detection object-detection

Paper
Add Code

StarNet: Targeted Computation for Object Detection in Point Clouds

no code implementations • 29 Aug 2019 • Jiquan Ngiam, Benjamin Caine, Wei Han, Brandon Yang, Yuning Chai, Pei Sun, Yin Zhou, Xi Yi, Ouais Alsharif, Patrick Nguyen, Zhifeng Chen, Jonathon Shlens, Vijay Vasudevan

We show how our redesign---namely using only local information and using sampling instead of learned proposals---leads to a significantly more flexible and adaptable system: we demonstrate how we can vary the computational cost of a single trained StarNet without retraining, and how we can target proposals towards areas of interest with priors and heuristics.

3D Object Detection Object +3

Paper
Add Code

Learning a Multi-Domain Curriculum for Neural Machine Translation

no code implementations • ACL 2020 • Wei Wang, Ye Tian, Jiquan Ngiam, Yinfei Yang, Isaac Caswell, Zarana Parekh

Most data selection research in machine translation focuses on improving a single domain.

Denoising Machine Translation +1

Paper
Add Code

Using Videos to Evaluate Image Model Robustness

no code implementations • 22 Apr 2019 • Keren Gu, Brandon Yang, Jiquan Ngiam, Quoc Le, Jonathon Shlens

Compared to previous studies on adversarial examples and synthetic distortions, natural robustness captures a more diverse set of common image transformations that occur in the natural environment.

Paper
Add Code

CondConv: Conditionally Parameterized Convolutions for Efficient Inference

9 code implementations • NeurIPS 2019 • Brandon Yang, Gabriel Bender, Quoc V. Le, Jiquan Ngiam

We demonstrate that scaling networks with CondConv improves the performance and inference cost trade-off of several existing convolutional neural network architectures on both classification and detection tasks.

Ranked #773 on Image Classification on ImageNet

General Classification Image Classification +1

10,797

Paper
Code

GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism

13 code implementations • NeurIPS 2019 • Yanping Huang, Youlong Cheng, Ankur Bapna, Orhan Firat, Mia Xu Chen, Dehao Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V. Le, Yonghui Wu, Zhifeng Chen

Scaling up deep neural network capacity has been known as an effective approach to improving model quality for several different machine learning tasks.

Ranked #4 on Fine-Grained Image Classification on Birdsnap (using extra training data)

Fine-Grained Image Classification Machine Translation +1

2,781

Paper
Code

Domain Adaptive Transfer Learning with Specialist Models

no code implementations • 16 Nov 2018 • Jiquan Ngiam, Daiyi Peng, Vijay Vasudevan, Simon Kornblith, Quoc V. Le, Ruoming Pang

Our method to compute importance weights follow from ideas in domain adaptation, and we show a novel application to transfer learning.

Ranked #3 on Fine-Grained Image Classification on Stanford Cars (using extra training data)

Domain Adaptation Fine-Grained Image Classification +2

Paper
Add Code

Sparse Filtering

no code implementations • NeurIPS 2011 • Jiquan Ngiam, Zhenghao Chen, Sonia A. Bhaskar, Pang W. Koh, Andrew Y. Ng

Unsupervised feature learning has been shown to be effective at learning representations that perform well on image, video and audio classification.

Audio Classification General Classification

Paper
Add Code

ICA with Reconstruction Cost for Efficient Overcomplete Feature Learning

no code implementations • NeurIPS 2011 • Quoc V. Le, Alexandre Karpenko, Jiquan Ngiam, Andrew Y. Ng

We show that the soft reconstruction cost can also be used to prevent replicated features in tiled convolutional neural networks.

Ranked #118 on Image Classification on STL-10

Image Classification Object Recognition

Paper
Add Code

Tiled convolutional neural networks

no code implementations • NeurIPS 2010 • Jiquan Ngiam, Zhenghao Chen, Daniel Chia, Pang W. Koh, Quoc V. Le, Andrew Y. Ng

Using convolutional (tied) weights signiﬁcantly reduces the number of parameters that have to be learned, and also allows translational invariance to be hard-coded into the architecture.

Object Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.