no code implementations • 14 Feb 2025 • Hsu-kuang Chiu, Ryo Hachiuma, Chien-Yi Wang, Stephen F. Smith, Yu-Chiang Frank Wang, Min-Hung Chen
Inspired by recent progress using Large Language Models (LLMs) to build autonomous driving systems, we propose a novel problem setting that integrates a Multi-Modal LLM into cooperative autonomous driving, with the proposed Vehicle-to-Vehicle Question-Answering (V2V-QA) dataset and benchmark.
2 code implementations • 26 Sep 2023 • Hsu-kuang Chiu, Chien-Yi Wang, Min-Hung Chen, Stephen F. Smith
However, their proposed methods mainly use cooperative detection results as input to a standard single-sensor Kalman Filter-based tracking algorithm.
no code implementations • 20 Jun 2023 • Hsu-kuang Chiu, Stephen F. Smith
We present our approach, Collision Avoidance Detour (CAD), which won the 3rd place award in the 2023 Waymo Open Dataset Challenge - Sim Agents, held at the 2023 CVPR Workshop on Autonomous Driving.
no code implementations • 26 May 2023 • Hsu-kuang Chiu, Stephen F. Smith
The reliability of current autonomous driving systems is often jeopardized in situations when the vehicle's field-of-view is limited by nearby occluding objects.
1 code implementation • 26 Dec 2020 • Hsu-kuang Chiu, Jie Li, Rares Ambrus, Jeannette Bohg
Second, we propose to learn a metric that combines the Mahalanobis and feature distances when comparing a track and a new detection in data association.
3 code implementations • 16 Jan 2020 • Hsu-kuang Chiu, Antonio Prioletti, Jie Li, Jeannette Bohg
Our method estimates the object states by adopting a Kalman Filter.
no code implementations • ICCV 2019 • Borui Wang, Ehsan Adeli, Hsu-kuang Chiu, De-An Huang, Juan Carlos Niebles
Modeling and prediction of human motion dynamics has long been a challenging problem in computer vision, and most existing methods rely on the end-to-end supervised training of various architectures of recurrent neural networks.
Ranked #2 on
Human Pose Forecasting
on Human3.6M
(MAR, walking, 1,000ms metric)
no code implementations • 24 Apr 2019 • Hsu-kuang Chiu, Ehsan Adeli, Juan Carlos Niebles
While prior work attempts to predict future video pixels, anticipate activities or forecast future scene semantic segments from segmentation of the preceding frames, methods that predict future semantic segmentation solely from the previous frame RGB data in a single end-to-end trainable model do not exist.
1 code implementation • 23 Oct 2018 • Hsu-kuang Chiu, Ehsan Adeli, Borui Wang, De-An Huang, Juan Carlos Niebles
In this paper, we propose a new action-agnostic method for short- and long-term human pose forecasting.
Ranked #5 on
Human Pose Forecasting
on Human3.6M
(MAR, walking, 1,000ms metric)