1 code implementation • 1 May 2024 • Jiahui Li, Tianle Shen, Zekai Gu, Jiawei Sun, Chengran Yuan, Yuhang Han, Shuo Sun, Marcelo H. Ang Jr
However, the significant time consumption and sensitivity to noise have limited the real-time predictive capability of diffusion models.
no code implementations • 26 Sep 2023 • Shuo Sun, Zekai Gu, Tianchen Sun, Jiawei Sun, Chengran Yuan, Yuhang Han, Dongen Li, Marcelo H. Ang Jr
Realistic and diverse traffic scenarios in large quantities are crucial for the development and validation of autonomous driving systems.
1 code implementation • 5 Sep 2023 • Lei Zhou, Zhiyang Liu, Runze Gan, Haozhe Wang, Marcelo H. Ang Jr
In the second stage, a novel registration network is designed to extract pose-sensitive features and predict the representation of the object's partial point cloud in canonical space, based on the deformation results from the first stage.
1 code implementation • 10 Aug 2023 • Ziyuan Huang, Shiwei Zhang, Liang Pan, Zhiwu Qing, Yingya Zhang, Ziwei Liu, Marcelo H. Ang Jr
Spatial convolutions are extensively used in numerous deep video models.
Ranked #4 on Action Recognition on EPIC-KITCHENS-100 (using extra training data)
1 code implementation • 14 Jul 2023 • Zhili Ng, Haozhe Wang, Zhengshen Zhang, Francis Tay Eng Hock, Marcelo H. Ang Jr
In this work, we present SynTable, a unified and flexible Python-based dataset generator built using NVIDIA's Isaac Sim Replicator Composer for generating high-quality synthetic datasets for unseen object amodal instance segmentation of cluttered tabletop scenes.
no code implementations • 7 Oct 2022 • Jay Karhade, Haiyue Zhu, Ka-Shing Chung, Rajesh Tripathy, Wei Lin, Marcelo H. Ang Jr
The proposed approach aims to improve rendering realism by minimizing the spectrum discrepancy between real and synthesized images, especially the discrepancy in high-frequency, localized sharpness information, which shows up visually as image blur.
no code implementations • 25 Jun 2022 • Yechao Bai, Xiaogang Wang, Marcelo H. Ang Jr, Daniela Rus
The learning and aggregation of multi-scale features are essential in empowering neural networks to capture the fine-grained geometric details in the point cloud upsampling task.
no code implementations • 12 May 2022 • YiWen Chen, Sheng Guo, Zedong Zhang, Lei Zhou, Xian Yao Ng, Marcelo H. Ang Jr
Previous methods achieved good performance on such manipulation tasks.
no code implementations • 16 Nov 2021 • Arijit Dasgupta, Jiafei Duan, Marcelo H. Ang Jr, Yi Lin, Su-hua Wang, Renée Baillargeon, Cheston Tan
Recent work in computer vision and cognitive reasoning has given rise to an increasing adoption of the Violation-of-Expectation (VoE) paradigm in synthetic datasets.
2 code implementations • ICLR 2022 • Ziyuan Huang, Shiwei Zhang, Liang Pan, Zhiwu Qing, Mingqian Tang, Ziwei Liu, Marcelo H. Ang Jr
This work presents Temporally-Adaptive Convolutions (TAdaConv) for video understanding, which shows that adaptive weight calibration along the temporal dimension is an efficient way to facilitate modelling complex temporal dynamics in videos.
Ranked #67 on Action Recognition on Something-Something V2 (using extra training data)
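The idea of calibrating convolution weights along the temporal dimension can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: it assumes a pointwise (1x1) convolution on per-frame channel vectors, and the calibration rule in `temporal_calibration` (a local temporal mean compared against the clip-level mean, scaled by a hypothetical `gain`) is an invented stand-in for whatever calibration function the paper actually learns.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def temporal_calibration(descriptors: List[Vector], gain: float = 0.5) -> List[Vector]:
    """Hypothetical calibration rule: one factor per frame and input channel.

    alpha[t][c] = 1 + gain * (local temporal mean - clip-level mean).
    Identical frames therefore give alpha == 1 everywhere.
    """
    num_frames = len(descriptors)
    num_channels = len(descriptors[0])
    clip_mean = [sum(d[c] for d in descriptors) / num_frames for c in range(num_channels)]
    alphas = []
    for t in range(num_frames):
        window = descriptors[max(0, t - 1): t + 2]  # frames t-1, t, t+1 (clipped at the ends)
        local_mean = [sum(d[c] for d in window) / len(window) for c in range(num_channels)]
        alphas.append([1.0 + gain * (local_mean[c] - clip_mean[c]) for c in range(num_channels)])
    return alphas


def temporally_adaptive_pointwise_conv(
    frames: List[Vector], base_weight: Matrix, gain: float = 0.5
) -> List[Vector]:
    """Apply a per-frame calibrated weight W_t = alpha_t * W_base to each frame."""
    alphas = temporal_calibration(frames, gain)
    outputs = []
    for x_t, alpha_t in zip(frames, alphas):
        # Scale every column (input channel) of the shared base weight by alpha_t.
        w_t = [[w * a for w, a in zip(row, alpha_t)] for row in base_weight]
        outputs.append([sum(w * x for w, x in zip(row, x_t)) for row in w_t])
    return outputs


if __name__ == "__main__":
    clip = [[0.0], [1.0], [2.0]]  # 3 frames, 1 channel
    print(temporally_adaptive_pointwise_conv(clip, [[1.0]]))
```

The base weight is shared across frames and only the lightweight calibration factors vary with time, which is the property the abstract describes as efficient.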
1 code implementation • 12 Oct 2021 • Arijit Dasgupta, Jiafei Duan, Marcelo H. Ang Jr, Cheston Tan
Recent work in cognitive reasoning and computer vision has made the Violation-of-Expectation (VoE) paradigm increasingly popular in synthetic datasets.
1 code implementation • 24 Aug 2021 • Zhiwu Qing, Ziyuan Huang, Shiwei Zhang, Mingqian Tang, Changxin Gao, Marcelo H. Ang Jr, Rong Jin, Nong Sang
The visualizations show that ParamCrop adaptively controls the center distance and the IoU between two augmented views, and the learned change in the disparity along the training process is beneficial to learning a strong representation.
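The two quantities mentioned here, the center distance and the IoU between two cropped views, can be computed as in the following sketch. It is illustrative only and not taken from the ParamCrop code: the box format `(x_min, y_min, x_max, y_max)` and the function names are assumptions.

```python
import math
from typing import Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x_min, y_min, x_max, y_max)


def box_area(box: Box) -> float:
    """Area of an axis-aligned box; degenerate boxes count as zero."""
    return max(0.0, box[2] - box[0]) * max(0.0, box[3] - box[1])


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned crop regions."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = box_area(a) + box_area(b) - inter
    return inter / union if union > 0 else 0.0


def center_distance(a: Box, b: Box) -> float:
    """Euclidean distance between the centers of two crop regions."""
    ax, ay = (a[0] + a[2]) / 2, (a[1] + a[3]) / 2
    bx, by = (b[0] + b[2]) / 2, (b[1] + b[3]) / 2
    return math.hypot(ax - bx, ay - by)


if __name__ == "__main__":
    view_1 = (0.0, 0.0, 2.0, 2.0)
    view_2 = (1.0, 1.0, 3.0, 3.0)
    print(box_iou(view_1, view_2), center_distance(view_1, view_2))
```

A low IoU combined with a large center distance means the two views share little content, so tracking both over training gives a rough picture of how hard the augmentation is.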
1 code implementation • 13 Jun 2021 • Zhiwu Qing, Ziyuan Huang, Xiang Wang, Yutong Feng, Shiwei Zhang, Jianwen Jiang, Mingqian Tang, Changxin Gao, Marcelo H. Ang Jr, Nong Sang
This technical report analyzes an egocentric video action detection method we used in the 2021 EPIC-KITCHENS-100 competition hosted at the CVPR 2021 Workshop.
1 code implementation • 9 Jun 2021 • Ziyuan Huang, Zhiwu Qing, Xiang Wang, Yutong Feng, Shiwei Zhang, Jianwen Jiang, Zhurong Xia, Mingqian Tang, Nong Sang, Marcelo H. Ang Jr
In this paper, we present empirical results for training a stronger video vision transformer on the EPIC-KITCHENS-100 Action Recognition dataset.
no code implementations • 3 Jun 2021 • Yechao Bai, Ziyuan Huang, Lyuyu Shen, Hongliang Guo, Marcelo H. Ang Jr, Daniela Rus
Experimental results on two challenging datasets, Cityscapes and COCO, demonstrate that the RSP head performs competitively on both semantic segmentation and panoptic segmentation with high efficiency.
1 code implementation • 2 Aug 2020 • Xiaogang Wang, Marcelo H. Ang Jr, Gim Hee Lee
Then we learn a mapping to transfer the point features from the partial points to those of the complete points by optimizing feature alignment losses.
2 code implementations • ECCV 2020 • Meng Tian, Marcelo H. Ang Jr, Gim Hee Lee
We present a novel learning approach to recover the 6D poses and sizes of unseen object instances from an RGB-D image.
1 code implementation • 12 Apr 2020 • Feng Xue, Guirong Zhuo, Ziyuan Huang, Wufei Fu, Zhuoyue Wu, Marcelo H. Ang Jr
Our contributions are twofold: a) a novel dense connected prediction (DCP) layer is proposed to provide better object-level depth estimation, and b) specifically for autonomous driving scenarios, dense geometrical constraints (DGC) are introduced so that a precise scale factor can be recovered without additional cost for autonomous vehicles.
Ranked #59 on Monocular Depth Estimation on KITTI Eigen split
1 code implementation • CVPR 2020 • Xiaogang Wang, Marcelo H. Ang Jr, Gim Hee Lee
Point clouds are often sparse and incomplete.
1 code implementation • 29 Feb 2020 • Meng Tian, Liang Pan, Marcelo H. Ang Jr, Gim Hee Lee
Accurate 6D object pose estimation is fundamental to robotic manipulation and grasping.