no code implementations • 25 Jan 2024 • Quang-Trung Truong, Duc Thanh Nguyen, Binh-Son Hua, Sai-Kit Yeung
This is enabled by deformable attention mechanism, where the keys and values capturing the memory of a video sequence in the attention module have flexible locations updated across frames.
Ranked #8 on Unsupervised Video Object Segmentation on DAVIS 2016 val (using extra training data)
no code implementations • 29 Dec 2023 • Tuan-Anh Vu, Duc Thanh Nguyen, Qing Guo, Binh-Son Hua, Nhat Minh Chung, Ivor W. Tsang, Sai-Kit Yeung
Such cross-domain representations are desirable in segmenting camouflaged objects where visual cues are subtle to distinguish the objects from the background, especially in segmenting novel objects which are not seen in training.
no code implementations • 17 Oct 2023 • Miao Chang, Tan Vuong, Manas Palaparthi, Lachlan Howell, Alessio Bonti, Mohamed Abdelrazek, Duc Thanh Nguyen
Specifically, we collect a realistic dataset of drone-derived wildlife thermal detections.
no code implementations • 30 Sep 2023 • Huyen Tran, Duc Thanh Nguyen, John Yearwood
Medical image analysis using computer-based algorithms has attracted considerable attention from the research community and achieved tremendous progress in the last decade.
1 code implementation • 20 Sep 2023 • Ka Chun Shum, Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung
Specifically, to insert a new foreground object represented by a set of multi-view images into a background radiance field, we use a text-to-image diffusion model to learn and generate combined images that fuse the object of interest into the given background across views.
no code implementations • ICCV 2023 • Ka Chun Shum, Hong-Wing Pang, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung
We use this object layout to condition a generative adversarial network to synthesize images of an input scene.
1 code implementation • 6 Dec 2022 • Toan Nguyen, Kien Do, Duc Thanh Nguyen, Bao Duong, Thin Nguyen
A well-known existing causal inference method like back-door adjustment cannot be applied to remove spurious correlations as it requires the observation of confounders.
no code implementations • 16 Nov 2022 • Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung
In this paper, we propose a new method for mapping a 3D point cloud to the latent space of a 3D generative adversarial network.
1 code implementation • 30 Mar 2022 • Tuan-Anh Vu, Duc Thanh Nguyen, Binh-Son Hua, Quang-Hieu Pham, Sai-Kit Yeung
The key insight is simultaneously performing both tasks via learning of spatial and temporal features from a sequence of point clouds can leverage individual tasks, leading to improved overall performance.
Ranked #1 on 3D Human Reconstruction on Dynamic FAUST
1 code implementation • 4 Aug 2021 • Hong-Wing Pang, Yingshu Chen, Phuoc-Hieu Le, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung
In this paper, we introduce a new problem of domain-specific indoor scene image synthesis, namely neural scene decoration.
no code implementations • ICCV 2021 • Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung
With recent developments of convolutional neural networks, deep learning for 3D point clouds has shown significant progress in various 3D scene understanding tasks, e. g., object recognition, semantic segmentation.
no code implementations • 23 Jun 2020 • Dung Nguyen, Sridha Sridharan, Duc Thanh Nguyen, Simon Denman, David Dean, Clinton Fookes
To mitigate this challenge, transfer learning performing fine-tuning on pre-trained models has been applied.
Facial Expression Recognition Facial Expression Recognition (FER) +2
no code implementations • 28 Apr 2020 • Dung Nguyen, Duc Thanh Nguyen, Rui Zeng, Thanh Thi Nguyen, Son N. Tran, Thin Nguyen, Sridha Sridharan, Clinton Fookes
Multimodal dimensional emotion recognition has drawn a great attention from the affective computing community and numerous schemes have been extensively investigated, making a significant progress in this area.
no code implementations • 24 Mar 2020 • Dung Nguyen, Sridha Sridharan, Duc Thanh Nguyen, Simon Denman, Son N. Tran, Rui Zeng, Clinton Fookes
Deep learning has been applied to achieve significant progress in emotion recognition.
no code implementations • ECCV 2020 • Jing Yu Koh, Duc Thanh Nguyen, Quang-Trung Truong, Sai-Kit Yeung, Alexander Binder
Fully-automatic execution is the ultimate goal for many Computer Vision applications.
1 code implementation • 21 Nov 2019 • Quang-Hieu Pham, Mikaela Angelina Uy, Binh-Son Hua, Duc Thanh Nguyen, Gemma Roig, Sai-Kit Yeung
In this work, we present a novel method to learn a local cross-domain descriptor for 2D image and 3D point cloud matching.
no code implementations • 25 Sep 2019 • Thanh Thi Nguyen, Quoc Viet Hung Nguyen, Dung Tien Nguyen, Duc Thanh Nguyen, Thien Huynh-The, Saeid Nahavandi, Thanh Tam Nguyen, Quoc-Viet Pham, Cuong M. Nguyen
By reviewing the background of deepfakes and state-of-the-art deepfake detection methods, this study provides a comprehensive overview of deepfake techniques and facilitates the development of new and more robust methods to deal with the increasingly challenging deepfakes.
1 code implementation • ICCV 2019 • Mikaela Angelina Uy, Quang-Hieu Pham, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung
From our comprehensive benchmark, we show that our dataset poses great challenges to existing point cloud classification techniques as objects from real-world scans are often cluttered with background and/or are partial due to occlusions.
1 code implementation • CVPR 2019 • Quang-Hieu Pham, Duc Thanh Nguyen, Binh-Son Hua, Gemma Roig, Sai-Kit Yeung
Deep learning techniques have become the to-go models for most vision-related tasks on 2D images.
Ranked #2 on 3D Instance Segmentation on SceneNN
3D Instance Segmentation 3D Semantic Instance Segmentation +3
no code implementations • ECCV 2018 • Tian Feng, Quang-Trung Truong, Duc Thanh Nguyen, Jing Yu Koh, Lap-Fai Yu, Alexander Binder, Sai-Kit Yeung
Urban zoning enables various applications in land use analysis and urban planning.
no code implementations • 1 Apr 2018 • Quang-Hieu Pham, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung
The widespread adoption of autonomous systems such as drones and assistant robots has created a need for real-time high-quality semantic scene segmentation.
no code implementations • 19 Oct 2016 • Duc Thanh Nguyen, Binh-Son Hua, Lap-Fai Yu, Sai-Kit Yeung
Recent advances of 3D acquisition devices have enabled large-scale acquisition of 3D scene data.
no code implementations • CVPR 2016 • Duc Thanh Nguyen, Binh-Son Hua, Khoi Tran, Quang-Hieu Pham, Sai-Kit Yeung
The proposed method was evaluated on both artificial data and real data obtained from reconstruction of practical scenes.
no code implementations • ICCV 2015 • Duc Thanh Nguyen, Minh-Khoi Tran, Sai-Kit Yeung
The problem of human detection is then formulated as maximum a posteriori (MAP) estimation in the MRF model.
no code implementations • CVPR 2014 • Duc Thanh Nguyen
This paper proposes a novel mean field-based Chamfer template matching method.