Search Results for author: Duc Thanh Nguyen

Found 25 papers, 7 papers with code

Self-supervised Video Object Segmentation with Distillation Learning of Deformable Attention

no code implementations25 Jan 2024 Quang-Trung Truong, Duc Thanh Nguyen, Binh-Son Hua, Sai-Kit Yeung

This is enabled by deformable attention mechanism, where the keys and values capturing the memory of a video sequence in the attention module have flexible locations updated across frames.

Ranked #8 on Unsupervised Video Object Segmentation on DAVIS 2016 val (using extra training data)

Knowledge Distillation Object +5

Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation

no code implementations29 Dec 2023 Tuan-Anh Vu, Duc Thanh Nguyen, Qing Guo, Binh-Son Hua, Nhat Minh Chung, Ivor W. Tsang, Sai-Kit Yeung

Such cross-domain representations are desirable in segmenting camouflaged objects where visual cues are subtle to distinguish the objects from the background, especially in segmenting novel objects which are not seen in training.

Instance Segmentation Segmentation +1

MVC: A Multi-Task Vision Transformer Network for COVID-19 Diagnosis from Chest X-ray Images

no code implementations30 Sep 2023 Huyen Tran, Duc Thanh Nguyen, John Yearwood

Medical image analysis using computer-based algorithms has attracted considerable attention from the research community and achieved tremendous progress in the last decade.

COVID-19 Diagnosis Image Classification +1

Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates

1 code implementation20 Sep 2023 Ka Chun Shum, Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

Specifically, to insert a new foreground object represented by a set of multi-view images into a background radiance field, we use a text-to-image diffusion model to learn and generate combined images that fuse the object of interest into the given background across views.

3D Reconstruction Object +1

Causal Inference via Style Transfer for Out-of-distribution Generalisation

1 code implementation6 Dec 2022 Toan Nguyen, Kien Do, Duc Thanh Nguyen, Bao Duong, Thin Nguyen

A well-known existing causal inference method like back-door adjustment cannot be applied to remove spurious correlations as it requires the observation of confounders.

Causal Inference Image Classification +2

RFNet-4D++: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds with Cross-Attention Spatio-Temporal Features

1 code implementation30 Mar 2022 Tuan-Anh Vu, Duc Thanh Nguyen, Binh-Son Hua, Quang-Hieu Pham, Sai-Kit Yeung

The key insight is simultaneously performing both tasks via learning of spatial and temporal features from a sequence of point clouds can leverage individual tasks, leading to improved overall performance.

3D Human Reconstruction 3D Reconstruction +3

Neural Scene Decoration from a Single Photograph

1 code implementation4 Aug 2021 Hong-Wing Pang, Yingshu Chen, Phuoc-Hieu Le, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

In this paper, we introduce a new problem of domain-specific indoor scene image synthesis, namely neural scene decoration.

Image Generation Scene Generation

Minimal Adversarial Examples for Deep Learning on 3D Point Clouds

no code implementations ICCV 2021 Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

With recent developments of convolutional neural networks, deep learning for 3D point clouds has shown significant progress in various 3D scene understanding tasks, e. g., object recognition, semantic segmentation.

3D Object Recognition Object Detection +3

Deep Auto-Encoders with Sequential Learning for Multimodal Dimensional Emotion Recognition

no code implementations28 Apr 2020 Dung Nguyen, Duc Thanh Nguyen, Rui Zeng, Thanh Thi Nguyen, Son N. Tran, Thin Nguyen, Sridha Sridharan, Clinton Fookes

Multimodal dimensional emotion recognition has drawn a great attention from the affective computing community and numerous schemes have been extensively investigated, making a significant progress in this area.

Emotion Recognition

LCD: Learned Cross-Domain Descriptors for 2D-3D Matching

1 code implementation21 Nov 2019 Quang-Hieu Pham, Mikaela Angelina Uy, Binh-Son Hua, Duc Thanh Nguyen, Gemma Roig, Sai-Kit Yeung

In this work, we present a novel method to learn a local cross-domain descriptor for 2D image and 3D point cloud matching.

3D Point Cloud Matching Depth Estimation +1

Deep Learning for Deepfakes Creation and Detection: A Survey

no code implementations25 Sep 2019 Thanh Thi Nguyen, Quoc Viet Hung Nguyen, Dung Tien Nguyen, Duc Thanh Nguyen, Thien Huynh-The, Saeid Nahavandi, Thanh Tam Nguyen, Quoc-Viet Pham, Cuong M. Nguyen

By reviewing the background of deepfakes and state-of-the-art deepfake detection methods, this study provides a comprehensive overview of deepfake techniques and facilitates the development of new and more robust methods to deal with the increasingly challenging deepfakes.

DeepFake Detection Face Swapping

Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data

1 code implementation ICCV 2019 Mikaela Angelina Uy, Quang-Hieu Pham, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

From our comprehensive benchmark, we show that our dataset poses great challenges to existing point cloud classification techniques as objects from real-world scans are often cluttered with background and/or are partial due to occlusions.

3D Object Classification Classification +3

Real-time Progressive 3D Semantic Segmentation for Indoor Scene

no code implementations1 Apr 2018 Quang-Hieu Pham, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

The widespread adoption of autonomous systems such as drones and assistant robots has created a need for real-time high-quality semantic scene segmentation.

3D Semantic Segmentation Clustering +2

A Field Model for Repairing 3D Shapes

no code implementations CVPR 2016 Duc Thanh Nguyen, Binh-Son Hua, Khoi Tran, Quang-Hieu Pham, Sai-Kit Yeung

The proposed method was evaluated on both artificial data and real data obtained from reconstruction of practical scenes.

An MRF-Poselets Model for Detecting Highly Articulated Humans

no code implementations ICCV 2015 Duc Thanh Nguyen, Minh-Khoi Tran, Sai-Kit Yeung

The problem of human detection is then formulated as maximum a posteriori (MAP) estimation in the MRF model.

Human Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.