Search Results for author: Thanh-Dat Truong

Found 20 papers, 5 papers with code

Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding

no code implementations26 Nov 2023 Hoang-Quan Nguyen, Thanh-Dat Truong, Xuan Bac Nguyen, Ashley Dowling, Xin Li, Khoa Luu

In precision agriculture, the detection and recognition of insects play an essential role in the ability of crops to grow healthy and produce a high-quality yield.

Self-Supervised Learning

Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective

no code implementations25 May 2023 Thanh-Dat Truong, Khoa Luu

Then, we propose a new cross-view self-attention loss learned on unpaired cross-view data to enforce the self-attention mechanism learning to transfer knowledge across views.

Action Recognition

CROVIA: Seeing Drone Scenes from Car Perspective via Cross-View Adaptation

no code implementations14 Apr 2023 Thanh-Dat Truong, Chi Nhan Duong, Ashley Dowling, Son Lam Phung, Jackson Cothren, Khoa Luu

First, a novel geometry-based constraint to cross-view adaptation is introduced based on the geometry correlation between views.

Autonomous Driving Scene Segmentation +1

FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding

1 code implementation CVPR 2023 Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson Cothren, Khoa Luu

Although Domain Adaptation in Semantic Scene Segmentation has shown impressive improvement in recent years, the fairness concerns in the domain adaptation have yet to be well defined and addressed.

Autonomous Driving Domain Adaptation +4

Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition

no code implementations11 Sep 2022 Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Marios Savvides, Khoa Luu

We therefore introduce a new method named Attention-based Bijective Generative Adversarial Networks in a Distillation framework (DAB-GAN) to synthesize faces of a subject given his/her extracted face recognition features.

Face Recognition Face Reconstruction +2

Self-supervised Domain Adaptation in Crowd Counting

no code implementations7 Jun 2022 Pha Nguyen, Thanh-Dat Truong, Miaoqing Huang, Yi Liang, Ngan Le, Khoa Luu

Self-training crowd counting has not been attentively explored though it is one of the important challenges in computer vision.

Crowd Counting Domain Adaptation

The Right to Talk: An Audio-Visual Transformer Approach

1 code implementation ICCV 2021 Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu

Therefore, this work introduces a new Audio-Visual Transformer approach to the problem of localization and highlighting the main speaker in both audio and visual channels of a multi-speaker conversation video in the wild.

LIAAD: Lightweight Attentive Angular Distillation for Large-scale Age-Invariant Face Recognition

no code implementations9 Apr 2020 Thanh-Dat Truong, Chi Nhan Duong, Kha Gia Quach, Ngan Le, Tien D. Bui, Khoa Luu

This work presents a novel Lightweight Attentive Angular Distillation (LIAAD) approach to Large-scale Lightweight AiFR that overcomes these limitations.

Age-Invariant Face Recognition

Vec2Face: Unveil Human Faces from their Blackbox Features in Face Recognition

no code implementations CVPR 2020 Chi Nhan Duong, Thanh-Dat Truong, Kha Gia Quach, Hung Bui, Kaushik Roy, Khoa Luu

Unveiling face images of a subject given his/her high-level representations extracted from a blackbox Face Recognition engine is extremely challenging.

Benchmarking Face Recognition +2

Image Alignment in Unseen Domains via Domain Deep Generalization

no code implementations28 May 2019 Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

This paper presents a novel deep learning based approach to tackle the problem of across unseen modalities.

Domain Adaptation

Fast Flow Reconstruction via Robust Invertible nxn Convolution

no code implementations24 May 2019 Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

The experiments on CIFAR-10, ImageNet and Celeb-HQ datasets, have shown that our invertible $n \times n$ convolution helps to improve the performance of generative models significantly.

Cannot find the paper you are looking for? You can Submit a new open access paper.