Search Results for author: Erman Tjiputra

Found 16 papers, 13 papers with code

Controllable Group Choreography using Contrastive Diffusion

no code implementations29 Oct 2023 Nhat Le, Tuong Do, Khoa Do, Hien Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Music-driven group choreography poses a considerable challenge but holds significant potential for a wide range of industrial applications.

Music-Driven Group Choreography

no code implementations CVPR 2023 Nhat Le, Thang Pham, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

The proposed dataset consists of 16. 7 hours of paired music and 3D motion from in-the-wild videos, covering 7 dance styles and 16 music genres.

Style Transfer for 2D Talking Head Animation

1 code implementation17 Mar 2023 Trong-Thang Pham, Nhat Le, Tuong Do, Hung Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we present a new method to generate talking head animation with learnable style references.

Style Transfer

Uncertainty-aware Label Distribution Learning for Facial Expression Recognition

1 code implementation21 Sep 2022 Nhat Le, Khanh Nguyen, Quang Tran, Erman Tjiputra, Bac Le, Anh Nguyen

In this paper, we propose a new uncertainty-aware label distribution learning method to improve the robustness of deep models against uncertainty and ambiguity.

Facial Expression Recognition Facial Expression Recognition (FER)

Fine-Grained Visual Classification using Self Assessment Classifier

1 code implementation21 May 2022 Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets.

Classification Continual Learning +1

Deep Federated Learning for Autonomous Driving

1 code implementation12 Oct 2021 Anh Nguyen, Tuong Do, Minh Tran, Binh X. Nguyen, Chien Duong, Tu Phan, Erman Tjiputra, Quang D. Tran

We design a new Federated Autonomous Driving network (FADNet) that can improve the model stability, ensure convergence, and handle imbalanced data distribution problems while is being trained with federated learning methods.

Autonomous Driving Federated Learning

Coarse-to-Fine Reasoning for Visual Question Answering

2 code implementations6 Oct 2021 Binh X. Nguyen, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Bridging the semantic gap between image and question is an important step to improve the accuracy of the Visual Question Answering (VQA) task.

Question Answering Visual Question Answering

Light-weight Deformable Registration using Adversarial Learning with Distilling Knowledge

1 code implementation4 Oct 2021 Minh Q. Tran, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

We design the student network such as it is light-weight and well suitable for deployment on a typical CPU.

Multiple Meta-model Quantifying for Medical Visual Question Answering

2 code implementations19 May 2021 Tuong Do, Binh X. Nguyen, Erman Tjiputra, Minh Tran, Quang D. Tran, Anh Nguyen

However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized.

Medical Visual Question Answering Meta-Learning +3

Graph-based Person Signature for Person Re-Identifications

1 code implementation14 Apr 2021 Binh X. Nguyen, Binh D. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we propose a new method to effectively aggregate detailed person descriptions (attributes labels) and visual features (body parts and global features) into a graph, namely Graph-based Person Signature, and utilize Graph Convolutional Networks to learn the topological structure of the visual signature of a person.

Attribute Multi-Task Learning +1

Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding

1 code implementation9 Sep 2020 Binh X. Nguyen, Binh D. Nguyen, Gustavo Carneiro, Erman Tjiputra, Quang D. Tran, Thanh-Toan Do

Based on pseudo labels, we propose a novel unsupervised metric loss which enforces the positive concentration and negative separation of samples in the embedding space.

Benchmarking Clustering +2

Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network

no code implementations31 Jul 2020 Anh Nguyen, Ngoc Nguyen, Kim Tran, Erman Tjiputra, Quang D. Tran

In this work, we propose a multimodal fusion approach to address the problem of autonomous navigation in complex environments such as collapsed cites, or natural caves.

Robotics

Overcoming Data Limitation in Medical Visual Question Answering

2 code implementations26 Sep 2019 Binh D. Nguyen, Thanh-Toan Do, Binh X. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran

Traditional approaches for Visual Question Answering (VQA) require large amount of labeled data for training.

Ranked #13 on Medical Visual Question Answering on VQA-RAD (using extra training data)

Denoising Medical Visual Question Answering +3

Cannot find the paper you are looking for? You can Submit a new open access paper.