Search Results for author: Erman Tjiputra

Found 16 papers, 13 papers with code

Controllable Group Choreography using Contrastive Diffusion

no code implementations • 29 Oct 2023 • Nhat Le, Tuong Do, Khoa Do, Hien Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Music-driven group choreography poses a considerable challenge but holds significant potential for a wide range of industrial applications.

Music-Driven Group Choreography

no code implementations • CVPR 2023 • Nhat Le, Thang Pham, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

The proposed dataset consists of 16.7 hours of paired music and 3D motion from in-the-wild videos, covering 7 dance styles and 16 music genres.

Style Transfer for 2D Talking Head Animation

1 code implementation • 17 Mar 2023 • Trong-Thang Pham, Nhat Le, Tuong Do, Hung Nguyen, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we present a new method to generate talking head animation with learnable style references.

Style Transfer

Uncertainty-aware Label Distribution Learning for Facial Expression Recognition

1 code implementation • 21 Sep 2022 • Nhat Le, Khanh Nguyen, Quang Tran, Erman Tjiputra, Bac Le, Anh Nguyen

In this paper, we propose a new uncertainty-aware label distribution learning method to improve the robustness of deep models against uncertainty and ambiguity.

Facial Expression Recognition Facial Expression Recognition (FER)

Reducing Training Time in Cross-Silo Federated Learning using Multigraph Topology

1 code implementation • ICCV 2023 • Tuong Do, Binh X. Nguyen, Vuong Pham, Toan Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we present a new multigraph topology for cross-silo federated learning.

Federated Learning

Fine-Grained Visual Classification using Self Assessment Classifier

1 code implementation • 21 May 2022 • Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on the CUB200-2011, Stanford Dogs, and FGVC Aircraft datasets.

Ranked #4 on Fine-Grained Image Classification on Stanford Dogs

Classification Continual Learning +1

Deep Federated Learning for Autonomous Driving

1 code implementation • 12 Oct 2021 • Anh Nguyen, Tuong Do, Minh Tran, Binh X. Nguyen, Chien Duong, Tu Phan, Erman Tjiputra, Quang D. Tran

We design a new Federated Autonomous Driving network (FADNet) that can improve model stability, ensure convergence, and handle imbalanced data distribution problems while being trained with federated learning methods.

Autonomous Driving Federated Learning

Coarse-to-Fine Reasoning for Visual Question Answering

2 code implementations • 6 Oct 2021 • Binh X. Nguyen, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Bridging the semantic gap between image and question is an important step to improve the accuracy of the Visual Question Answering (VQA) task.

Ranked #1 on Visual Question Answering (VQA) on GQA test-dev

Question Answering Visual Question Answering

Light-weight Deformable Registration using Adversarial Learning with Distilling Knowledge

1 code implementation • 4 Oct 2021 • Minh Q. Tran, Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

We design the student network such that it is light-weight and well suited for deployment on a typical CPU.

Multiple Meta-model Quantifying for Medical Visual Question Answering

2 code implementations • 19 May 2021 • Tuong Do, Binh X. Nguyen, Erman Tjiputra, Minh Tran, Quang D. Tran, Anh Nguyen

However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized.

Ranked #5 on Medical Visual Question Answering on PathVQA

Medical Visual Question Answering Meta-Learning +3

Graph-based Person Signature for Person Re-Identifications

1 code implementation • 14 Apr 2021 • Binh X. Nguyen, Binh D. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen

In this paper, we propose a new method to effectively aggregate detailed person descriptions (attribute labels) and visual features (body parts and global features) into a graph, namely the Graph-based Person Signature, and utilize Graph Convolutional Networks to learn the topological structure of the visual signature of a person.

Ranked #48 on Person Re-Identification on DukeMTMC-reID

Attribute Multi-Task Learning +1

Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering

1 code implementation • 23 Sep 2020 • Tuong Do, Binh X. Nguyen, Huy Tran, Erman Tjiputra, Quang D. Tran, Thanh-Toan Do

Different approaches have been proposed for Visual Question Answering (VQA).

Question Answering Visual Question Answering

Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding

1 code implementation • 9 Sep 2020 • Binh X. Nguyen, Binh D. Nguyen, Gustavo Carneiro, Erman Tjiputra, Quang D. Tran, Thanh-Toan Do

Based on pseudo labels, we propose a novel unsupervised metric loss which enforces the positive concentration and negative separation of samples in the embedding space.

Benchmarking Clustering +2

Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network

no code implementations • 31 Jul 2020 • Anh Nguyen, Ngoc Nguyen, Kim Tran, Erman Tjiputra, Quang D. Tran

In this work, we propose a multimodal fusion approach to address the problem of autonomous navigation in complex environments such as collapsed cities or natural caves.

Robotics

Compact Trilinear Interaction for Visual Question Answering

1 code implementation • ICCV 2019 • Tuong Do, Thanh-Toan Do, Huy Tran, Erman Tjiputra, Quang D. Tran

In Visual Question Answering (VQA), answers are strongly correlated with question meaning and visual content.

Ranked #2 on Visual Question Answering (VQA) on TDIUC

Benchmarking Knowledge Distillation +2

Overcoming Data Limitation in Medical Visual Question Answering

2 code implementations • 26 Sep 2019 • Binh D. Nguyen, Thanh-Toan Do, Binh X. Nguyen, Tuong Do, Erman Tjiputra, Quang D. Tran

Traditional approaches for Visual Question Answering (VQA) require a large amount of labeled data for training.

Ranked #13 on Medical Visual Question Answering on VQA-RAD (using extra training data)

Denoising Medical Visual Question Answering +3
