Search Results for author: Minh-Triet Tran

Found 50 papers, 23 papers with code

Improving Referring Image Segmentation using Vision-Aware Text Features

no code implementations12 Apr 2024 Hai Nguyen-Truong, E-Ro Nguyen, Tuan-Anh Vu, Minh-Triet Tran, Binh-Son Hua, Sai-Kit Yeung

Our method involves using CLIP to derive a CLIP Prior that integrates an object-centric visual heatmap with text description, which can be used as the initial query in DETR-based architecture for the segmentation task.

Image Segmentation Segmentation +1

Cluster-based Video Summarization with Temporal Context Awareness

1 code implementation6 Apr 2024 Hai-Dang Huynh-Lam, Ngoc-Phuong Ho-Thi, Minh-Triet Tran, Trung-Nghia Le

In this paper, we present TAC-SUM, a novel and efficient training-free approach for video summarization that addresses the limitations of existing cluster-based models by incorporating temporal context.

Clustering Unsupervised Video Summarization

Enhancing Video Summarization with Context Awareness

1 code implementation6 Apr 2024 Hai-Dang Huynh-Lam, Ngoc-Phuong Ho-Thi, Minh-Triet Tran, Trung-Nghia Le

Despite the importance of video summarization, there is a lack of diverse and representative datasets, hindering comprehensive evaluation and benchmarking of algorithms.

Benchmarking Informativeness +1

iCONTRA: Toward Thematic Collection Design Via Interactive Concept Transfer

1 code implementation13 Mar 2024 Dinh-Khoi Vo, Duy-Nam Ly, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

With a user-friendly interface, iCONTRA enables both experienced designers and novices to effortlessly explore creative design concepts and efficiently generate thematic collections.

TSRNet: Simple Framework for Real-time ECG Anomaly Detection with Multimodal Time and Spectrogram Restoration Network

1 code implementation15 Dec 2023 Nhat-Tan Bui, Dinh-Hieu Hoang, Thinh Phan, Minh-Triet Tran, Brijesh Patel, Donald Adjeroh, Ngan Le

As a result, we introduce a specialized network called the Multimodal Time and Spectrogram Restoration Network (TSRNet) designed specifically for detecting anomalies in ECG signals.

Anomaly Detection Time Series

NearbyPatchCL: Leveraging Nearby Patches for Self-Supervised Patch-Level Multi-Class Classification in Whole-Slide Images

1 code implementation12 Dec 2023 Gia-Bao Le, Van-Tien Nguyen, Trung-Nghia Le, Minh-Triet Tran

In addressing the demands of this critical task, self-supervised learning (SSL) methods have emerged as a valuable resource, leveraging their efficiency in circumventing the need for a large number of annotations, which can be both costly and time-consuming to deploy supervised methods.

Contrastive Learning Multi-class Classification +3

SAM3D: Segment Anything Model in Volumetric Medical Images

2 code implementations7 Sep 2023 Nhat-Tan Bui, Dinh-Hieu Hoang, Minh-Triet Tran, Gianfranco Doretto, Donald Adjeroh, Brijesh Patel, Arabinda Choudhary, Ngan Le

Image segmentation remains a pivotal component in medical image analysis, aiding in the extraction of critical information for precise diagnostic practices.

Image Segmentation Segmentation +1

MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation

2 code implementations6 Sep 2023 Nhat-Tan Bui, Dinh-Hieu Hoang, Quang-Thuc Nguyen, Minh-Triet Tran, Ngan Le

MEGANet is designed as an end-to-end framework, encompassing three key modules: an encoder, which is responsible for capturing and abstracting the features from the input image, a decoder, which focuses on salient features, and the Edge-Guided Attention module (EGA) that employs the Laplacian Operator to accentuate polyp boundaries.

Edge Detection Segmentation

Unveiling Camouflage: A Learnable Fourier-based Augmentation for Camouflaged Object Detection and Instance Segmentation

no code implementations29 Aug 2023 Minh-Quan Le, Minh-Triet Tran, Trung-Nghia Le, Tam V. Nguyen, Thanh-Toan Do

Camouflaged object detection (COD) and camouflaged instance segmentation (CIS) aim to recognize and segment objects that are blended into their surroundings, respectively.

Generative Adversarial Network Instance Segmentation +3

DM-VTON: Distilled Mobile Real-time Virtual Try-On

1 code implementation26 Aug 2023 Khoi-Nguyen Nguyen-Ngoc, Thanh-Tung Phan-Nguyen, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

Additionally, we propose Virtual Try-on-guided Pose for Data Synthesis to address the limited pose variation observed in training images.

Human Parsing Knowledge Distillation +1

VIDES: Virtual Interior Design via Natural Language and Visual Guidance

no code implementations26 Aug 2023 Minh-Hien Le, Chi-Bien Chu, Khanh-Duy Le, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

The system successfully captures the essence of users' descriptions while providing flexibility for customization.

MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation

1 code implementation9 Mar 2023 Minh-Quan Le, Tam V. Nguyen, Trung-Nghia Le, Thanh-Toan Do, Minh N. Do, Minh-Triet Tran

To overcome the disadvantage of the point estimation mechanism, we propose a novel approach, dubbed MaskDiff, which models the underlying conditional distribution of a binary mask, which is conditioned on an object region and $K-$shot information.

Few-Shot Learning Instance Segmentation +1

Multi Kernel Positional Embedding ConvNeXt for Polyp Segmentation

1 code implementation17 Jan 2023 Trong-Hieu Nguyen Mau, Quoc-Huy Trinh, Nhat-Tan Bui, Minh-Triet Tran, Hai-Dang Nguyen

Specifically, with the increase in cases, the diagnosis and identification need to be faster and more accurate for many patients; in endoscopic images, the segmentation task has been vital to helping the doctor identify the position of the polyps or the ache in the system correctly.

Image Segmentation Medical Image Segmentation +2

Multilingual Communication System with Deaf Individuals Utilizing Natural and Visual Languages

no code implementations1 Dec 2022 Tuan-Luc Huynh, Khoi-Nguyen Nguyen-Ngoc, Chi-Bien Chu, Minh-Triet Tran, Trung-Nghia Le

To bridge this language barrier, we propose a novel multilingual communication system, namely MUGCAT, to improve the communication efficiency of sign language users.

Semantic Similarity Semantic Textual Similarity

EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification

1 code implementation7 Oct 2022 Tien-Phat Nguyen, Trong-Thang Pham, Tri Nguyen, Hieu Le, Dung Nguyen, Hau Lam, Phong Nguyen, Jennifer Fowler, Minh-Triet Tran, Ngan Le

The transformer expanding path models the temporal coherency between embryo images to ensure monotonic non-decreasing constraint and is optimized by a segmentation head.

AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation

1 code implementation5 Oct 2022 Khoa Vo, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le

PMR module represents each video snippet by a visual-linguistic feature, in which main actors and surrounding environment are represented by visual information, whereas relevant objects are depicted by linguistic features through an image-text model.

Action Detection Temporal Action Proposal Generation

SHREC 2022 Track on Online Detection of Heterogeneous Gestures

no code implementations14 Jul 2022 Ariel Caputo, Marco Emporio, Andrea Giachetti, Marco Cristani, Guido Borghi, Andrea D'Eusanio, Minh-Quan Le, Hai-Dang Nguyen, Minh-Triet Tran, F. Ambellan, M. Hanik, E. Nava-Yazdani, C. von Tycowicz

This paper presents the outcomes of a contest organized to evaluate methods for the online recognition of heterogeneous gestures from sequences of 3D hand poses.

Mixed Reality

An Improved Subject-Independent Stress Detection Model Applied to Consumer-grade Wearable Devices

no code implementations18 Mar 2022 Van-Tu Ninh, Manh-Duy Nguyen, Sinéad Smyth, Minh-Triet Tran, Graham Healy, Binh T. Nguyen, Cathal Gurrin

Using our proposed model architecture, we compare the accuracy between stress detection models that use measures from each individual signal source, and one model employing the fusion of multiple sensor sources.

Management

Analysing the Performance of Stress Detection Models on Consumer-Grade Wearable Devices

no code implementations18 Mar 2022 Van-Tu Ninh, Sinéad Smyth, Minh-Triet Tran, Cathal Gurrin

The results from the experiment show that training the model with (comparatively low-cost) low-resolution EDA signal does not affect the stress detection accuracy of the model significantly compared to using a high-resolution EDA signal.

Heart Rate Variability

ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation

1 code implementation16 Mar 2022 Khoa Vo, Kashu Yamazaki, Sang Truong, Minh-Triet Tran, Akihiro Sugimoto, Ngan Le

Temporal action proposal generation (TAPG) aims to estimate temporal intervals of actions in untrimmed videos, which is a challenging yet plays an important role in many tasks of video analysis and understanding.

Action Detection Temporal Action Proposal Generation

Meta-Learning of NAS for Few-shot Learning in Medical Image Applications

no code implementations16 Mar 2022 Viet-Khoa Vo-Ho, Kashu Yamazaki, Hieu Hoang, Minh-Triet Tran, Ngan Le

To address such limitations, meta-learning has been adopted in the scenarios of few-shot learning and multiple tasks.

Few-Shot Learning Image Classification +1

DAM-AL: Dilated Attention Mechanism with Attention Loss for 3D Infant Brain Image Segmentation

1 code implementation27 Dec 2021 Dinh-Hieu Hoang, Gia-Han Diep, Minh-Triet Tran, Ngan T. H Le

While Magnetic Resonance Imaging (MRI) has played an essential role in infant brain analysis, segmenting MRI into a number of tissues such as gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) is crucial and complex due to the extremely low intensity contrast between tissues at around 6-9 months of age as well as amplified noise, myelination, and incomplete volume.

Brain Image Segmentation Image Segmentation +1

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

1 code implementation21 Oct 2021 Khoa Vo, Hyekang Joo, Kashu Yamazaki, Sang Truong, Kris Kitani, Minh-Triet Tran, Ngan Le

In this paper, we make an attempt to simulate that ability of a human by proposing Actor Environment Interaction (AEI) network to improve the video representation for temporal action proposals generation.

Action Detection Temporal Action Proposal Generation

Agent-Environment Network for Temporal Action Proposal Generation

no code implementations17 Jul 2021 Viet-Khoa Vo-Ho, Ngan Le, Kashu Yamazaki, Akihiro Sugimoto, Minh-Triet Tran

Temporal action proposal generation is an essential and challenging task that aims at localizing temporal intervals containing human actions in untrimmed videos.

Temporal Action Proposal Generation

Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation

no code implementations7 Jun 2021 Trung-Nghia Le, Tam V. Nguyen, Minh-Triet Tran

For human instance, we develop skeleton-guided segmentation in a frame along with object flow to correct and refine the result across frames.

Human-Object Interaction Detection Instance Segmentation +3

Refined Deep Neural Network and U-Net for Polyps Segmentation

1 code implementation31 May 2021 Quoc-Huy Trinh, Minh-Van Nguyen, Thiet-Gia Huynh, Minh-Triet Tran

The Medico: Multimedia Task 2020 focuses on developing an efficient and accurate computer-aided diagnosis system for automatic segmentation [3].

Segmentation Semantic Segmentation

Anabranch Network for Camouflaged Object Segmentation

2 code implementations Computer Vision and Image Understanding 2019 Trung-Nghia Le, Tam V. Nguyen, Zhongliang Nie, Minh-Triet Tran, Akihiro Sugimoto

Different from existing networks for segmentation, our proposed network possesses the second branch for classification to predict the probability of containing camouflaged object(s) in an image, which is then fused into the main branch for segmentation to boost up the segmentation accuracy.

Benchmarking Camouflaged Object Segmentation +3

MirrorNet: Bio-Inspired Camouflaged Object Segmentation

no code implementations Pattern Recognition Journal 2020 Jinnan Yan, Trung-Nghia Le, Khanh-Duy Nguyen, Minh-Triet Tran, Thanh-Toan Do, Tam V. Nguyen

Differently from existing networks for segmentation, our proposed network possesses two segmentation streams: the main stream and the mirror stream corresponding with the original image and its flipped image, respectively.

Camouflaged Object Segmentation Camouflage Segmentation +3

Image Alignment in Unseen Domains via Domain Deep Generalization

no code implementations28 May 2019 Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

This paper presents a novel deep learning based approach to tackle the problem of across unseen modalities.

Domain Adaptation

Fast Flow Reconstruction via Robust Invertible nxn Convolution

no code implementations24 May 2019 Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

The experiments on CIFAR-10, ImageNet and Celeb-HQ datasets, have shown that our invertible $n \times n$ convolution helps to improve the performance of generative models significantly.

Cannot find the paper you are looking for? You can Submit a new open access paper.