Search Results for author: Khoi Nguyen

Found 32 papers, 16 papers with code

SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models

no code implementations13 Dec 2024 Hung Nguyen, Quang Qui-Vinh Nguyen, Khoi Nguyen, Rang Nguyen

Given an input video of a person and a new garment, the objective of this paper is to synthesize a new video where the person is wearing the specified garment while maintaining spatiotemporal consistency.

Video Inpainting Virtual Try-on

SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion

no code implementations5 Dec 2024 Trong-Tung Nguyen, Quang Nguyen, Khoi Nguyen, Anh Tran, Cuong Pham

Recent advances in text-guided image editing enable users to perform image edits through simple text inputs, leveraging the extensive priors of multi-step diffusion-based text-to-image models.

Image Reconstruction text-guided-image-editing

EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM

no code implementations5 Dec 2024 Quang Nguyen, Truong Vu, Trong-Tung Nguyen, Yuxin Wen, Preston K Robinette, Taylor T Johnson, Tom Goldstein, Anh Tran, Khoi Nguyen

By leveraging the contextual and semantic strengths of LLMs, our framework achieves promising results on MagicBrush, AutoSplice, and PerfBrush (novel diffusion-based dataset) datasets, outperforming previous approaches in mIoU and F1-score metrics.

Image Manipulation Language Modeling +3

SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

no code implementations3 Dec 2024 Viet Nguyen, Anh Nguyen, Trung Dao, Khoi Nguyen, Cuong Pham, Toan Tran, Anh Tran

However, our study reveals its instability when handling different diffusion model backbones due to using a fixed guidance scale within the Variational Score Distillation (VSD) loss.

Image Generation

SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation

no code implementations27 Nov 2024 Duc-Hai Pham, Tung Do, Phong Nguyen, Binh-Son Hua, Khoi Nguyen, Rang Nguyen

We propose SharpDepth, a novel approach to monocular metric depth estimation that combines the metric accuracy of discriminative depth estimation methods (e. g., Metric3D, UniDepth) with the fine-grained boundary sharpness typically achieved by generative methods (e. g., Marigold, Lotus).

Depth Estimation

Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking

no code implementations25 Nov 2024 Phuc Nguyen, Minh Luu, Anh Tran, Cuong Pham, Khoi Nguyen

Existing 3D instance segmentation methods frequently encounter issues with over-segmentation, leading to redundant and inaccurate 3D proposals that complicate downstream tasks.

3D Instance Segmentation Segmentation +1

SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher

no code implementations26 Aug 2024 Trung Dao, Thuan Hoang Nguyen, Thanh Le, Duc Vu, Khoi Nguyen, Cuong Pham, Anh Tran

Remarkably, by combining the weights of models trained with efficient LoRA and full training, we achieve a new state-of-the-art one-step diffusion model, achieving an FID of 8. 14 and surpassing all GAN-based and multi-step Stable Diffusion models.

Diversity

Open-Ended 3D Point Cloud Instance Segmentation

no code implementations21 Aug 2024 Phuc D. A. Nguyen, Minh Luu, Anh Tran, Cuong Pham, Khoi Nguyen

To mitigate this constraint, we propose a novel problem termed Open-Ended 3D Instance Segmentation (OE-3DIS), which eliminates the necessity for predefined class names during testing.

3D Instance Segmentation Semantic Segmentation

Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance

no code implementations21 Aug 2024 Duc-Hai Pham, Duc-Dung Nguyen, Anh Pham, Tuan Ho, Phong Nguyen, Khoi Nguyen, Rang Nguyen

Accurate prediction of 3D semantic occupancy from 2D visual images is vital in enabling autonomous agents to comprehend their surroundings for planning and navigation.

3D Semantic Occupancy Prediction 3D Semantic Scene Completion

Stable Messenger: Steganography for Message-Concealed Image Generation

no code implementations3 Dec 2023 Quang Nguyen, Truong Vu, Cuong Pham, Anh Tran, Khoi Nguyen

In the ever-expanding digital landscape, safeguarding sensitive information remains paramount.

Image Generation

DiverseDream: Diverse Text-to-3D Synthesis with Augmented Text Embedding

no code implementations2 Dec 2023 Uy Dieu Tran, Minh Luu, Phong Ha Nguyen, Khoi Nguyen, Binh-Son Hua

Text-to-3D synthesis has recently emerged as a new approach to sampling 3D models by adopting pretrained text-to-image models as guiding visual priors.

Diversity Text to 3D

LP-OVOD: Open-Vocabulary Object Detection by Linear Probing

1 code implementation26 Oct 2023 Chau Pham, Truong Vu, Khoi Nguyen

To address this issue, we propose a novel method, LP-OVOD, that discards low-quality boxes by training a sigmoid linear classifier on pseudo labels retrieved from the top relevant region proposals to the novel text.

Object object-detection +2

GaPro: Box-Supervised 3D Point Cloud Instance Segmentation Using Gaussian Processes as Pseudo Labelers

1 code implementation ICCV 2023 Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen

Furthermore, we demonstrate the robustness of our approach, where we can adapt various state-of-the-art fully supervised methods to the weak supervision task by using our pseudo labels for training.

3D Instance Segmentation Gaussian Processes +2

ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution

2 code implementations CVPR 2023 Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen

Existing 3D instance segmentation methods are predominated by the bottom-up design -- manually fine-tuned algorithm to group points into clusters followed by a refinement network.

3D Instance Segmentation Semantic Segmentation

PSENet: Progressive Self-Enhancement Network for Unsupervised Extreme-Light Image Enhancement

2 code implementations3 Oct 2022 Hue Nguyen, Diep Tran, Khoi Nguyen, Rang Nguyen

The extremes of lighting (e. g. too much or too little light) usually cause many troubles for machine and human vision.

Image Enhancement

POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples

1 code implementation NeurIPS 2021 Duong H. Le, Khoi D. Nguyen, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, Binh-Son Hua

In this work, we propose to use out-of-distribution samples, i. e., unlabeled samples coming from outside the target classes, to improve few-shot learning.

Few-Shot Learning

iFS-RCNN: An Incremental Few-shot Instance Segmenter

1 code implementation CVPR 2022 Khoi Nguyen, Sinisa Todorovic

This paper addresses incremental few-shot instance segmentation, where a few examples of new object classes arrive when access to training examples of old classes is not available anymore, and the goal is to perform well on both old and new classes.

Few-shot Instance Segmentation Instance Segmentation +2

A Weakly Supervised Amodal Segmenter with Boundary Uncertainty Estimation

1 code implementation ICCV 2021 Khoi Nguyen, Sinisa Todorovic

The resulting predictions on training images are taken as the pseudo-ground truth for the standard training of Mask-RCNN, which we use for amodal instance segmentation of test images.

Amodal Instance Segmentation Segmentation +1

Semi-Supervising Learning, Transfer Learning, and Knowledge Distillation with SimCLR

no code implementations2 Aug 2021 Khoi Nguyen, Yen Nguyen, Bao Le

Most successful semi-supervised learning approaches in computer vision focus on leveraging huge amount of unlabeled data, learning the general representation via data augmentation and transformation, creating pseudo labels, implementing different loss functions, and eventually transferring this knowledge to more task-specific smaller models.

Data Augmentation Knowledge Distillation +1

FAPIS: A Few-shot Anchor-free Part-based Instance Segmenter

1 code implementation CVPR 2021 Khoi Nguyen, Sinisa Todorovic

This paper is about few-shot instance segmentation, where training and test image sets do not share the same object classes.

Few-shot Instance Segmentation Few-Shot Learning +4

A Self-supervised GAN for Unsupervised Few-shot Object Recognition

no code implementations16 Aug 2020 Khoi Nguyen, Sinisa Todorovic

This paper addresses unsupervised few-shot object recognition, where all training images are unlabeled, and test images are divided into queries and a few labeled support images per object class of interest.

Object Object Recognition +2

Feature Weighting and Boosting for Few-Shot Segmentation

1 code implementation ICCV 2019 Khoi Nguyen, Sinisa Todorovic

Finally, the target object is segmented in the query image by using a cosine similarity between the class feature vector and the query's feature map.

Few-Shot Semantic Segmentation Segmentation

Causal Mediation Analysis Leveraging Multiple Types of Summary Statistics Data

no code implementations24 Jan 2019 Yongjin Park, Abhishek Sarkar, Khoi Nguyen, Manolis Kellis

We can achieve necessary interpretation of GWAS in a causal mediation framework, looking to establish a sparse set of mediators between genetic and downstream variables, but there are several challenges.

Causal Inference

Cannot find the paper you are looking for? You can Submit a new open access paper.