Search Results for author: Khoa Luu

Found 67 papers, 15 papers with code

Beyond Principal Components: Deep Boltzmann Machines for Face Modeling

no code implementations • CVPR 2015 • Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui

The "interpretation through synthesis", i.e.


Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann Machines

no code implementations • CVPR 2016 • Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui

The Temporal Deep Restricted Boltzmann Machines based age progression model together with the prototype faces are then constructed to learn the aging transformation between faces in the sequence.

MORPH


CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection

no code implementations • 17 Jun 2016 • Chenchen Zhu, Yutong Zheng, Khoa Luu, Marios Savvides

Robust face detection in the wild is one of the ultimate components to support various face-related problems, i.e., unconstrained face recognition, facial periocular recognition, facial landmarking and pose estimation, facial expression recognition, 3D facial model construction, etc.

Ranked #30 on Face Detection on WIDER Face (Medium)

Face Detection Face Recognition +6


Robust Deep Appearance Models

no code implementations • 3 Jul 2016 • Kha Gia Quach, Chi Nhan Duong, Khoa Luu, Tien D. Bui

In this approach, two crucial components of face images, i.e., shape and texture, are represented by Deep Boltzmann Machines and Robust Deep Boltzmann Machines (RDBM), respectively.


Deep Appearance Models: A Deep Boltzmann Machine Approach for Face Modeling

no code implementations • 23 Jul 2016 • Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui

This paper presents a novel Deep Appearance Models (DAMs) approach, an efficient replacement for AAMs, to accurately capture both shape and texture of face images under large variations.

Age Estimation Super-Resolution


Towards a Deep Learning Framework for Unconstrained Face Detection

no code implementations • 16 Dec 2016 • Yutong Zheng, Chenchen Zhu, Khoa Luu, Chandrasekhar Bhagavatula, T. Hoang Ngan Le, Marios Savvides

Robust face detection is one of the most important pre-processing steps to support facial expression analysis, facial landmarking, face recognition, pose estimation, building of 3D facial models, etc.

Face Detection Face Recognition +2


Temporal Non-Volume Preserving Approach to Facial Age-Progression and Age-Invariant Face Recognition

no code implementations • ICCV 2017 • Chi Nhan Duong, Kha Gia Quach, Khoa Luu, T. Hoang Ngan Le, Marios Savvides

Modeling the long-term facial aging process is extremely challenging due to the presence of large and non-linear variations during the face development stages.

Age-Invariant Face Recognition Density Estimation +2


Deep Contextual Recurrent Residual Networks for Scene Labeling

no code implementations • 12 Apr 2017 • T. Hoang Ngan Le, Chi Nhan Duong, Ligong Han, Khoa Luu, Marios Savvides, Dipan Pal

Designed as extremely deep architectures, deep residual networks which provide a rich visual representation and offer robust convergence behaviors have recently achieved exceptional performance in numerous computer vision problems.

Representation Learning Scene Labeling


Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation

1 code implementation • 12 Apr 2017 • Ngan Le, Kha Gia Quach, Khoa Luu, Marios Savvides, Chenchen Zhu

To address these issues and boost the classic variational LS methods to a new level of the learnable deep learning approaches, we propose a novel definition of contour evolution named Recurrent Level Set (RLS) to employ Gated Recurrent Unit under the energy minimization of a variational LS functional.

Segmentation Semantic Segmentation


Faster Than Real-time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses

no code implementations • ICCV 2017 • Chandrasekhar Bhagavatula, Chenchen Zhu, Khoa Luu, Marios Savvides

We present our novel approach to simultaneously extract the 3D shape of the face and the semantically consistent 2D alignment through a 3D Spatial Transformer Network (3DSTN) to model both the camera projection matrix and the warping parameters of a 3D model.

Ranked #10 on Face Alignment on AFLW2000-3D

Face Alignment


Learning from Longitudinal Face Demonstration - Where Tractable Deep Modeling Meets Inverse Reinforcement Learning

no code implementations • 28 Nov 2017 • Chi Nhan Duong, Kha Gia Quach, Khoa Luu, T. Hoang Ngan Le, Marios Savvides, Tien D. Bui

The proposed model is evaluated on two tasks: face aging synthesis and cross-age face verification.

Face Verification MORPH +2


Longitudinal Face Aging in the Wild - Recent Deep Learning Approaches

no code implementations • 23 Feb 2018 • Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui

Face Aging has attracted considerable attention and interest from the computer vision community in recent years.


Seeing Small Faces from Robust Anchor's Perspective

no code implementations • CVPR 2018 • Chenchen Zhu, Ran Tao, Khoa Luu, Marios Savvides

This paper introduces a novel anchor design to support anchor-based face detection for superior scale-invariant performance, especially on tiny faces.

Face Detection


Automatic Face Aging in Videos via Deep Reinforcement Learning

no code implementations • CVPR 2019 • Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Nghia Nguyen, Eric Patterson, Tien D. Bui, Ngan Le

This paper presents a novel approach to automatically synthesize age-progressed facial images in video sequences using Deep Reinforcement Learning.

Face Verification reinforcement-learning +1


MobiFace: A Lightweight Deep Learning Face Recognition on Mobile Devices

no code implementations • 27 Nov 2018 • Chi Nhan Duong, Kha Gia Quach, Ibsa Jalata, Ngan Le, Khoa Luu

Deep neural networks have been widely used in numerous computer vision applications, particularly in face recognition.

Face Recognition


Non-Volume Preserving-based Fusion to Group-Level Emotion Recognition on Crowd Videos

no code implementations • 28 Nov 2018 • Kha Gia Quach, Ngan Le, Chi Nhan Duong, Ibsa Jalata, Kaushik Roy, Khoa Luu

To demonstrate the robustness and effectiveness of each component in the proposed approach, three experiments were conducted: (i) evaluation on AffectNet database to benchmark the proposed EmoNet for recognizing facial expression; (ii) evaluation on EmotiW2018 to benchmark the proposed deep feature level fusion mechanism NVPF; and, (iii) examine the proposed TNVPF on an innovative Group-level Emotion on Crowd Videos (GECV) dataset composed of 627 videos collected from publicly available sources.

Emotion Recognition


Beyond Domain Adaptation: Unseen Domain Encapsulation via Universal Non-volume Preserving Models

no code implementations • 9 Dec 2018 • Thanh-Dat Truong, Chi Nhan Duong, Khoa Luu, Minh-Triet Tran, Minh Do

However, it has been largely overlooked in the problem of recognition in new unseen domains.

Domain Generalization Face Recognition +1


Image Processing in Quantum Computers

3 code implementations • 28 Dec 2018 • Aditya Dendukuri, Khoa Luu

Quantum Image Processing (QIP) is an exciting new field showing a lot of promise as a powerful addition to the arsenal of Image Processing techniques.


Fast Flow Reconstruction via Robust Invertible nxn Convolution

no code implementations • 24 May 2019 • Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

Experiments on the CIFAR-10, ImageNet and Celeb-HQ datasets have shown that our invertible $n \times n$ convolution helps to improve the performance of generative models significantly.


ShrinkTeaNet: Million-scale Lightweight Face Recognition via Shrinking Teacher-Student Networks

2 code implementations • 25 May 2019 • Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Ngan Le

In addition, this work introduces a novel Angular Distillation Loss for distilling the feature direction and the sample distributions of the teacher's hypersphere to its student.

Lightweight Face Recognition


Defining Quantum Neural Networks via Quantum Time Evolution

no code implementations • 27 May 2019 • Aditya Dendukuri, Blake Keeling, Arash Fereidouni, Joshua Burbridge, Khoa Luu, Hugh Churchill

This work presents a novel fundamental algorithm for defining and training Neural Networks in Quantum Information based on time evolution and the Hamiltonian.

Image Classification


Image Alignment in Unseen Domains via Domain Deep Generalization

no code implementations • 28 May 2019 • Thanh-Dat Truong, Khoa Luu, Chi Nhan Duong, Ngan Le, Minh-Triet Tran

This paper presents a novel deep learning based approach to tackle the problem of image alignment across unseen modalities.

Domain Adaptation


Domain Generalization via Universal Non-volume Preserving Models

no code implementations • 28 May 2019 • Thanh-Dat Truong, Chi Nhan Duong, Khoa Luu, Minh-Triet Tran, Ngan Le

However, it has been largely overlooked in the problem of recognition in new unseen domains.

Domain Generalization Face Recognition +2


Vec2Face: Unveil Human Faces from their Blackbox Features in Face Recognition

no code implementations • CVPR 2020 • Chi Nhan Duong, Thanh-Dat Truong, Kha Gia Quach, Hung Bui, Kaushik Roy, Khoa Luu

Unveiling face images of a subject given his/her high-level representations extracted from a blackbox Face Recognition engine is extremely challenging.

Benchmarking Face Recognition +2


LIAAD: Lightweight Attentive Angular Distillation for Large-scale Age-Invariant Face Recognition

no code implementations • 9 Apr 2020 • Thanh-Dat Truong, Chi Nhan Duong, Kha Gia Quach, Ngan Le, Tien D. Bui, Khoa Luu

This work presents a novel Lightweight Attentive Angular Distillation (LIAAD) approach to Large-scale Lightweight AiFR that overcomes these limitations.

Age-Invariant Face Recognition


Flow-based Deformation Guidance for Unpaired Multi-Contrast MRI Image-to-Image Translation

no code implementations • 3 Dec 2020 • Toan Duc Bui, Manh Nguyen, Ngan Le, Khoa Luu

To capture temporal structures in the medical images, we explore the displacement between the consecutive slices using a deformation field.

Generative Adversarial Network Image-to-Image Translation +1


Offset Curves Loss for Imbalanced Problem in Medical Segmentation

no code implementations • 4 Dec 2020 • Ngan Le, Trung Le, Kashu Yamazaki, Toan Duc Bui, Khoa Luu, Marios Savvides

Our proposed Offset Curves (OsC) loss consists of three main fitting terms.

Image Segmentation Medical Image Segmentation +2


Deep reinforcement learning in medical imaging: A literature review

no code implementations • 5 Mar 2021 • S. Kevin Zhou, Hoang Ngan Le, Khoa Luu, Hien V. Nguyen, Nicholas Ayache

Deep reinforcement learning (DRL) augments the reinforcement learning framework, which learns a sequence of actions that maximizes the expected reward, with the representative power of deep neural networks.

Lesion Detection Miscellaneous +3


Progressive Semantic Segmentation

1 code implementation • CVPR 2021 • Chuong Huynh, Anh Tran, Khoa Luu, Minh Hoai

In this work, we present MagNet, a multi-scale framework that resolves local ambiguity by looking at the image at multiple magnification levels.

Ranked #3 on Land Cover Classification on DeepGlobe

Land Cover Classification Segmentation +1



DyGLIP: A Dynamic Graph Model with Link Prediction for Accurate Multi-Camera Multiple Object Tracking

1 code implementation • CVPR 2021 • Kha Gia Quach, Pha Nguyen, Huu Le, Thanh-Dat Truong, Chi Nhan Duong, Minh-Triet Tran, Khoa Luu

Multi-Camera Multiple Object Tracking (MC-MOT) is a significant computer vision problem due to its emerging applicability in several real-world applications.

Link Prediction Multiple Object Tracking


Clusformer: A Transformer Based Clustering Approach to Unsupervised Large-Scale Face and Visual Landmark Recognition

1 code implementation • CVPR 2021 • Xuan-Bac Nguyen, Duc Toan Bui, Chi Nhan Duong, Tien D. Bui, Khoa Luu

This work therefore presents the Clusformer, a simple but new perspective of Transformer based approach, to automatic visual clustering via its unsupervised attention mechanism.

Clustering Landmark Recognition


The Right to Talk: An Audio-Visual Transformer Approach

1 code implementation • ICCV 2021 • Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu

Therefore, this work introduces a new Audio-Visual Transformer approach to the problem of localizing and highlighting the main speaker in both audio and visual channels of a multi-speaker conversation video in the wild.


BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation

1 code implementation • ICCV 2021 • Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Son Lam Phung, Chase Rainwater, Khoa Luu

Semantic segmentation aims to predict pixel-level labels.

Ranked #19 on Unsupervised Domain Adaptation on GTAV-to-Cityscapes Labels

Scene Segmentation Segmentation +1


Deep Reinforcement Learning in Computer Vision: A Comprehensive Survey

no code implementations • 25 Aug 2021 • Ngan Le, Vidhiwar Singh Rathour, Kashu Yamazaki, Khoa Luu, Marios Savvides

In this work, we provide a detailed review of recent and state-of-the-art research advances of deep reinforcement learning in computer vision.

Image Segmentation object-detection +5


CapsNet for Medical Image Segmentation

no code implementations • 16 Mar 2022 • Minh Tran, Viet-Khoa Vo-Ho, Kyle Quinn, Hien Nguyen, Khoa Luu, Ngan Le

We then provide recent developments of CapsNet for the task of medical image segmentation.

Image Segmentation Representation Learning +3


DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition

1 code implementation • CVPR 2022 • Thanh-Dat Truong, Quoc-Huy Bui, Chi Nhan Duong, Han-Seok Seo, Son Lam Phung, Xin Li, Khoa Luu

Various 3D-CNN based methods have been presented to tackle both the spatial and temporal dimensions in the task of video action recognition with competitive results.

Ranked #1 on Action Recognition on Jester (Gesture Recognition)

Action Classification Action Recognition In Videos +2


Multi-Camera Multiple 3D Object Tracking on the Move for Autonomous Vehicles

no code implementations • 19 Apr 2022 • Pha Nguyen, Kha Gia Quach, Chi Nhan Duong, Ngan Le, Xuan-Bac Nguyen, Khoa Luu

The experimental results on the nuScenes dataset demonstrate the benefits of the proposed method to produce SOTA performance on the existing vision-based tracking dataset.

3D Object Detection 3D Object Tracking +5


OTAdapt: Optimal Transport-based Approach For Unsupervised Domain Adaptation

no code implementations • 22 May 2022 • Thanh-Dat Truong, Naga Venkata Sai Raviteja Chappa, Xuan Bac Nguyen, Ngan Le, Ashley Dowling, Khoa Luu

Unsupervised domain adaptation is one of the challenging problems in computer vision.

Object Recognition Unsupervised Domain Adaptation


Two-Dimensional Quantum Material Identification via Self-Attention and Soft-labeling in Deep Learning

no code implementations • 31 May 2022 • Xuan Bac Nguyen, Apoorva Bisht, Ben Thompson, Hugh Churchill, Khoa Luu, Samee U. Khan

In this work, we present a novel method to tackle the problem of missing annotation in instance segmentation in 2D quantum material identification.

2k Instance Segmentation +2


Self-supervised Domain Adaptation in Crowd Counting

no code implementations • 7 Jun 2022 • Pha Nguyen, Thanh-Dat Truong, Miaoqing Huang, Yi Liang, Ngan Le, Khoa Luu

Self-training crowd counting has not been thoroughly explored, though it is one of the important challenges in computer vision.

Crowd Counting Domain Adaptation


VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

1 code implementation • 26 Jun 2022 • Kashu Yamazaki, Sang Truong, Khoa Vo, Michael Kidd, Chase Rainwater, Khoa Luu, Ngan Le

In this paper, we leverage the human perceiving process, that involves vision and language interaction, to generate a coherent paragraph description of untrimmed videos.

Ranked #3 on Video Captioning on ActivityNet Captions

Contrastive Learning Video Captioning


Depth Perspective-aware Multiple Object Tracking

no code implementations • 10 Jul 2022 • Kha Gia Quach, Huu Le, Pha Nguyen, Chi Nhan Duong, Tien Dai Bui, Khoa Luu

This paper aims to tackle Multiple Object Tracking (MOT), an important problem in computer vision that remains challenging due to many practical issues, especially occlusions.

Depth Estimation Multiple Object Tracking +1


Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition

no code implementations • 11 Sep 2022 • Thanh-Dat Truong, Chi Nhan Duong, Ngan Le, Marios Savvides, Khoa Luu

We therefore introduce a new method named Attention-based Bijective Generative Adversarial Networks in a Distillation framework (DAB-GAN) to synthesize faces of a subject given his/her extracted face recognition features.

Face Recognition Face Reconstruction +2


Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach

no code implementations • 17 Nov 2022 • Pha Nguyen, Kha Gia Quach, Chi Nhan Duong, Son Lam Phung, Ngan Le, Khoa Luu

The development of autonomous vehicles generates a tremendous demand for a low-cost solution with a complete set of camera sensors capturing the environment around the car.

3D Object Detection Autonomous Vehicles +3


CONDA: Continual Unsupervised Domain Adaptation Learning in Visual Perception for Self-Driving Cars

no code implementations • 1 Dec 2022 • Thanh-Dat Truong, Pierce Helton, Ahmed Moustafa, Jackson David Cothren, Khoa Luu

Also, the previous data training of segmentation models may be inaccessible due to privacy problems.

Scene Segmentation Segmentation +2


Neural Cell Video Synthesis via Optical-Flow Diffusion

no code implementations • 6 Dec 2022 • Manuel Serna-Aguilera, Khoa Luu, Nathaniel Harris, Min Zou

This problem presents several challenges as it can be difficult to grow and maintain the culture for days, and it is expensive to acquire the materials and equipment.

Cultural Vocal Bursts Intensity Prediction Denoising +2


Contextual Explainable Video Representation: Human Perception-based Understanding

1 code implementation • 12 Dec 2022 • Khoa Vo, Kashu Yamazaki, Phong X. Nguyen, Phat Nguyen, Khoa Luu, Ngan Le

We choose video paragraph captioning and temporal action detection to illustrate the effectiveness of human perception-based contextual representation in video understanding.

Action Detection Action Recognition +4


SPARTAN: Self-supervised Spatiotemporal Transformers Approach to Group Activity Recognition

1 code implementation • 6 Mar 2023 • Naga VS Raviteja Chappa, Pha Nguyen, Alexander H Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu

In this paper, we propose a new, simple, and effective Self-supervised Spatio-temporal Transformers (SPARTAN) approach to Group Activity Recognition (GAR) using unlabeled video data.

Group Activity Recognition


FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding

1 code implementation • CVPR 2023 • Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson Cothren, Khoa Luu

Although Domain Adaptation in Semantic Scene Segmentation has shown impressive improvement in recent years, the fairness concerns in the domain adaptation have yet to be well defined and addressed.

Ranked #5 on Domain Adaptation on SYNTHIA-to-Cityscapes

Autonomous Driving Domain Adaptation +4


Micron-BERT: BERT-based Facial Micro-Expression Recognition

1 code implementation • CVPR 2023 • Xuan-Bac Nguyen, Chi Nhan Duong, Xin Li, Susan Gauch, Han-Seok Seo, Khoa Luu

By incorporating these components into an end-to-end deep network, the proposed $\mu$-BERT significantly outperforms all previous work in various micro-expression tasks.

Ranked #1 on Micro Expression Recognition on SMIC

Micro Expression Recognition Micro-Expression Recognition +1



CROVIA: Seeing Drone Scenes from Car Perspective via Cross-View Adaptation

no code implementations • 14 Apr 2023 • Thanh-Dat Truong, Chi Nhan Duong, Ashley Dowling, Son Lam Phung, Jackson Cothren, Khoa Luu

First, a novel geometry-based constraint to cross-view adaptation is introduced based on the geometry correlation between views.

Autonomous Driving Scene Segmentation +1


Fairness in Visual Clustering: A Novel Transformer Clustering Approach

no code implementations • 14 Apr 2023 • Xuan-Bac Nguyen, Chi Nhan Duong, Marios Savvides, Kaushik Roy, Hugh Churchill, Khoa Luu

Promoting fairness for deep clustering models in unsupervised clustering settings to reduce demographic bias is a challenging goal.

Attribute Clustering +2


CoMaL: Conditional Maximum Likelihood Approach to Self-supervised Domain Adaptation in Long-tail Semantic Segmentation

no code implementations • 14 Apr 2023 • Thanh-Dat Truong, Chi Nhan Duong, Pierce Helton, Ashley Dowling, Xin Li, Khoa Luu

They are insufficient to model both global and local structures of a given image, especially in small regions of tail classes.

Domain Adaptation Segmentation +1


SoGAR: Self-supervised Spatiotemporal Attention-based Social Group Activity Recognition

no code implementations • 27 Apr 2023 • Naga VS Raviteja Chappa, Pha Nguyen, Alexander H Nelson, Han-Seok Seo, Xin Li, Page Daniel Dobbs, Khoa Luu

This paper introduces a novel approach to Social Group Activity Recognition (SoGAR) using Self-supervised Transformers network that can effectively utilize unlabeled video data.

Group Activity Recognition


Type-to-Track: Retrieve Any Object via Prompt-based Tracking

no code implementations • NeurIPS 2023 • Pha Nguyen, Kha Gia Quach, Kris Kitani, Khoa Luu

This paper introduces a novel paradigm for Multiple Object Tracking called Type-to-Track, which allows users to track objects in videos by typing natural language descriptions.

Grounded Multiple Object Tracking Multiple Object Tracking +1


Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective

no code implementations • 25 May 2023 • Thanh-Dat Truong, Khoa Luu

Then, we propose a new cross-view self-attention loss learned on unpaired cross-view data to enforce the self-attention mechanism learning to transfer knowledge across views.

Action Recognition


Z-GMOT: Zero-shot Generic Multiple Object Tracking

no code implementations • 28 May 2023 • Kim Hoang Tran, Anh Duy Le Dinh, Tien Phat Nguyen, Thinh Phan, Pha Nguyen, Khoa Luu, Donald Adjeroh, Gianfranco Doretto, Ngan Hoang Le

Our contributions are benchmarked through extensive experiments conducted on the Referring GMOT dataset for GMOT task.

Multi-Object Tracking Multiple Object Tracking +1


UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation

no code implementations • 16 Jun 2023 • Pha Nguyen, Kha Gia Quach, John Gauch, Samee U. Khan, Bhiksha Raj, Khoa Luu

Then, a new cross-domain MOT adaptation from existing datasets is proposed without any pre-defined human knowledge in understanding and modeling objects.

Domain Adaptation Multiple Object Tracking +1


The Algonauts Project 2023 Challenge: UARK-UAlbany Team Solution

1 code implementation • 1 Aug 2023 • Xuan-Bac Nguyen, Xudong Liu, Xin Li, Khoa Luu

The goal is to predict brain responses across the entire visual brain, as it is the region where the most reliable responses to images have been observed.


Quantum Vision Clustering

no code implementations • 18 Sep 2023 • Xuan Bac Nguyen, Hugh Churchill, Khoa Luu, Samee U. Khan

Unsupervised visual clustering has garnered significant attention in recent times, aiming to characterize distributions of unlabeled visual images through clustering based on a parameterized appearance approach.

Clustering


Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding

no code implementations • 26 Nov 2023 • Hoang-Quan Nguyen, Thanh-Dat Truong, Xuan Bac Nguyen, Ashley Dowling, Xin Li, Khoa Luu

In precision agriculture, the detection and recognition of insects play an essential role in the ability of crops to grow healthy and produce a high-quality yield.

Self-Supervised Learning


FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding in Open World

no code implementations • 27 Nov 2023 • Thanh-Dat Truong, Utsav Prabhu, Bhiksha Raj, Jackson Cothren, Khoa Luu

In particular, we first introduce a new Fairness Contrastive Clustering loss to address the problems of catastrophic forgetting and fairness.

Continual Learning Continual Semantic Segmentation +3


REACT: Recognize Every Action Everywhere All At Once

no code implementations • 27 Nov 2023 • Naga VS Raviteja Chappa, Pha Nguyen, Page Daniel Dobbs, Khoa Luu

Group Activity Recognition (GAR) is a fundamental problem in computer vision, with diverse applications in sports video analysis, video surveillance, and social scene understanding.

Action Recognition Decoder +3


HAtt-Flow: Hierarchical Attention-Flow Mechanism for Group Activity Scene Graph Generation in Videos

no code implementations • 28 Nov 2023 • Naga VS Raviteja Chappa, Pha Nguyen, Thi Hoang Ngan Le, Khoa Luu

Flow-Attention incorporates flow conservation principles, fostering competition for sources and allocation for sinks, effectively preventing the generation of trivial attention.

Graph Generation Scene Graph Generation +2


Brainformer: Modeling MRI Brain Functions to Machine Vision

no code implementations • 30 Nov 2023 • Xuan-Bac Nguyen, Xin Li, Samee U. Khan, Khoa Luu

In this work, we first present a simple yet effective Brainformer approach, a novel Transformer-based framework, to analyze the patterns of fMRI in the human perception system from the machine learning perspective.


HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding

no code implementations • 5 Dec 2023 • Trong-Thuan Nguyen, Pha Nguyen, Khoa Luu

In this paper, we delve into interactivities understanding within visual content by deriving scene graph representations from dense interactivities among humans and objects.

Graph Generation Position +3


Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy

no code implementations • 2 May 2024 • Hoang-Quan Nguyen, Thanh-Dat Truong, Khoa Luu

Thirdly, we built an action recognition model based on Video Transformers and Neural Radiance Fields.

Action Recognition

