Search Results for author: Xiaohao Xu

Found 23 papers, 19 papers with code

Customizing Visual-Language Foundation Models for Multi-modal Anomaly Detection and Reasoning

1 code implementation17 Mar 2024 Xiaohao Xu, Yunkang Cao, Yongqi Chen, Weiming Shen, Xiaonan Huang

In addition, we unify the input representation of multi-modality into a 2D image format, enabling multi-modal anomaly detection and reasoning.

Anomaly Detection

GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection

1 code implementation10 Mar 2024 Huaxin Zhang, Xiang Wang, Xiaohao Xu, Xiaonan Huang, Chuchu Han, Yuehuan Wang, Changxin Gao, Shanjun Zhang, Nong Sang

In recent years, video anomaly detection has been extensively investigated in both unsupervised and weakly supervised settings to alleviate costly temporal labeling.

Anomaly Detection Video Anomaly Detection

$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

2 code implementations7 Mar 2024 Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazak, Hao Chen, Xiaonan Huang, Bhiksha Raj

Referring perception, which aims at grounding visual objects with multimodal referring guidance, is essential for bridging the gap between humans, who provide instructions, and the environment where intelligent systems perceive.

Benchmarking

Customizable Perturbation Synthesis for Robust SLAM Benchmarking

1 code implementation12 Feb 2024 Xiaohao Xu, Tianyi Zhang, Sibo Wang, Xiang Li, Yongqi Chen, Ye Li, Bhiksha Raj, Matthew Johnson-Roberson, Xiaonan Huang

To this end, we propose a novel, customizable pipeline for noisy data synthesis, aimed at assessing the resilience of multi-modal SLAM models against various perturbations.

Benchmarking Simultaneous Localization and Mapping

A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect

no code implementations29 Jan 2024 Yunkang Cao, Xiaohao Xu, Jiangning Zhang, Yuqi Cheng, Xiaonan Huang, Guansong Pang, Weiming Shen

Visual Anomaly Detection (VAD) endeavors to pinpoint deviations from the concept of normality in visual data, widely applied across diverse domains, e. g., industrial defect inspection, and medical lesion detection.

Anomaly Detection Lesion Detection

Generative Denoise Distillation: Simple Stochastic Noises Induce Efficient Knowledge Transfer for Dense Prediction

1 code implementation16 Jan 2024 Zhaoge Liu, Xiaohao Xu, Yunkang Cao, Weiming Shen

Knowledge distillation is the process of transferring knowledge from a more powerful large model (teacher) to a simpler counterpart (student).

Instance Segmentation Knowledge Distillation +5

Towards Transferable Multi-modal Perception Representation Learning for Autonomy: NeRF-Supervised Masked AutoEncoder

no code implementations23 Nov 2023 Xiaohao Xu

This work proposes a unified self-supervised pre-training framework for transferable multi-modal perception representation learning via masked multi-modal reconstruction in Neural Radiance Field (NeRF), namely NeRF-Supervised Masked AutoEncoder (NS-MAE).

3D Object Detection Neural Rendering +2

Towards Generic Anomaly Detection and Understanding: Large-scale Visual-linguistic Model (GPT-4V) Takes the Lead

1 code implementation5 Nov 2023 Yunkang Cao, Xiaohao Xu, Chen Sun, Xiaonan Huang, Weiming Shen

This study explores the use of GPT-4V(ision), a powerful visual-linguistic model, to address anomaly detection tasks in a generic manner.

3D Anomaly Detection Time Series

QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition

3 code implementations29 Sep 2023 Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj

We propose a semantic decomposition method based on product quantization, where the multi-source semantics can be decomposed and represented by several disentangled and noise-suppressed single-source semantics.

Quantization

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

1 code implementation21 Sep 2023 Shilin Yan, Xiaohao Xu, Renrui Zhang, Lingyi Hong, Wenchao Chen, Wenqiang Zhang, Wei zhang

Our dataset poses new challenges in panoramic VOS and we hope that our PanoVOS can advance the development of panoramic segmentation/tracking.

Autonomous Driving Segmentation +4

2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection

1 code implementation15 Jun 2023 Yunkang Cao, Xiaohao Xu, Chen Sun, Yuqi Cheng, Liang Gao, Weiming Shen

This technical report introduces the winning solution of the team Segment Any Anomaly for the CVPR2023 Visual Anomaly and Novelty Detection (VAND) challenge.

Anomaly Detection Novelty Detection +2

Segment Any Anomaly without Training via Hybrid Prompt Regularization

2 code implementations18 May 2023 Yunkang Cao, Xiaohao Xu, Chen Sun, Yuqi Cheng, Zongwei Du, Liang Gao, Weiming Shen

We present a novel framework, i. e., Segment Any Anomaly + (SAA+), for zero-shot anomaly segmentation with hybrid prompt regularization to improve the adaptability of modern foundation models.

Anomaly Detection Segmentation +1

Collaborative Discrepancy Optimization for Reliable Image Anomaly Localization

1 code implementation IEEE Transactions on Industrial Informatics 2023 Yunkang Cao, Xiaohao Xu, Zhaoge Liu, Weiming Shen

CDO introduces a margin optimization module and an overlap optimization module to optimize the two key factors determining the localization performance, i. e., the margin and the overlap between the discrepancy distributions (DDs) of normal and abnormal samples.

 Ranked #1 on Anomaly Detection on MVTEC 3D-AD (using extra training data)

Anomaly Detection

Optimization of Forcemyography Sensor Placement for Arm Movement Recognition

1 code implementation22 Jul 2022 Xiaohao Xu, Zihao Du, Huaxin Zhang, Ruichao Zhang, Zihan Hong, Qin Huang, Bin Han

To study the effectiveness of our optimization algorithm, a dataset for mechanical maintenance tasks using FMG armbands with 16 sensors is collected.

Online Video Instance Segmentation via Robust Context Fusion

no code implementations12 Jul 2022 Xiang Li, Jinglu Wang, Xiaohao Xu, Bhiksha Raj, Yan Lu

We propose a robust context fusion network to tackle VIS in an online fashion, which predicts instance segmentation frame-by-frame with a few preceding frames.

Instance Segmentation Segmentation +2

Towards Robust Video Object Segmentation with Adaptive Object Calibration

1 code implementation2 Jul 2022 Xiaohao Xu, Jinglu Wang, Xiang Ming, Yan Lu

We consolidate this conditional mask calibration process in a progressive manner, where the object representations and proto-masks evolve to be discriminative iteratively.

Object Segmentation +5

Reliable Propagation-Correction Modulation for Video Object Segmentation

1 code implementation6 Dec 2021 Xiaohao Xu, Jinglu Wang, Xiao Li, Yan Lu

We introduce two modulators, propagation and correction modulators, to separately perform channel-wise re-calibration on the target frame embeddings according to local temporal correlations and reliable references respectively.

Object Semantic Segmentation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.