Search Results for author: Xu Zheng

Found 44 papers, 10 papers with code

MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection

no code implementations22 Dec 2024 Xu Zheng, Yuanhuiyi Lyu, Lutao Jiang, Jiazhou Zhou, Lin Wang, Xuming Hu

This way, our method can eliminate the dependence on RGB modality at every feature granularity and better overcome sensor failures and environmental noises while ensuring the segmentation performance.

Semantic Segmentation

Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts

no code implementations5 Dec 2024 Chenyang Zhu, Bin Xiao, Lin Shi, Shoukun Xu, Xu Zheng

The recent Segment Anything Model (SAM) represents a significant breakthrough in scaling segmentation models, delivering strong performance across various downstream applications in the RGB modality.

Segmentation Semantic Segmentation

Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation

1 code implementation26 Nov 2024 Xu Zheng, Haiwei Xue, Jialei Chen, Yibo Yan, Lutao Jiang, Yuanhuiyi Lyu, Kailun Yang, Linfeng Zhang, Xuming Hu

The cross-modal and unimodal distillation is then achieved in the multi scale representation space by transferring the feature level knowledge from multimodal to anymodal segmentors, aiming at addressing the unimodal bias and avoiding over-reliance on specific modalities.

SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context

no code implementations25 Nov 2024 Jungang Li, Sicheng Tao, Yibo Yan, Xiaojie Gu, Haodong Xu, Xu Zheng, Yuanhuiyi Lyu, Linfeng Zhang, Xuming Hu

Endeavors have been made to explore Large Language Models for video analysis (Video-LLMs), particularly in understanding and interpreting long videos.

Large Language Model MME +2

Explanation-Preserving Augmentation for Semi-Supervised Graph Representation Learning

1 code implementation16 Oct 2024 Zhuomin Chen, Jingchao Ni, Hojat Allah Salehi, Xu Zheng, Esteban Schafir, Farhad Shirani, Dongsheng Luo

Graph representation learning (GRL), enhanced by graph augmentation methods, has emerged as an effective technique achieving performance improvements in wide tasks such as node classification and graph classification.

Graph Classification Graph Representation Learning +1

F-Fidelity: A Robust Framework for Faithfulness Evaluation of Explainable AI

no code implementations3 Oct 2024 Xu Zheng, Farhad Shirani, Zhuomin Chen, Chaohao Lin, Wei Cheng, Wenbo Guo, Dongsheng Luo

This approach although efficient suffers the Out-of-Distribution (OOD) problem as the perturbed samples may no longer follow the original data distribution.

EventDance++: Language-guided Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition

no code implementations19 Sep 2024 Xu Zheng, Lin Wang

This enables the creation of surrogate images to extract knowledge (i. e., labels) from the source model.

Object Recognition

GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation

no code implementations17 Aug 2024 Weiming Zhang, Yexin Liu, Xu Zheng, Lin Wang

The `out-of-the-box' insight of GoodSAM++ is to introduce a teacher assistant (TA) to provide semantic information for SAM, integrated with SAM to obtain reliable pseudo semantic maps to bridge both domain and capacity gaps.

Domain Adaptation Instance Segmentation +2

Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation

no code implementations16 Jul 2024 Xu Zheng, Yuanhuiyi Lyu, Jiazhou Zhou, Lin Wang

On top, a unified arbitrary-modal selection module is proposed to utilize the aggregated features as the benchmark to rank the multi-modal features based on the similarity scores.

Semantic Segmentation

Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities

no code implementations16 Jul 2024 Xu Zheng, Yuanhuiyi Lyu, Lin Wang

Specifically, we first introduce a novel language-guided semantic correlation distillation (LSCD) module to transfer both inter-modal and intra-modal semantic knowledge in the embedding space from MVLMs, e. g., LanguageBind.

Knowledge Distillation Semantic Segmentation

EIT-1M: One Million EEG-Image-Text Pairs for Human Visual-textual Recognition and More

no code implementations2 Jul 2024 Xu Zheng, Ling Wang, Kanghao Chen, Yuanhuiyi Lyu, Jiazhou Zhou, Lin Wang

To verify the effectiveness of EIT-1M, we provide an in-depth analysis of EEG data captured from multi-modal stimuli across different categories and participants, along with data quality scores for transparency.

EEG Object Recognition

OmniBind: Teach to Build Unequal-Scale Modality Interaction for Omni-Bind of All

no code implementations25 May 2024 Yuanhuiyi Lyu, Xu Zheng, Dahun Kim, Lin Wang

Specifically, we propose Cross-modal Alignment Distillation (CAD) to address the unequal-scale problem between student and teacher modalities and effectively align student modalities into the teacher modalities' representation space in stage one.

cross-modal alignment

TimeX++: Learning Time-Series Explanations with Information Bottleneck

2 code implementations15 May 2024 Zichuan Liu, Tianchun Wang, Jimeng Shi, Xu Zheng, Zhuomin Chen, Lei Song, Wenqian Dong, Jayantha Obeysekera, Farhad Shirani, Dongsheng Luo

The design of the objective function builds upon the principle of information bottleneck (IB), and modifies the IB objective function to avoid trivial solutions and distributional shift issues.

Time Series

360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes

no code implementations25 Apr 2024 Xu Zheng, Pengyuan Zhou, Athanasios V. Vasilakos, Lin Wang

However, as the distinct projections make it less possible to directly transfer knowledge between domains, we then propose Reliable Panoramic Prototype Adaptation Module (RP2AM) to transfer knowledge at both prediction and prototype levels.

ERP Semantic Segmentation +1

GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation

no code implementations CVPR 2024 Weiming Zhang, Yexin Liu, Xu Zheng, Lin Wang

To this end, we propose a novel framework, called GoodSAM, that introduces a teacher assistant (TA) to provide semantic information, integrated with SAM to generate ensemble logits to achieve knowledge transfer.

Domain Adaptation Instance Segmentation +3

EventDance: Unsupervised Source-free Cross-modal Adaptation for Event-based Object Recognition

no code implementations CVPR 2024 Xu Zheng, Lin Wang

To this end, we propose a novel framework, dubbed EventDance for this unsupervised source-free cross-modal adaptation problem.

Object Recognition Transfer Learning +1

Semantics, Distortion, and Style Matter: Towards Source-free UDA for Panoramic Segmentation

no code implementations19 Mar 2024 Xu Zheng, Pengyuan Zhou, Athanasios V. Vasilakos, Lin Wang

However, the distinct projection discrepancies between source and target domains impede the direct knowledge transfer; thus, we propose a panoramic prototype adaptation module (PPAM) to integrate panoramic prototypes from the extracted knowledge for adaptation.

ERP Semantic Segmentation +2

UniBind: LLM-Augmented Unified and Balanced Representation Space to Bind Them All

no code implementations CVPR 2024 Yuanhuiyi Lyu, Xu Zheng, Jiazhou Zhou, Lin Wang

To make this possible, we 1) construct a knowledge base of text embeddings with the help of LLMs and multi-modal LLMs; 2) adaptively build LLM-augmented class-wise embedding center on top of the knowledge base and encoded visual embeddings; 3) align all the embeddings to the LLM-augmented embedding center via contrastive learning to achieve a unified and balanced representation space.

Contrastive Learning Zero-Shot Learning

ExACT: Language-guided Conceptual Reasoning and Uncertainty Estimation for Event-based Action Recognition and More

1 code implementation CVPR 2024 Jiazhou Zhou, Xu Zheng, Yuanhuiyi Lyu, Lin Wang

Then, we propose a conceptual reasoning-based uncertainty estimation module, which simulates the recognition process to enrich the semantic representation.

Action Recognition

BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D Synthesis

1 code implementation17 Mar 2024 Lutao Jiang, Xu Zheng, Yuanhuiyi Lyu, Jiazhou Zhou, Lin Wang

However, a hurdle of existing methods is the low efficiency, per-prompt optimization for a single 3D object.

3D Generation Text to 3D

Parametric Augmentation for Time Series Contrastive Learning

1 code implementation16 Feb 2024 Xu Zheng, Tianchun Wang, Wei Cheng, Aitian Ma, Haifeng Chen, Mo Sha, Dongsheng Luo

In this study, we address this gap by analyzing time series data augmentation using information theory and summarizing the most commonly adopted augmentations in a unified format.

Contrastive Learning Data Augmentation +2

PAC Learnability under Explanation-Preserving Graph Perturbations

no code implementations7 Feb 2024 Xu Zheng, Farhad Shirani, Tianchun Wang, Shouwei Gao, Wenqian Dong, Wei Cheng, Dongsheng Luo

It is shown that the sample complexity of explanation-assisted learning can be arbitrarily smaller than explanation-agnostic learning.

Data Augmentation

Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation

no code implementations31 Jan 2024 Yuanhuiyi Lyu, Xu Zheng, Lin Wang

It extracts entity features from the multi-modal representations powered by our specially constructed entity knowledge graph; 2) Attribute Fusion Branch adeptly preserves and processes the attributes.

Attribute Image Generation

Semantics Distortion and Style Matter: Towards Source-free UDA for Panoramic Segmentation

no code implementations CVPR 2024 Xu Zheng, Pengyuan Zhou, Athanasios V. Vasilakos, Lin Wang

However the distinct projection discrepancies between source and target domains impede the direct knowledge transfer; thus we propose a panoramic prototype adaptation module (PPAM) to integrate panoramic prototypes from the extracted knowledge for adaptation.

ERP Semantic Segmentation +2

Distilling Efficient Vision Transformers from CNNs for Semantic Segmentation

no code implementations11 Oct 2023 Xu Zheng, Yunhao Luo, Pengyuan Zhou, Lin Wang

Due to the completely different characteristics of ViT and CNN and the long-existing capacity gap between teacher and student models in Knowledge Distillation (KD), directly transferring the cross-model knowledge is non-trivial.

Knowledge Distillation Semantic Segmentation

CLIP Is Also a Good Teacher: A New Learning Framework for Inductive Zero-shot Semantic Segmentation

no code implementations3 Oct 2023 Jialei Chen, Daisuke Deguchi, Chenkai Zhang, Xu Zheng, Hiroshi Murase

Moreover, to enhance the ability to discriminate unseen categories, PLM consisting of pseudo labels and weight generation is designed.

Segmentation Semantic Segmentation +1

Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks

1 code implementation3 Oct 2023 Xu Zheng, Farhad Shirani, Tianchun Wang, Wei Cheng, Zhuomin Chen, Haifeng Chen, Hua Wei, Dongsheng Luo

An explanation function for GNNs takes a pre-trained GNN along with a graph as input, to produce a `sufficient statistic' subgraph with respect to the graph label.

Decision Making

Chasing Day and Night: Towards Robust and Efficient All-Day Object Detection Guided by an Event Camera

1 code implementation17 Sep 2023 Jiahang Cao, Xu Zheng, Yuanhuiyi Lyu, Jiaxu Wang, Renjing Xu, Lin Wang

The ability to detect objects in all lighting (i. e., normal-, over-, and under-exposed) conditions is crucial for real-world applications, such as self-driving. Traditional RGB-based detectors often fail under such varying lighting conditions. Therefore, recent works utilize novel event cameras to supplement or guide the RGB modality; however, these methods typically adopt asymmetric network structures that rely predominantly on the RGB modality, resulting in limited robustness for all-day detection.

Novel Object Detection object-detection +2

Look at the Neighbor: Distortion-aware Unsupervised Domain Adaptation for Panoramic Semantic Segmentation

no code implementations ICCV 2023 Xu Zheng, Tianbo Pan, Yunhao Luo, Lin Wang

The aim is to tackle the domain gaps caused by the style disparities and distortion problem from the non-uniformly distributed pixels of equirectangular projection (ERP).

ERP Semantic Segmentation +1

EventBind: Learning a Unified Representation to Bind Them All for Event-based Open-world Understanding

no code implementations6 Aug 2023 Jiazhou Zhou, Xu Zheng, Yuanhuiyi Lyu, Lin Wang

Accordingly, we first introduce a novel event encoder that subtly models the temporal information from events and meanwhile, generates event prompts for modality bridging.

Transfer Learning

A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation

no code implementations ICCV 2023 Jinjing Zhu, Yunhao Luo, Xu Zheng, Hao Wang, Lin Wang

In this paper, we strive to answer the question "how to collaboratively learn convolutional neural network (CNN)-based and vision transformer (ViT)-based models by selecting and exchanging the reliable knowledge between them for semantic segmentation?"

Knowledge Distillation Semantic Segmentation

Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation

no code implementations CVPR 2023 Xu Zheng, Jinjing Zhu, Yexin Liu, Zidong Cao, Chong Fu, Lin Wang

Moreover, adversarial intra-projection training is proposed to reduce the inherent gap, between the features of the pinhole images and those of the ERP and TP images, respectively.

ERP Scene Understanding +2

Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks

1 code implementation17 Feb 2023 Xu Zheng, Yexin Liu, Yunfan Lu, Tongyan Hua, Tianbo Pan, Weiming Zhang, DaCheng Tao, Lin Wang

Event cameras are bio-inspired sensors that capture the per-pixel intensity changes asynchronously and produce event streams encoding the time, pixel position, and polarity (sign) of the intensity changes.

Deblurring Deep Learning +6

Evaluation of the Synthetic Electronic Health Records

no code implementations16 Oct 2022 Emily Muller, Xu Zheng, Jer Hayes

Generative models have been found effective for data synthesis due to their ability to capture complex underlying data distributions.

Open-Ended Question Answering Synthetic Data Evaluation

Transformer-CNN Cohort: Semi-supervised Semantic Segmentation by the Best of Both Students

no code implementations6 Sep 2022 Xu Zheng, Yunhao Luo, Chong Fu, Kangcheng Liu, Lin Wang

To this end, we propose class-aware feature consistency distillation (CFCD) that first leverages the outputs of each student as the pseudo labels and generates class-aware feature (CF) maps for knowledge transfer between the two students.

Semi-Supervised Semantic Segmentation Transfer Learning

Priors in Deep Image Restoration and Enhancement: A Survey

1 code implementation4 Jun 2022 Yunfan Lu, Yiqi Lin, Hao Wu, Yunhao Luo, Xu Zheng, Hui Xiong, Lin Wang

Image restoration and enhancement is a process of improving the image quality by removing degradations, such as noise, blur, and resolution degradation.

Image Restoration Survey

Synthesising Electronic Health Records: Cystic Fibrosis Patient Group

no code implementations14 Jan 2022 Emily Muller, Xu Zheng, Jer Hayes

Deep generative models are effective data synthesisers due to their ability to capture complex underlying distributions.

Uncertainty-Aware Deep Co-training for Semi-supervised Medical Image Segmentation

no code implementations23 Nov 2021 Xu Zheng, Chong Fu, Haoyu Xie, Jialei Chen, Xingwei Wang, Chiu-Wing Sham

However, due to the scarcity of labeled data, the features extracted by the models are limited in supervised learning, and the quality of predictions for unlabeled data also cannot be guaranteed.

Image Segmentation Semantic Segmentation +1

A Graph-based Imputation Method for Sparse Medical Records

no code implementations17 Nov 2021 Ramon Vinas, Xu Zheng, Jer Hayes

Our work can facilitate the diagnosis of novel diseases based on the clinical history of past events, with the potential to increase our understanding of the landscape of comorbidities.

Imputation

Network Generation with Differential Privacy

no code implementations17 Nov 2021 Xu Zheng, Nicholas McCarthy, Jer Hayes

Differential privacy is a gold standard for data privacy, and the introduction of the differentially private stochastic gradient descent (DP-SGD) algorithm has facilitated the training of private neural models in a number of domains.

Graph Generation

Fröhlich Condensate of Phonons in Optomechanical Systems

no code implementations19 Jan 2021 Xu Zheng, Baowen Li

We propose that the optomechanical systems can be potential platforms to implement the Fr\"{o}hlich condensate of phonons.

Quantum Physics Mesoscale and Nanoscale Physics

STaDA: Style Transfer as Data Augmentation

no code implementations3 Sep 2019 Xu Zheng, Tejo Chalasani, Koustav Ghosal, Sebastian Lutz, Aljosa Smolic

The success of training deep Convolutional Neural Networks (CNNs) heavily depends on a significant amount of labelled data.

Classification Data Augmentation +3

Beyond Keywords and Relevance: A Personalized Ad Retrieval Framework in E-Commerce Sponsored Search

no code implementations29 Dec 2017 Su Yan, Wei. Lin, Tianshu Wu, Daorui Xiao, Xu Zheng, Bo Wu, Kaipeng Liu

Given a search request, ad retrieval module rewrites the query into bidding keywords, and uses these keywords as keys to select Top N ads through inverted indexes.

Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.