Search Results for author: Xijun Wang

Found 25 papers, 6 papers with code

Trustworthy Image Semantic Communication with GenAI: Explainablity, Controllability, and Efficiency

no code implementations • 7 Aug 2024 • Xijun Wang, Dongshan Ye, Chenyuan Feng, Howard H. Yang, Xiang Chen, Tony Q. S. Quek

Image semantic communication (ISC) has garnered significant attention for its potential to achieve high efficiency in visual content transmission.

Semantic Communication

AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales

no code implementations • 4 Apr 2024 • Tianrui Guan, Ruiqi Xian, Xijun Wang, Xiyang Wu, Mohamed Elnoor, Daeun Song, Dinesh Manocha

We present AGL-NET, a novel learning-based method for global localization using LiDAR point clouds and satellite maps.

A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models

no code implementations • 15 Mar 2024 • Xijun Wang, Santiago López-Tapia, Alice Lucas, Xinyi Wu, Rafael Molina, Aggelos K. Katsaggelos

To reduce these artifacts and enhance the perceptual quality of the results, in this paper, we propose a general method that can be effectively used in most GAN-based super-resolution (SR) models by introducing essential spatial information into the training process.

Super-Resolution

Real-World Atmospheric Turbulence Correction via Domain Adaptation

no code implementations • 12 Feb 2024 • Xijun Wang, Santiago López-Tapia, Aggelos K. Katsaggelos

Atmospheric turbulence, a common phenomenon in daily life, is primarily caused by the uneven heating of the Earth's surface.

Domain Adaptation

Adv-Diffusion: Imperceptible Adversarial Face Identity Attack via Latent Diffusion Model

1 code implementation • 18 Dec 2023 • Decheng Liu, Xijun Wang, Chunlei Peng, Nannan Wang, Ruiming Hu, Xinbo Gao

Adversarial attacks involve adding perturbations to the source image to cause misclassification by the target model, which demonstrates the potential of attacking face recognition models.

Image Generation

Foundation Model Based Native AI Framework in 6G with Cloud-Edge-End Collaboration

no code implementations • 26 Oct 2023 • Xiang Chen, Zhiheng Guo, Xijun Wang, Howard H. Yang, Chenyuan Feng, Junshen Su, Sihui Zheng, Tony Q. S. Quek

Future wireless communication networks are in a position to move beyond data-centric, device-oriented connectivity and offer intelligent, immersive experiences based on task-oriented connections, especially in the context of the thriving development of pre-trained foundation models (PFM) and the evolving vision of 6G native artificial intelligence (AI).

ICAR: Image-based Complementary Auto Reasoning

no code implementations • 17 Aug 2023 • Xijun Wang, Anqi Liang, Junbang Liang, Ming Lin, Yu Lou, Shan Yang

Based on this notion, we propose a compatibility learning framework, a category-aware Flexible Bidirectional Transformer (FBT), for visual "scene-based set compatibility reasoning" with the cross-domain visual similarity input and auto-regressive complementary item generation.

Retrieval

SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers

no code implementations • 14 Aug 2023 • Xijun Wang, Xiaojie Chu, Chunrui Han, Xiangyu Zhang

This paper presents a module, Spatial Cross-scale Convolution (SCSC), which is verified to be effective in improving both CNNs and Transformers.

Face Recognition

Triplet Knowledge Distillation

no code implementations • 25 May 2023 • Xijun Wang, Dongyang Liu, Meina Kan, Chunrui Han, Zhongqin Wu, Shiguang Shan

Distillation then begins in an online manner, and the teacher is only allowed to express solutions within the aforementioned subspace.

Face Recognition, Image Classification, +1

SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition

no code implementations • 21 May 2023 • Xijun Wang, Ruiqi Xian, Tianrui Guan, Fuxiao Liu, Dinesh Manocha

In practice, we observe a 3.17-10.2% accuracy improvement on the aerial video datasets (Okutama, NECDrone), which consist of scenes with single-agent and multi-agent actions.

Action Recognition, Optical Flow Estimation, +1

Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization

no code implementations • 7 May 2023 • Xijun Wang, Aggelos K. Katsaggelos

To better learn these action category queries, we exploit not only the features of the current input video but also the correlation between different videos through a novel video-specific action category query learner that works with a query similarity loss.

Weakly-supervised Temporal Action Localization, Weakly Supervised Temporal Action Localization

DPP-based Client Selection for Federated Learning with Non-IID Data

no code implementations • 30 Mar 2023 • Yuxuan Zhang, Chao Xu, Howard H. Yang, Xijun Wang, Tony Q. S. Quek

This paper proposes a client selection (CS) method to tackle the communication bottleneck of federated learning (FL) while concurrently coping with FL's data heterogeneity issue.

Federated Learning

FAR: Fourier Aerial Video Recognition

1 code implementation • 21 Mar 2022 • Divya Kothandaraman, Tianrui Guan, Xijun Wang, Sean Hu, Ming Lin, Dinesh Manocha

Our formulation uses a novel Fourier object disentanglement method to innately separate out the human agent (which is typically small) from the background.

Action Recognition, Disentanglement, +1

Optimizing the Long-Term Average Reward for Continuing MDPs: A Technical Report

no code implementations • 13 Apr 2021 • Chao Xu, Yiping Xie, Xijun Wang, Howard H. Yang, Dusit Niyato, Tony Q. S. Quek

cost), by integrating R-learning, a tabular reinforcement learning (RL) algorithm tailored for maximizing the long-term average reward, and traditional DRL algorithms, initially developed to optimize the discounted long-term cumulative reward rather than the average one.

reinforcement-learning, Reinforcement Learning (RL)

Dynamic Region-Aware Convolution

no code implementations • CVPR 2021 • Jin Chen, Xijun Wang, Zichao Guo, Xiangyu Zhang, Jian Sun

More gracefully, our DRConv transfers the increasing channel-wise filters to the spatial dimension with a learnable instructor, which not only improves the representation ability of convolution, but also maintains the computational cost and translation-invariance of standard convolution.

Face Recognition, General Classification, +2

Fully Learnable Group Convolution for Acceleration of Deep Neural Networks

no code implementations • CVPR 2019 • Xijun Wang, Meina Kan, Shiguang Shan, Xilin Chen

Benefiting from its great success on many tasks, deep learning is increasingly used on low-computational-cost devices, e.g., smartphones, embedded devices, etc.
