Search Results for author: Zhaoyang Zhang

Found 60 papers, 17 papers with code

Evolving Semantic Communication with Generative Model

1 code implementation29 Mar 2024 Shunpu Tang, Qianqian Yang, Deniz Gündüz, Zhaoyang Zhang

In this paper, we explore an evolving semantic communication system for image transmission, referred to as ESemCom, with the capability to continuously enhance transmission efficiency.

Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding

1 code implementation27 Mar 2024 Run Shao, Zhaoyang Zhang, Chao Tao, Yunsheng Zhang, Chengli Peng, Haifeng Li

Compared to Patch Embed, which requires more than one hundred tokens for one image, HOOK requires only 6 and 8 tokens for sparse and dense tasks, respectively, resulting in efficiency improvements of 1. 5 to 2. 8 times.

Language Modelling Large Language Model

TernaryVote: Differentially Private, Communication Efficient, and Byzantine Resilient Distributed Optimization on Heterogeneous Data

no code implementations16 Feb 2024 Richeng Jin, Yujie Gu, Kai Yue, Xiaofan He, Zhaoyang Zhang, Huaiyu Dai

In this paper, we propose TernaryVote, which combines a ternary compressor and the majority vote mechanism to realize differential privacy, gradient compression, and Byzantine resilience simultaneously.

Distributed Optimization

Channel Mapping Based on Interleaved Learning with Complex-Domain MLP-Mixer

no code implementations7 Jan 2024 Zirui Chen, Zhaoyang Zhang, Zhaohui Yang, Lei Liu

For such a channel mapping task, inspired by the intrinsic coupling across the space and frequency domains, this letter proposes to use interleaved learning with partial antenna and subcarrier characteristics to represent the whole MIMO-OFDM channel.

Representation Learning

Point Cloud in the Air

no code implementations1 Jan 2024 Yulin Shao, Chenghong Bian, Li Yang, Qianqian Yang, Zhaoyang Zhang, Deniz Gunduz

Acquisition and processing of point clouds (PCs) is a crucial enabler for many emerging applications reliant on 3D spatial data, such as robot navigation, autonomous vehicles, and augmented reality.

Autonomous Vehicles Robot Navigation

Cached Transformers: Improving Transformers with Differentiable Memory Cache

1 code implementation20 Dec 2023 Zhaoyang Zhang, Wenqi Shao, Yixiao Ge, Xiaogang Wang, Jinwei Gu, Ping Luo

This work introduces a new Transformer model called Cached Transformer, which uses Gated Recurrent Cached (GRC) attention to extend the self-attention mechanism with a differentiable memory cache of tokens.

Image Classification Instance Segmentation +6

Robust Target Detection of Intelligent Integrated Optical Camera and mmWave Radar System

no code implementations12 Dec 2023 Chen Zhu, Zhouxiang Zhao, Zejing Shan, Lijie Yang, Sijie Ji, Zhaohui Yang, Zhaoyang Zhang

To improve the target detection performance under complex real-world scenarios, this paper proposes an intelligent integrated optical camera and millimeter-wave (mmWave) radar system.

AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion

1 code implementation16 Oct 2023 Yitong Jiang, Zhaoyang Zhang, Tianfan Xue, Jinwei Gu

To this end, we propose an all-in-one image restoration framework with latent diffusion (AutoDIR), which can automatically detect and address multiple unknown degradations.

Blind Image Quality Assessment Image Restoration

Semantic Information Extraction for Text Data with Probability Graph

no code implementations16 Sep 2023 Zhouxiang Zhao, Zhaohui Yang, Ye Hu, Licheng Lin, Zhaoyang Zhang

In this paper, the problem of semantic information extraction for resource constrained text data transmission is studied.

Semantic Similarity Semantic Textual Similarity

Mean Field Game-based Waveform Precoding Design for Mobile Crowd Integrated Sensing, Communication, and Computation Systems

no code implementations6 Sep 2023 Dezhi Wang, Chongwen Huang, Jiguang He, Xiaoming Chen, Wei Wang, Zhaoyang Zhang, Zhu Han, Mérouane Debbah

In this paper, we consider the environment sensing problem in the large-scale mobile crowd ISCC systems and propose an efficient waveform precoding design algorithm based on the mean field game~(MFG).

Deep Joint Source-Channel Coding for Wireless Image Transmission with Entropy-Aware Adaptive Rate Control

no code implementations5 Jun 2023 Weixuan Chen, Yuhao Chen, Qianqian Yang, Chongwen Huang, Qian Wang, Zhaoyang Zhang

Adaptive rate control for deep joint source and channel coding (JSCC) is considered as an effective approach to transmit sufficient information in scenarios with limited communication resources.

MIMO Precoding Design with QoS and Per-Antenna Power Constraints

no code implementations4 Jun 2023 Kaiyi Chi, Yingzhi Huang, Qianqian Yang, Zhaohui Yang, Zhaoyang Zhang

Precoding design for the downlink of multiuser multiple-input multiple-output (MU-MIMO) systems is a fundamental problem.

Distributed Learning over Networks with Graph-Attention-Based Personalization

1 code implementation22 May 2023 Zhuojun Tian, Zhaoyang Zhang, Zhaohui Yang, Richeng Jin, Huaiyu Dai

In conventional distributed learning over a network, multiple agents collaboratively build a common machine learning model.

Graph Attention

Semantic-aware Digital Twin for Metaverse: A Comprehensive Review

no code implementations12 May 2023 Senthil Kumar Jagatheesaperumal, Zhaohui Yang, Qianqian Yang, Chongwen Huang, Wei Xu, Mohammad Shikh-Bahaei, Zhaoyang Zhang

To facilitate the deployment of digital twins in Metaverse, the paradigm with semantic awareness has been proposed as a means for enabling accurate and task-oriented information extraction with inherent intelligence.

Management

Musketeer: Joint Training for Multi-task Vision Language Model with Task Explanation Prompts

1 code implementation11 May 2023 Zhaoyang Zhang, Yantao Shen, Kunyu Shi, Zhaowei Cai, Jun Fang, Siqi Deng, Hao Yang, Davide Modolo, Zhuowen Tu, Stefano Soatto

We present a vision-language model whose parameters are jointly trained on all tasks and fully shared among multiple heterogeneous tasks which may interfere with each other, resulting in a single model which we named Musketeer.

Language Modelling

From Data-driven Learning to Physics-inspired Inferring: A Novel Mobile MIMO Channel Prediction Scheme Based on Neural ODE

no code implementations9 Apr 2023 Zhuoran Xiao, Zhaoyang Zhang, Zirui Chen, Zhaohui Yang, Chongwen Huang, Xiaoming Chen

Then, we design a novel physics-inspired spatial channel gradient network (SCGnet), which represents the derivative process of channel varying as a special neural network and can obtain the gradients at any relative displacement needed for the ODE solving.

Real-time Controllable Denoising for Image and Video

1 code implementation CVPR 2023 Zhaoyang Zhang, Yitong Jiang, Wenqi Shao, Xiaogang Wang, Ping Luo, Kaimo Lin, Jinwei Gu

Controllable image denoising aims to generate clean samples with human perceptual priors and balance sharpness and smoothness.

Image Denoising Video Denoising

Robust mmWave Beamforming by Self-Supervised Hybrid Deep Learning

no code implementations9 Mar 2023 Fenghao Zhu, Bohao Wang, Zhaohui Yang, Chongwen Huang, Zhaoyang Zhang, George C. Alexandropoulos, Chau Yuen, Merouane Debbah

Beamforming with large-scale antenna arrays has been widely used in recent years, which is acknowledged as an important part in 5G and incoming 6G.

Breaking the Communication-Privacy-Accuracy Tradeoff with $f$-Differential Privacy

no code implementations NeurIPS 2023 Richeng Jin, Zhonggen Su, Caijun Zhong, Zhaoyang Zhang, Tony Quek, Huaiyu Dai

We consider a federated data analytics problem in which a server coordinates the collaborative data analysis of multiple users with privacy concerns and limited communication capability.

Data Compression Federated Learning

Distributed Machine Learning for UAV Swarms: Computing, Sensing, and Semantics

no code implementations3 Jan 2023 Yahao Ding, Zhaohui Yang, Quoc-Viet Pham, Zhaoyang Zhang, Mohammad Shikh-Bahaei

In this survey, we first introduce several popular DL algorithms such as federated learning (FL), multi-agent Reinforcement Learning (MARL), distributed inference, and split learning, and present a comprehensive overview of their applications for UAV swarms, such as trajectory design, power control, wireless resource allocation, user assignment, perception, and satellite communications.

Federated Learning Multi-agent Reinforcement Learning

WAIR-D: Wireless AI Research Dataset

no code implementations5 Dec 2022 Yourui Huangfu, Jian Wang, Shengchen Dai, Rong Li, Jun Wang, Chongwen Huang, Zhaoyang Zhang

The statistical data hinder the trained AI models from further fine-tuning for a specific scenario, and ray-tracing data with limited environments lower down the generalization capability of the trained AI models.

Intelligent Communication

Holographic MIMO Communications: Theoretical Foundations, Enabling Technologies, and Future Directions

no code implementations2 Dec 2022 Tierui Gong, Panagiotis Gavriilidis, Ran Ji, Chongwen Huang, George C. Alexandropoulos, Li Wei, Zhaoyang Zhang, Mérouane Debbah, H. Vincent Poor, Chau Yuen

In this survey, we present a comprehensive overview of the latest advances in the HMIMO communications paradigm, with a special focus on their physical aspects, their theoretical foundations, as well as the enabling technologies for HMIMO systems.

Generative Model Based Highly Efficient Semantic Communication Approach for Image Transmission

no code implementations18 Nov 2022 Tianxiao Han, Jiancheng Tang, Qianqian Yang, Yiping Duan, Zhaoyang Zhang, Zhiguo Shi

Deep learning (DL) based semantic communication methods have been explored to transmit images efficiently in recent years.

False: False Negative Samples Aware Contrastive Learning for Semantic Segmentation of High-Resolution Remote Sensing Image

2 code implementations15 Nov 2022 Zhaoyang Zhang, Xuying Wang, Xiaoming Mei, Chao Tao, Haifeng Li

This indicates that the SSCL model has the ability to self-differentiate FNS and that the FALSE effectively mitigates the SCI in self-supervised contrastive learning.

Contrastive Learning Segmentation +1

Over-the-Air Split Learning with MIMO-Based Neural Network and Constellation-Based Activation

no code implementations8 Oct 2022 Yuzhi Yang, Zhaoyang Zhang, Zhaohui Yang

The precoding and combining matrices are trainable parameters in such a system, whereas the MIMO channel is implicit.

Over-the-Air Split Machine Learning in Wireless MIMO Networks

no code implementations7 Oct 2022 Yuzhi Yang, Zhaoyang Zhang, Yuqing Tian, Zhaohui Yang, Chongwen Huang, Caijun Zhong, Kai-Kit Wong

In such a split ML system, the precoding and combining matrices are regarded as trainable parameters, while MIMO channel matrix is regarded as unknown (implicit) parameters.

Deep Learning-Based Rate-Splitting Multiple Access for Reconfigurable Intelligent Surface-Aided Tera-Hertz Massive MIMO

no code implementations18 Sep 2022 Minghui Wu, Zhen Gao, Yang Huang, Zhenyu Xiao, Derrick Wing Kwan Ng, Zhaoyang Zhang

Then, to acquire accurate CSI at the BS for the investigated RSMA precoding scheme to achieve higher spectral efficiency, we propose a CSI acquisition network (CAN) with low pilot and feedback signaling overhead, where the downlink pilot transmission, CSI feedback at the user equipments (UEs), and CSI reconstruction at the BS are modeled as an end-to-end neural network based on Transformer.

Mobile MIMO Channel Prediction with ODE-RNN: a Physics-Inspired Adaptive Approach

no code implementations8 Jul 2022 Zhuoran Xiao, Zhaoyang Zhang, Zirui Chen, Zhaohui Yang, Richeng Jin

Through exploring the intrinsic correlation among a set of historical CSI instances randomly obtained in a certain communication environment, channel prediction can significantly increase CSI accuracy and save signaling overhead.

Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space

1 code implementation7 Jul 2022 Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo

It is challenging because the ground-truth model ranking for each task can only be generated by fine-tuning the pre-trained models on the target dataset, which is brute-force and computationally expensive.

Transferability

Environment Sensing Considering the Occlusion Effect: A Multi-View Approach

no code implementations2 Jul 2022 Xin Tong, Zhaoyang Zhang, Yihan Zhang, Zhaohui Yang, Chongwen Huang, Kai-Kit Wong, Merouane Debbah

In this paper, we consider the problem of sensing the environment within a wireless cellular framework.

Semantic-preserved Communication System for Highly Efficient Speech Transmission

no code implementations25 May 2022 Tianxiao Han, Qianqian Yang, Zhiguo Shi, Shibo He, Zhaoyang Zhang

Deep learning (DL) based semantic communication methods have been explored for the efficient transmission of images, text, and speech in recent years.

speech-recognition Speech Recognition

Semantic-aware Speech to Text Transmission with Redundancy Removal

no code implementations7 Feb 2022 Tianxiao Han, Qianqian Yang, Zhiguo Shi, Shibo He, Zhaoyang Zhang

We also propose a two-stage training scheme, which speeds up the training of the proposed DL model.

Sufficient-Statistic Memory AMP

no code implementations31 Dec 2021 Lei Liu, Shunqi Huang, Yuzhi Yang, Zhaoyang Zhang, Brian M. Kurkoski

Given an arbitrary MAMP, we can construct an SS-MAMP by damping, which not only ensures the convergence of the state evolution, but also preserves the orthogonality, i. e., its dynamics can be correctly described by state evolution.

C-GRBFnet: A Physics-Inspired Generative Deep Neural Network for Channel Representation and Prediction

no code implementations5 Dec 2021 Zhuoran Xiao, Zhaoyang Zhang, Chongwen Huang, Xiaoming Chen, Caijun Zhong, Mérouane Debbah

Specifically, we first use a forward deep neural network to infer the positions of all possible images of the source reflected by the surrounding scatterers within that environment, and then use the well-known Gaussian Radial Basis Function network (GRBF) to approximate the amplitudes of all possible propagation paths.

Blind Channel Estimation for MIMO Systems via Variational Inference

no code implementations16 Nov 2021 Jiancheng Tang, Qianqian Yang, Zhaoyang Zhang

In this paper, we investigate the blind channel estimation problem for MIMO systems under Rayleigh fading channel.

Variational Inference

JMSNAS: Joint Model Split and Neural Architecture Search for Learning over Mobile Edge Networks

no code implementations16 Nov 2021 Yuqing Tian, Zhaoyang Zhang, Zhaohui Yang, Qianqian Yang

In this paper, a joint model split and neural architecture search (JMSNAS) framework is proposed to automatically generate and deploy a DNN model over a mobile edge network.

Neural Architecture Search

Communication-Efficient Federated Learning with Binary Neural Networks

1 code implementation5 Oct 2021 Yuzhi Yang, Zhaoyang Zhang, Qianqian Yang

{ Numerical results show that the proposed FL framework significantly reduces the communication cost compared to the conventional neural networks with typical real-valued parameters, and the performance loss incurred by the binarization can be further compensated by a hybrid method.

Binarization Federated Learning +1

Joint Multi-User Communication and Sensing Exploiting Both Signal and Environment Sparsity

no code implementations6 Sep 2021 Xin Tong, Zhaoyang Zhang, Jue Wang, Chongwen Huang, Merouane Debbah

As a potential technology feature for 6G wireless networks, the idea of sensing-communication integration requires the system not only to complete reliable multi-user communication but also to achieve accurate environment sensing.

object-detection Object Detection

BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch Whitening

no code implementations13 May 2021 Wenqi Shao, Hang Yu, Zhaoyang Zhang, Hang Xu, Zhenguo Li, Ping Luo

To address this problem, we develop a probability-based pruning algorithm, called batch whitening channel pruning (BWCP), which can stochastically discard unimportant channels by modeling the probability of a channel being activated.

FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation

1 code implementation15 Feb 2021 Chaofan Tao, Rui Lin, Quan Chen, Zhaoyang Zhang, Ping Luo, Ngai Wong

Prior arts often discretize the network weights by carefully tuning hyper-parameters of quantization (e. g. non-uniform stepsize and layer-wise bitwidths), which are complicated and sub-optimal because the full-precision and low-precision models have a large discrepancy.

Neural Network Compression Quantization

Multi-hop RIS-Empowered Terahertz Communications: A DRL-based Hybrid Beamforming Design

no code implementations22 Jan 2021 Chongwen Huang, Zhaohui Yang, George C. Alexandropoulos, Kai Xiong, Li Wei, Chau Yuen, Zhaoyang Zhang, Merouane Debbah

We investigate the joint design of digital beamforming matrix at the BS and analog beamforming matrices at the RISs, by leveraging the recent advances in deep reinforcement learning (DRL) to combat the propagation loss.

Distributed ADMM with Synergetic Communication and Computation

no code implementations29 Sep 2020 Zhuojun Tian, Zhaoyang Zhang, Jue Wang, Xiaoming Chen, Wei Wang, Huaiyu Dai

In this paper, we propose a novel distributed alternating direction method of multipliers (ADMM) algorithm with synergetic communication and computation, called SCCD-ADMM, to reduce the total communication and computation cost of the system.

Hybrid Beamforming for RIS-Empowered Multi-hop Terahertz Communications: A DRL-based Method

no code implementations20 Sep 2020 Chongwen Huang, Zhaohui Yang, George C. Alexandropoulos, Kai Xiong, Li Wei, Chau Yuen, Zhaoyang Zhang

Wireless communication in the TeraHertz band (0. 1--10 THz) is envisioned as one of the key enabling technologies for the future six generation (6G) wireless communication systems.

Channel Estimation for RIS-Empowered Multi-User MISO Wireless Communications

no code implementations4 Aug 2020 Li Wei, Chongwen Huang, George C. Alexandropoulos, Chau Yuen, Zhaoyang Zhang, Mérouane Debbah

We also discuss the downlink achievable sum rate computation with estimated channels and different precoding schemes for the base station.

AdaX: Adaptive Gradient Descent with Exponential Long Term Memory

1 code implementation21 Apr 2020 Wenjie Li, Zhaoyang Zhang, Xinjiang Wang, Ping Luo

Although adaptive optimization algorithms such as Adam show fast convergence in many machine learning tasks, this paper identifies a problem of Adam by analyzing its performance in a simple non-convex synthetic problem, showing that Adam's fast convergence would possibly lead the algorithm to local minimums.

Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks

no code implementations ICCV 2019 Zhaoyang Zhang, Jingyu Li, Wenqi Shao, Zhanglin Peng, Ruimao Zhang, Xiaogang Wang, Ping Luo

ResNeXt, still suffers from the sub-optimal performance due to manually defining the number of groups as a constant over all of the layers.

Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos

no code implementations15 Aug 2018 Zhaoyang Zhang, Zhanghui Kuang, Ping Luo, Litong Feng, Wei zhang

Secondly, TSD significantly reduces the computations to run video action recognition with compressed frames on the cloud, while maintaining high recognition accuracies.

Action Recognition In Videos Temporal Action Localization

Performance Evaluation of Channel Decoding With Deep Neural Networks

1 code implementation1 Nov 2017 Wei Lyu, Zhaoyang Zhang, Chunxu Jiao, Kangjian Qin, Huazi Zhang

With the demand of high data rate and low latency in fifth generation (5G), deep neural network decoder (NND) has become a promising candidate due to its capability of one-shot decoding and parallel computing.

Cannot find the paper you are looking for? You can Submit a new open access paper.