Search Results for author: Zhaoyang Zhang

Found 100 papers, 23 papers with code

Multi-View Wireless Sensing via Conditional Generative Learning: Framework and Model Design

no code implementations19 May 2025 Ziqing Xing, Zhaoyang Zhang, Zirui Chen, Hongning Ruan, Zhaohui Yang

In this paper, we incorporate physical knowledge into learning-based high-precision target sensing using the multi-view channel state information (CSI) between multiple base stations (BSs) and user equipment (UEs).

Computational Imaging-Based ISAC Method with Large Pixel Division

no code implementations12 May 2025 Xin Tong, Zhaoyang Zhang, Zhaohui Yang, Yu Ge, Henk Wymeersch

In this paper, a novel method is proposed to address such a problem in environment sensing in millimeter-wave wireless cellular networks, which effectively cancels the severe errors caused by large pixel division as in conventional computational imaging algorithms.

Integrated sensing and communication ISAC

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

no code implementations6 May 2025 Shiyi Zhang, Junhao Zhuang, Zhaoyang Zhang, Ying Shan, Yansong Tang

Unlike existing methods, FlexiAct allows for variations in layout, viewpoint, and skeletal structure between the subject of the reference video and the target image, while maintaining identity consistency.

Denoising

Communication-Efficient Personalized Distributed Learning with Data and Node Heterogeneity

no code implementations24 Apr 2025 Zhuojun Tian, Zhaoyang Zhang, Yiwei Li, Mehdi Bennis

In the proposed method, each local model is represented as the Hadamard product of global real-valued parameters and a personalized binary mask for pruning.

Cobra: Efficient Line Art COlorization with BRoAder References

no code implementations16 Apr 2025 Junhao Zhuang, Lingen Li, Xuan Ju, Zhaoyang Zhang, Chun Yuan, Ying Shan

The comic production industry requires reference-based line art colorization with high accuracy, efficiency, contextual consistency, and flexible control.

Image Generation Line Art Colorization

Fair Resource Allocation in UAV-based Semantic Communication System with Fluid Antenna

no code implementations8 Apr 2025 Liang Siyun, Chen Zhu, Zhaohui Yang, Changsheng You, Dusit Niyato, Kai-Kit Wong, Zhaoyang Zhang

In this paper, the problem of maximization of the minimum equivalent rate in a unmanned-aerial-vehicle (UAV)-based multi-user semantic communication system is investigated.

Semantic Communication Semantic Compression

WorldPrompter: Traversable Text-to-Scene Generation

no code implementations2 Apr 2025 Zhaoyang Zhang, Yannick Hold-Geoffroy, Miloš Hašan, Chen Ziwen, Fujun Luan, Julie Dorsey, Yiwei Hu

Scene-level 3D generation is a challenging research topic, with most existing methods generating only partial scenes and offering limited navigational freedom.

3D Generation Scene Generation +1

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

no code implementations17 Mar 2025 Yaowei Li, Lingen Li, Zhaoyang Zhang, Xiaoyu Li, Guangzhi Wang, Hongxiang Li, Xiaodong Cun, Ying Shan, Yuexian Zou

Element-level visual manipulation is essential in digital content creation, but current diffusion-based methods lack the precision and flexibility of traditional tools.

Computational Efficiency Data Augmentation +2

Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

no code implementations6 Mar 2025 Adnan Shahid, Adrian Kliks, Ahmed Al-Tahmeesschi, Ahmed Elbakary, Alexandros Nikou, Ali Maatouk, Ali Mokh, Amirreza Kazemi, Antonio De Domenico, Athanasios Karapantelakis, Bo Cheng, Bo Yang, Bohao Wang, Carlo Fischione, Chao Zhang, Chaouki Ben Issaid, Chau Yuen, Chenghui Peng, Chongwen Huang, Christina Chaccour, Christo Kurisummoottil Thomas, Dheeraj Sharma, Dimitris Kalogiros, Dusit Niyato, Eli de Poorter, Elissa Mhanna, Emilio Calvanese Strinati, Faouzi Bader, Fathi Abdeldayem, Fei Wang, Fenghao Zhu, Gianluca Fontanesi, Giovanni Geraci, Haibo Zhou, Hakimeh Purmehdi, Hamed Ahmadi, Hang Zou, Hongyang Du, Hoon Lee, Howard H. Yang, Iacopo Poli, Igor Carron, Ilias Chatzistefanidis, Inkyu Lee, Ioannis Pitsiorlas, Jaron Fontaine, Jiajun Wu, Jie Zeng, Jinan Li, Jinane Karam, Johny Gemayel, Juan Deng, Julien Frison, Kaibin Huang, Kehai Qiu, Keith Ball, Kezhi Wang, Kun Guo, Leandros Tassiulas, Lecorve Gwenole, Liexiang Yue, Lina Bariah, Louis Powell, Marcin Dryjanski, Maria Amparo Canaveras Galdon, Marios Kountouris, Maryam Hafeez, Maxime Elkael, Mehdi Bennis, Mehdi Boudjelli, Meiling Dai, Merouane Debbah, Michele Polese, Mohamad Assaad, Mohamed Benzaghta, Mohammad Al Refai, Moussab Djerrab, Mubeen Syed, Muhammad Amir, Na Yan, Najla Alkaabi, Nan Li, Nassim Sehad, Navid Nikaein, Omar Hashash, Pawel Sroka, Qianqian Yang, Qiyang Zhao, Rasoul Nikbakht Silab, Rex Ying, Roberto Morabito, Rongpeng Li, Ryad Madi, Salah Eddine El Ayoubi, Salvatore D'Oro, Samson Lasaulce, Serveh Shalmashi, Sige Liu, Sihem Cherrared, Swarna Bindu Chetty, Swastika Dutta, Syed A. R. Zaidi, Tianjiao Chen, Timothy Murphy, Tommaso Melodia, Tony Q. S. Quek, Vishnu Ram, Walid Saad, Wassim Hamidouche, Weilong Chen, Xiaoou Liu, Xiaoxue Yu, Xijun Wang, Xingyu Shang, Xinquan Wang, Xuelin Cao, Yang Su, Yanping Liang, Yansha Deng, Yifan Yang, Yingping Cui, Yu Sun, Yuxuan Chen, Yvan Pointurier, Zeinab Nehme, Zeinab Nezami, Zhaohui Yang, Zhaoyang Zhang, Zhe Liu, Zhenyu Yang, Zhu Han, Zhuang Zhou, Zihan Chen, Zirui Chen, Zitao Shuai

This white paper discusses the role of large-scale AI in the telecommunications industry, with a specific focus on the potential of generative AI to revolutionize network functions and user experiences, especially in the context of 6G systems.

Management

Semantic Communication with Entropy-and-Channel-Adaptive Rate Control over Multi-User MIMO Fading Channels

no code implementations26 Jan 2025 Weixuan Chen, Qianqian Yang, Yuhao Chen, Chongwen Huang, Qian Wang, Zehui Xiong, Zhaoyang Zhang

Although significant improvements in transmission efficiency have been achieved, existing semantic communication (SemCom) methods typically use a fixed transmission rate for varying channel conditions and transmission contents, leading to performance degradation under harsh channel conditions.

Semantic Communication

Comprehensive Subjective and Objective Evaluation Method for Text-generated Video

no code implementations15 Jan 2025 Zelu Qi, Ping Shi, Shuqi Wang, Zhaoyang Zhang, Zefeng Ying, Da Pan

Recent text-to-video (T2V) technology advancements, as demonstrated by models such as Gen3, Pika, and Sora, have significantly broadened its applicability and popularity.

Video Generation

Generative AI Empowered Semantic Feature Multiple Access (SFMA) Over Wireless Networks

no code implementations30 Dec 2024 Jiaxiang Wang, Yinchao Yang, Zhaohui Yang, Chongwen Huang, Mingzhe Chen, Zhaoyang Zhang, Mohammad Shikh-Bahaei

Second, we optimize inter-group power allocation by formulating an optimization problem that allocates proper transmit power across all user groups to maximize system sum rates while satisfying each user's minimum rate requirement.

Semantic Communication Video Frame Interpolation

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

1 code implementation24 Dec 2024 Minghong Cai, Xiaodong Cun, Xiaoyu Li, Wenze Liu, Zhaoyang Zhang, Yong Zhang, Ying Shan, Xiangyu Yue

Based on our careful design, the video generated by DiTCtrl achieves smooth transitions and consistent object motion given multiple sequential prompts without additional training.

Video Editing Video Generation

DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network

no code implementations22 Dec 2024 Xiangtian Li, Xiaobo Wang, Zhen Qi, Han Cao, Zhaoyang Zhang, Ao Xiang

Dynamic texture synthesis aims to generate sequences that are visually similar to a reference video texture and exhibit specific stationary properties in time.

Diversity Generative Adversarial Network +1

Consistent Human Image and Video Generation with Spatially Conditioned Diffusion

1 code implementation19 Dec 2024 Mingdeng Cao, Chong Mou, Ziyang Yuan, Xintao Wang, Zhaoyang Zhang, Ying Shan, Yinqiang Zheng

By fine-tuning existing base diffusion models on human video data, our method demonstrates strong generalization to unseen human identities and poses without requiring additional per-instance fine-tuning.

Computational Efficiency Denoising +1

ColorFlow: Retrieval-Augmented Image Sequence Colorization

no code implementations16 Dec 2024 Junhao Zhuang, Xuan Ju, Zhaoyang Zhang, Yong liu, Shiyi Zhang, Chun Yuan, Ying Shan

Automatic black-and-white image sequence colorization while preserving character and object identity (ID) is a complex task with significant market demand, such as in cartoon or comic series colorization.

Colorization Image Colorization +2

BrushEdit: All-In-One Image Inpainting and Editing

no code implementations13 Dec 2024 Yaowei Li, Yuxuan Bian, Xuan Ju, Zhaoyang Zhang, Junhao Zhuang, Ying Shan, Yuexian Zou, Qiang Xu

Image editing has advanced significantly with the development of diffusion models using both inversion-based and instruction-based methods.

All Image Inpainting

Towards Wireless-Native Big AI Model: Insights into Its Ambitions, Peculiarities and Methodologies

no code implementations12 Dec 2024 Zirui Chen, Zhaoyang Zhang, Chenyu Liu, Ziqing Xing

Researches on leveraging big artificial intelligence model (BAIM) technology to drive the intelligent evolution of wireless networks are emerging.

Implicit Neural Compression of Point Clouds

no code implementations11 Dec 2024 Hongning Ruan, Yulin Shao, Qianqian Yang, Liang Zhao, Zhaoyang Zhang, Dusit Niyato

Our approach employs two coordinate-based neural networks to implicitly represent a voxelized point cloud: the first determines the occupancy status of a voxel, while the second predicts the attributes of occupied voxels.

Attribute

Mitigating Knowledge Conflicts in Language Model-Driven Question Answering

no code implementations18 Nov 2024 Han Cao, Zhaoyang Zhang, Xiangtian Li, Chufan Wu, Hansong Zhang, Wenqing Zhang

In the context of knowledge-driven seq-to-seq generation tasks, such as document-based question answering and document summarization systems, two fundamental knowledge sources play crucial roles: the inherent knowledge embedded within model parameters and the external knowledge obtained through context.

Document Summarization Hallucination +3

Artistic Neural Style Transfer Algorithms with Activation Smoothing

no code implementations12 Nov 2024 Xiangtian Li, Han Cao, Zhaoyang Zhang, Jiacheng Hu, Yuhui Jin, Zihao Zhao

The works of Gatys et al. demonstrated the capability of Convolutional Neural Networks (CNNs) in creating artistic style images.

Style Transfer

Sampling-guided Heterogeneous Graph Neural Network with Temporal Smoothing for Scalable Longitudinal Data Imputation

no code implementations7 Nov 2024 Zhaoyang Zhang, Ziqi Chen, Qiao Liu, Jinhan Xie, Hongtu Zhu

In this paper, we propose a novel framework, the Sampling-guided Heterogeneous Graph Neural Network (SHT-GNN), to effectively tackle the challenge of missing data imputation in longitudinal studies.

Computational Efficiency Graph Neural Network +1

Enhancing Missing Data Imputation through Combined Bipartite Graph and Complete Directed Graph

no code implementations7 Nov 2024 Zhaoyang Zhang, Hongtu Zhu, Ziqi Chen, Yingjie Zhang, Hai Shu

In this paper, we aim to address a significant challenge in the field of missing data imputation: identifying and leveraging the interdependencies among features to enhance missing data imputation for tabular data.

Graph Neural Network Imputation

Research on Key Technologies for Cross-Cloud Federated Training of Large Language Models

no code implementations24 Oct 2024 Haowei Yang, Mingxiu Sui, Shaobo Liu, Xinyue Qian, Zhaoyang Zhang, Bingying Liu

With the rapid development of natural language processing technology, large language models have demonstrated exceptional performance in various application scenarios.

MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls

1 code implementation30 Jul 2024 Yuxuan Bian, Ailing Zeng, Xuan Ju, Xian Liu, Zhaoyang Zhang, Wei Liu, Qiang Xu

However, employing a unified model to achieve various generation tasks with different condition modalities presents two main challenges: motion distribution drifts across different tasks (e. g., co-speech gestures and text-driven daily actions) and the complex optimization of mixed conditions with varying granularities (e. g., text and audio).

Gesture Generation Motion Generation +2

Image Inpainting Models are Effective Tools for Instruction-guided Image Editing

no code implementations18 Jul 2024 Xuan Ju, Junhao Zhuang, Zhaoyang Zhang, Yuxuan Bian, Qiang Xu, Ying Shan

The most advanced methods, such as SmartEdit and MGIE, usually combine large language models with diffusion models through joint training, where the former provides text understanding ability, and the latter provides image generation ability.

Image Inpainting

FedsLLM: Federated Split Learning for Large Language Models over Communication Networks

no code implementations12 Jul 2024 Kai Zhao, Zhaohui Yang, Chongwen Huang, Xiaoming Chen, Zhaoyang Zhang

This paper models the minimization of the training delay by integrating computation and communication optimization, simplifying the optimization problem into a convex problem to find the optimal solution.

LEMMA

MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions

2 code implementations8 Jul 2024 Xuan Ju, Yiming Gao, Zhaoyang Zhang, Ziyang Yuan, Xintao Wang, Ailing Zeng, Yu Xiong, Qiang Xu, Ying Shan

Sora's high-motion intensity and long consistent videos have significantly impacted the field of video generation, attracting unprecedented attention.

Video Alignment Video Generation

Joint Beamforming and Antenna Design for Near-Field Fluid Antenna System

no code implementations8 Jul 2024 Yixuan Chen, Mingzhe Chen, Hao Xu, Zhaohui Yang, Kai-Kit Wong, Zhaoyang Zhang

In this letter, we study the energy efficiency maximization problem for a fluid antenna system (FAS) in near field communications.

Position

Image Conductor: Precision Control for Interactive Video Synthesis

no code implementations21 Jun 2024 Yaowei Li, Xintao Wang, Zhaoyang Zhang, Zhouxia Wang, Ziyang Yuan, Liangbin Xie, Yuexian Zou, Ying Shan

To this end, we propose Image Conductor, a method for precise control of camera transitions and object movements to generate video assets from a single image.

Object

Exploring Channel Estimation and Signal Detection for ODDM-based ISAC Systems

no code implementations1 Jun 2024 Dezhi Wang, Chongwen Huang, Lei Liu, Xiaoming Chen, Wei Wang, Zhaoyang Zhang, Chau Yuen, Mérouane Debbah

Inspired by providing reliable communications for high-mobility scenarios, in this letter, we investigate the channel estimation and signal detection in integrated sensing and communication~(ISAC) systems based on the orthogonal delay-Doppler multiplexing~(ODDM) modulation, which consists of a pulse-train that can achieve the orthogonality with respect to the resolution of the delay-Doppler~(DD) plane.

Integrated sensing and communication ISAC

VBIM-Net: Variational Born Iterative Network for Inverse Scattering Problems

no code implementations29 May 2024 Ziqing Xing, Zhaoyang Zhang, Zirui Chen, Yusong Wang, Haoran Ma, Zhun Wei

In this article, we propose a novel Variational Born Iterative Network, namely, VBIM-Net, to solve the full-wave ISPs with significantly improved structural rationality and inversion quality.

All-day Depth Completion

no code implementations27 May 2024 Vadim Ezhov, Hyoungseob Park, Zhaoyang Zhang, Rishi Upadhyay, Howard Zhang, Chethan Chinder Chandrappa, Achuta Kadambi, Yunhao Ba, Julie Dorsey, Alex Wong

In poorly illuminated regions where photometric intensities do not afford the inference of local shape, the coarse approximation of scene depth serves as a prior; the uncertainty map is then used with the image to guide refinement through an uncertainty-driven residual learning (URL) scheme.

All Depth Completion +2

ReVideo: Remake a Video with Motion and Content Control

no code implementations22 May 2024 Chong Mou, Mingdeng Cao, Xintao Wang, Zhaoyang Zhang, Ying Shan, Jian Zhang

In this paper, we present a novel attempt to Remake a Video (ReVideo) which stands out from existing methods by allowing precise video editing in specific areas through the specification of both content and motion.

Video Editing Video Generation

Evolving Semantic Communication with Generative Model

1 code implementation29 Mar 2024 Shunpu Tang, Qianqian Yang, Deniz Gündüz, Zhaoyang Zhang

In this paper, we explore an evolving semantic communication system for image transmission, referred to as ESemCom, with the capability to continuously enhance transmission efficiency.

model Semantic Communication

Homogeneous Tokenizer Matters: Homogeneous Visual Tokenizer for Remote Sensing Image Understanding

1 code implementation27 Mar 2024 Run Shao, Zhaoyang Zhang, Chao Tao, Yunsheng Zhang, Chengli Peng, Haifeng Li

Compared to Patch Embed, which requires more than one hundred tokens for one image, HOOK requires only 6 and 8 tokens for sparse and dense tasks, respectively, resulting in efficiency improvements of 1. 5 to 2. 8 times.

Language Modelling Large Language Model

TernaryVote: Differentially Private, Communication Efficient, and Byzantine Resilient Distributed Optimization on Heterogeneous Data

no code implementations16 Feb 2024 Richeng Jin, Yujie Gu, Kai Yue, Xiaofan He, Zhaoyang Zhang, Huaiyu Dai

In this paper, we propose TernaryVote, which combines a ternary compressor and the majority vote mechanism to realize differential privacy, gradient compression, and Byzantine resilience simultaneously.

Distributed Optimization

Channel Mapping Based on Interleaved Learning with Complex-Domain MLP-Mixer

no code implementations7 Jan 2024 Zirui Chen, Zhaoyang Zhang, Zhaohui Yang, Lei Liu

For such a channel mapping task, inspired by the intrinsic coupling across the space and frequency domains, this letter proposes to use interleaved learning with partial antenna and subcarrier characteristics to represent the whole MIMO-OFDM channel.

Representation Learning

Point Cloud in the Air

no code implementations1 Jan 2024 Yulin Shao, Chenghong Bian, Li Yang, Qianqian Yang, Zhaoyang Zhang, Deniz Gunduz

Acquisition and processing of point clouds (PCs) is a crucial enabler for many emerging applications reliant on 3D spatial data, such as robot navigation, autonomous vehicles, and augmented reality.

Autonomous Vehicles Robot Navigation

Cached Transformers: Improving Transformers with Differentiable Memory Cache

1 code implementation20 Dec 2023 Zhaoyang Zhang, Wenqi Shao, Yixiao Ge, Xiaogang Wang, Jinwei Gu, Ping Luo

This work introduces a new Transformer model called Cached Transformer, which uses Gated Recurrent Cached (GRC) attention to extend the self-attention mechanism with a differentiable memory cache of tokens.

Image Classification Instance Segmentation +7

Robust Target Detection of Intelligent Integrated Optical Camera and mmWave Radar System

no code implementations12 Dec 2023 Chen Zhu, Zhouxiang Zhao, Zejing Shan, Lijie Yang, Sijie Ji, Zhaohui Yang, Zhaoyang Zhang

To improve the target detection performance under complex real-world scenarios, this paper proposes an intelligent integrated optical camera and millimeter-wave (mmWave) radar system.

AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion

1 code implementation16 Oct 2023 Yitong Jiang, Zhaoyang Zhang, Tianfan Xue, Jinwei Gu

Specifically, AutoDIR consists of two key stages: a Blind Image Quality Assessment (BIQA) stage based on a semantic-agnostic vision-language model which automatically detects unknown image degradations for input images, an All-in-One Image Restoration (AIR) stage utilizes structural-corrected latent diffusion which handles multiple types of image degradations.

All Image Restoration +2

Semantic Information Extraction for Text Data with Probability Graph

no code implementations16 Sep 2023 Zhouxiang Zhao, Zhaohui Yang, Ye Hu, Licheng Lin, Zhaoyang Zhang

In this paper, the problem of semantic information extraction for resource constrained text data transmission is studied.

Semantic Similarity Semantic Textual Similarity

Mean Field Game-based Waveform Precoding Design for Mobile Crowd Integrated Sensing, Communication, and Computation Systems

no code implementations6 Sep 2023 Dezhi Wang, Chongwen Huang, Jiguang He, Xiaoming Chen, Wei Wang, Zhaoyang Zhang, Zhu Han, Mérouane Debbah

In this paper, we consider the environment sensing problem in the large-scale mobile crowd ISCC systems and propose an efficient waveform precoding design algorithm based on the mean field game~(MFG).

Integrated sensing and communication ISAC

Deep Joint Source-Channel Coding for Wireless Image Transmission with Entropy-Aware Adaptive Rate Control

no code implementations5 Jun 2023 Weixuan Chen, Yuhao Chen, Qianqian Yang, Chongwen Huang, Qian Wang, Zhaoyang Zhang

Adaptive rate control for deep joint source and channel coding (JSCC) is considered as an effective approach to transmit sufficient information in scenarios with limited communication resources.

MIMO Precoding Design with QoS and Per-Antenna Power Constraints

no code implementations4 Jun 2023 Kaiyi Chi, Yingzhi Huang, Qianqian Yang, Zhaohui Yang, Zhaoyang Zhang

Precoding design for the downlink of multiuser multiple-input multiple-output (MU-MIMO) systems is a fundamental problem.

Distributed Learning over Networks with Graph-Attention-Based Personalization

1 code implementation22 May 2023 Zhuojun Tian, Zhaoyang Zhang, Zhaohui Yang, Richeng Jin, Huaiyu Dai

In conventional distributed learning over a network, multiple agents collaboratively build a common machine learning model.

Graph Attention

Semantic-aware Digital Twin for Metaverse: A Comprehensive Review

no code implementations12 May 2023 Senthil Kumar Jagatheesaperumal, Zhaohui Yang, Qianqian Yang, Chongwen Huang, Wei Xu, Mohammad Shikh-Bahaei, Zhaoyang Zhang

To facilitate the deployment of digital twins in Metaverse, the paradigm with semantic awareness has been proposed as a means for enabling accurate and task-oriented information extraction with inherent intelligence.

Management Semantic Communication

Musketeer: Joint Training for Multi-task Vision Language Model with Task Explanation Prompts

1 code implementation11 May 2023 Zhaoyang Zhang, Yantao Shen, Kunyu Shi, Zhaowei Cai, Jun Fang, Siqi Deng, Hao Yang, Davide Modolo, Zhuowen Tu, Stefano Soatto

We present a vision-language model whose parameters are jointly trained on all tasks and fully shared among multiple heterogeneous tasks which may interfere with each other, resulting in a single model which we named Musketeer.

Language Modeling Language Modelling

From Data-driven Learning to Physics-inspired Inferring: A Novel Mobile MIMO Channel Prediction Scheme Based on Neural ODE

no code implementations9 Apr 2023 Zhuoran Xiao, Zhaoyang Zhang, Zirui Chen, Zhaohui Yang, Chongwen Huang, Xiaoming Chen

Then, we design a novel physics-inspired spatial channel gradient network (SCGnet), which represents the derivative process of channel varying as a special neural network and can obtain the gradients at any relative displacement needed for the ODE solving.

Prediction

Real-time Controllable Denoising for Image and Video

1 code implementation CVPR 2023 Zhaoyang Zhang, Yitong Jiang, Wenqi Shao, Xiaogang Wang, Ping Luo, Kaimo Lin, Jinwei Gu

Controllable image denoising aims to generate clean samples with human perceptual priors and balance sharpness and smoothness.

Image Denoising Video Denoising

Robust Millimeter Beamforming via Self-Supervised Hybrid Deep Learning

no code implementations9 Mar 2023 Fenghao Zhu, Bohao Wang, Zhaohui Yang, Chongwen Huang, Zhaoyang Zhang, George C. Alexandropoulos, Chau Yuen, Merouane Debbah

Beamforming with large-scale antenna arrays has been widely used in recent years, which is acknowledged as an important part in 5G and incoming 6G.

Deep Learning

Breaking the Communication-Privacy-Accuracy Tradeoff with $f$-Differential Privacy

no code implementations NeurIPS 2023 Richeng Jin, Zhonggen Su, Caijun Zhong, Zhaoyang Zhang, Tony Quek, Huaiyu Dai

We consider a federated data analytics problem in which a server coordinates the collaborative data analysis of multiple users with privacy concerns and limited communication capability.

Data Compression Federated Learning

Distributed Machine Learning for UAV Swarms: Computing, Sensing, and Semantics

no code implementations3 Jan 2023 Yahao Ding, Zhaohui Yang, Quoc-Viet Pham, Zhaoyang Zhang, Mohammad Shikh-Bahaei

In this survey, we first introduce several popular DL algorithms such as federated learning (FL), multi-agent Reinforcement Learning (MARL), distributed inference, and split learning, and present a comprehensive overview of their applications for UAV swarms, such as trajectory design, power control, wireless resource allocation, user assignment, perception, and satellite communications.

Federated Learning Multi-agent Reinforcement Learning +1

WAIR-D: Wireless AI Research Dataset

no code implementations5 Dec 2022 Yourui Huangfu, Jian Wang, Shengchen Dai, Rong Li, Jun Wang, Chongwen Huang, Zhaoyang Zhang

The statistical data hinder the trained AI models from further fine-tuning for a specific scenario, and ray-tracing data with limited environments lower down the generalization capability of the trained AI models.

Intelligent Communication

Holographic MIMO Communications: Theoretical Foundations, Enabling Technologies, and Future Directions

no code implementations2 Dec 2022 Tierui Gong, Panagiotis Gavriilidis, Ran Ji, Chongwen Huang, George C. Alexandropoulos, Li Wei, Zhaoyang Zhang, Mérouane Debbah, H. Vincent Poor, Chau Yuen

In this survey, we present a comprehensive overview of the latest advances in the HMIMO communications paradigm, with a special focus on their physical aspects, their theoretical foundations, as well as the enabling technologies for HMIMO systems.

Generative Model Based Highly Efficient Semantic Communication Approach for Image Transmission

no code implementations18 Nov 2022 Tianxiao Han, Jiancheng Tang, Qianqian Yang, Yiping Duan, Zhaoyang Zhang, Zhiguo Shi

Deep learning (DL) based semantic communication methods have been explored to transmit images efficiently in recent years.

Semantic Communication

False: False Negative Samples Aware Contrastive Learning for Semantic Segmentation of High-Resolution Remote Sensing Image

2 code implementations15 Nov 2022 Zhaoyang Zhang, Xuying Wang, Xiaoming Mei, Chao Tao, Haifeng Li

This indicates that the SSCL model has the ability to self-differentiate FNS and that the FALSE effectively mitigates the SCI in self-supervised contrastive learning.

Contrastive Learning Segmentation +1

Over-the-Air Split Learning with MIMO-Based Neural Network and Constellation-Based Activation

no code implementations8 Oct 2022 Yuzhi Yang, Zhaoyang Zhang, Zhaohui Yang

The precoding and combining matrices are trainable parameters in such a system, whereas the MIMO channel is implicit.

Over-the-Air Split Machine Learning in Wireless MIMO Networks

no code implementations7 Oct 2022 Yuzhi Yang, Zhaoyang Zhang, Yuqing Tian, Zhaohui Yang, Chongwen Huang, Caijun Zhong, Kai-Kit Wong

In such a split ML system, the precoding and combining matrices are regarded as trainable parameters, while MIMO channel matrix is regarded as unknown (implicit) parameters.

Deep Learning-Based Rate-Splitting Multiple Access for Reconfigurable Intelligent Surface-Aided Tera-Hertz Massive MIMO

no code implementations18 Sep 2022 Minghui Wu, Zhen Gao, Yang Huang, Zhenyu Xiao, Derrick Wing Kwan Ng, Zhaoyang Zhang

Then, to acquire accurate CSI at the BS for the investigated RSMA precoding scheme to achieve higher spectral efficiency, we propose a CSI acquisition network (CAN) with low pilot and feedback signaling overhead, where the downlink pilot transmission, CSI feedback at the user equipments (UEs), and CSI reconstruction at the BS are modeled as an end-to-end neural network based on Transformer.

Mobile MIMO Channel Prediction with ODE-RNN: a Physics-Inspired Adaptive Approach

no code implementations8 Jul 2022 Zhuoran Xiao, Zhaoyang Zhang, Zirui Chen, Zhaohui Yang, Richeng Jin

Through exploring the intrinsic correlation among a set of historical CSI instances randomly obtained in a certain communication environment, channel prediction can significantly increase CSI accuracy and save signaling overhead.

Prediction

Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space

1 code implementation7 Jul 2022 Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo

It is challenging because the ground-truth model ranking for each task can only be generated by fine-tuning the pre-trained models on the target dataset, which is brute-force and computationally expensive.

All Transferability

Environment Sensing Considering the Occlusion Effect: A Multi-View Approach

no code implementations2 Jul 2022 Xin Tong, Zhaoyang Zhang, Yihan Zhang, Zhaohui Yang, Chongwen Huang, Kai-Kit Wong, Merouane Debbah

In this paper, we consider the problem of sensing the environment within a wireless cellular framework.

Semantic-preserved Communication System for Highly Efficient Speech Transmission

no code implementations25 May 2022 Tianxiao Han, Qianqian Yang, Zhiguo Shi, Shibo He, Zhaoyang Zhang

Deep learning (DL) based semantic communication methods have been explored for the efficient transmission of images, text, and speech in recent years.

Semantic Communication speech-recognition +2

Sufficient-Statistic Memory AMP

no code implementations31 Dec 2021 Lei Liu, Shunqi Huang, Yuzhi Yang, Zhaoyang Zhang, Brian M. Kurkoski

Given an arbitrary MAMP, we can construct an SS-MAMP by damping, which not only ensures the convergence of the state evolution, but also preserves the orthogonality, i. e., its dynamics can be correctly described by state evolution.

C-GRBFnet: A Physics-Inspired Generative Deep Neural Network for Channel Representation and Prediction

no code implementations5 Dec 2021 Zhuoran Xiao, Zhaoyang Zhang, Chongwen Huang, Xiaoming Chen, Caijun Zhong, Mérouane Debbah

Specifically, we first use a forward deep neural network to infer the positions of all possible images of the source reflected by the surrounding scatterers within that environment, and then use the well-known Gaussian Radial Basis Function network (GRBF) to approximate the amplitudes of all possible propagation paths.

JMSNAS: Joint Model Split and Neural Architecture Search for Learning over Mobile Edge Networks

no code implementations16 Nov 2021 Yuqing Tian, Zhaoyang Zhang, Zhaohui Yang, Qianqian Yang

In this paper, a joint model split and neural architecture search (JMSNAS) framework is proposed to automatically generate and deploy a DNN model over a mobile edge network.

Neural Architecture Search

Blind Channel Estimation for MIMO Systems via Variational Inference

no code implementations16 Nov 2021 Jiancheng Tang, Qianqian Yang, Zhaoyang Zhang

In this paper, we investigate the blind channel estimation problem for MIMO systems under Rayleigh fading channel.

Variational Inference

Communication-Efficient Federated Learning with Binary Neural Networks

1 code implementation5 Oct 2021 Yuzhi Yang, Zhaoyang Zhang, Qianqian Yang

{ Numerical results show that the proposed FL framework significantly reduces the communication cost compared to the conventional neural networks with typical real-valued parameters, and the performance loss incurred by the binarization can be further compensated by a hybrid method.

Binarization Federated Learning +1

Joint Multi-User Communication and Sensing Exploiting Both Signal and Environment Sparsity

no code implementations6 Sep 2021 Xin Tong, Zhaoyang Zhang, Jue Wang, Chongwen Huang, Merouane Debbah

As a potential technology feature for 6G wireless networks, the idea of sensing-communication integration requires the system not only to complete reliable multi-user communication but also to achieve accurate environment sensing.

object-detection Object Detection

BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch Whitening

no code implementations13 May 2021 Wenqi Shao, Hang Yu, Zhaoyang Zhang, Hang Xu, Zhenguo Li, Ping Luo

To address this problem, we develop a probability-based pruning algorithm, called batch whitening channel pruning (BWCP), which can stochastically discard unimportant channels by modeling the probability of a channel being activated.

FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation

1 code implementation15 Feb 2021 Chaofan Tao, Rui Lin, Quan Chen, Zhaoyang Zhang, Ping Luo, Ngai Wong

Prior arts often discretize the network weights by carefully tuning hyper-parameters of quantization (e. g. non-uniform stepsize and layer-wise bitwidths), which are complicated and sub-optimal because the full-precision and low-precision models have a large discrepancy.

Neural Network Compression Quantization

Multi-hop RIS-Empowered Terahertz Communications: A DRL-based Hybrid Beamforming Design

no code implementations22 Jan 2021 Chongwen Huang, Zhaohui Yang, George C. Alexandropoulos, Kai Xiong, Li Wei, Chau Yuen, Zhaoyang Zhang, Merouane Debbah

We investigate the joint design of digital beamforming matrix at the BS and analog beamforming matrices at the RISs, by leveraging the recent advances in deep reinforcement learning (DRL) to combat the propagation loss.

Deep Reinforcement Learning

Distributed ADMM with Synergetic Communication and Computation

no code implementations29 Sep 2020 Zhuojun Tian, Zhaoyang Zhang, Jue Wang, Xiaoming Chen, Wei Wang, Huaiyu Dai

In this paper, we propose a novel distributed alternating direction method of multipliers (ADMM) algorithm with synergetic communication and computation, called SCCD-ADMM, to reduce the total communication and computation cost of the system.

Hybrid Beamforming for RIS-Empowered Multi-hop Terahertz Communications: A DRL-based Method

no code implementations20 Sep 2020 Chongwen Huang, Zhaohui Yang, George C. Alexandropoulos, Kai Xiong, Li Wei, Chau Yuen, Zhaoyang Zhang

Wireless communication in the TeraHertz band (0. 1--10 THz) is envisioned as one of the key enabling technologies for the future six generation (6G) wireless communication systems.

Deep Reinforcement Learning

Channel Estimation for RIS-Empowered Multi-User MISO Wireless Communications

no code implementations4 Aug 2020 Li Wei, Chongwen Huang, George C. Alexandropoulos, Chau Yuen, Zhaoyang Zhang, Mérouane Debbah

We also discuss the downlink achievable sum rate computation with estimated channels and different precoding schemes for the base station.

AdaX: Adaptive Gradient Descent with Exponential Long Term Memory

1 code implementation21 Apr 2020 Wenjie Li, Zhaoyang Zhang, Xinjiang Wang, Ping Luo

Although adaptive optimization algorithms such as Adam show fast convergence in many machine learning tasks, this paper identifies a problem of Adam by analyzing its performance in a simple non-convex synthetic problem, showing that Adam's fast convergence would possibly lead the algorithm to local minimums.

Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks

no code implementations ICCV 2019 Zhaoyang Zhang, Jingyu Li, Wenqi Shao, Zhanglin Peng, Ruimao Zhang, Xiaogang Wang, Ping Luo

ResNeXt, still suffers from the sub-optimal performance due to manually defining the number of groups as a constant over all of the layers.

Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos

no code implementations15 Aug 2018 Zhaoyang Zhang, Zhanghui Kuang, Ping Luo, Litong Feng, Wei zhang

Secondly, TSD significantly reduces the computations to run video action recognition with compressed frames on the cloud, while maintaining high recognition accuracies.

Action Recognition In Videos Temporal Action Localization

Performance Evaluation of Channel Decoding With Deep Neural Networks

1 code implementation1 Nov 2017 Wei Lyu, Zhaoyang Zhang, Chunxu Jiao, Kangjian Qin, Huazi Zhang

With the demand of high data rate and low latency in fifth generation (5G), deep neural network decoder (NND) has become a promising candidate due to its capability of one-shot decoding and parallel computing.

Decoder

Cannot find the paper you are looking for? You can Submit a new open access paper.