Search Results for author: Li Yu

Found 74 papers, 18 papers with code

Multi-Modal Large Models Based Beam Prediction: An Example Empowered by DeepSeek

no code implementations • 6 Jun 2025 • Yizhu Zhao, Li Yu, Lianzheng Shi, Jianhua Zhang, Guangyi Liu

Additionally, it demonstrates few-shot generalization on a real-world dataset, achieving 72.7% Top-1 accuracy and 92.4% Top-3 accuracy with only 30% of the dataset, outperforming the existing small models by over 15%.

Beam Prediction Prediction

A Unified RCS Modeling of Typical Targets for 3GPP ISAC Channel Standardization and Experimental Analysis

no code implementations • 27 May 2025 • Yuxiang Zhang, Jianhua Zhang, Xidong Hu, Jiwei Zhang, Hongbo Xing, Huiwen Gong, Shilin Luo, Yifeng Xiong, Li Yu, Zhiqing Yuan, Guangyi Liu, Tao Jiang

Accurate radar cross section (RCS) modeling is crucial for characterizing target scattering and improving the precision of Integrated Sensing and Communication (ISAC) channel modeling.

Integrated sensing and communication ISAC

A Unified Deterministic Channel Model for Multi-Type RIS with Reflective, Transmissive, and Polarization Operations

no code implementations • 12 May 2025 • Yuxiang Zhang, Jianhua Zhang, Zhengfu Zhou, Huiwen Gong, Hongbo Xing, Zhiqiang Yuan, Lei Tian, Li Yu, Guangyi Liu, Tao Jiang

RIS can be categorized into multiple types based on their reflective/transmissive modes and polarization control capabilities, all of which are expected to be widely deployed in practical environments.

Fairness

Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction

no code implementations • 19 Apr 2025 • Li Yu, Xuanzhe Sun, Wei Zhou, Moncef Gabbouj

Therefore, we attempt to simultaneously analyze visual, auditory, and textual modalities in this paper, and propose TAVDiff, a Text-Audio-Visual-conditioned Diffusion Model for video saliency prediction.

Denoising Image Generation +4

DVLTA-VQA: Decoupled Vision-Language Modeling with Text-Guided Adaptation for Blind Video Quality Assessment

no code implementations • 16 Apr 2025 • Li Yu, Situo Wang, Wei Zhou, Moncef Gabbouj

Inspired by the dual-stream theory of the human visual system (HVS), in which the ventral stream is responsible for object recognition and detail analysis while the dorsal stream focuses on spatial relationships and motion perception, an increasing number of video quality assessment (VQA) works built upon this framework have been proposed.

Language Modeling +3

FANeRV: Frequency Separation and Augmentation based Neural Representation for Video

no code implementations • 9 Apr 2025 • Li Yu, Zhihui Li, Chao Yao, Jimin Xiao, Moncef Gabbouj

Neural representations for video (NeRV) have gained considerable attention for their strong performance across various video tasks.

Video Compression

Covariance-Intersection-based Distributed Kalman Filtering: Stability Problems Revisited

no code implementations • 8 Apr 2025 • Zhongyao Hu, Bo Chen, Chao Sun, Li Yu

For the periodic time-varying case, it is proved by a monotonicity analysis method that CI-based distributed Kalman filtering converges periodically for any initial condition.

FedPCA: Noise-Robust Fair Federated Learning via Performance-Capacity Analysis

no code implementations • 13 Mar 2025 • Nannan Wu, Zengqiang Yan, Nong Sang, Li Yu, Chang Wen Chen

In this paper, we attribute this competition to the homogeneity in loss patterns exhibited by rare and mislabeled data clients, preventing existing loss-based fair and robust FL methods from effectively distinguishing and handling these two distinct client types.

Attribute Fairness +1

Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching

1 code implementation • 12 Mar 2025 • Nannan Wu, Zhuo Kuang, Zengqiang Yan, Ping Wang, Li Yu

Existing fair federated learning methods have demonstrated some effectiveness in solving this problem by aligning a single 0th- or 1st-order state of convergence (e.g., training loss or sharpness).

Fairness Federated Learning +3

Distributed Zonotopic Fusion Estimation for Multi-sensor Systems

no code implementations • 25 Feb 2025 • Yuchen Zhang, Bo Chen, Zheming Wang, Wen-An Zhang, Li Yu, Lei Guo

To reduce the conservatism of the DZFE with optimal parameters, we enhance our approach with an improved zonotope fusion criterion, which further improves the estimation performance of this DZFE by constructing tight strips for the intersection.

Road to 6G Digital Twin Networks: Multi-Task Adaptive Ray-Tracing as a Key Enabler

no code implementations • 20 Feb 2025 • Li Yu, Yinghe Miao, Jianhua Zhang, Shaoyi Liu, Yuxiang Zhang, Guangyi Liu

As a virtual, synchronized replica of the physical network, the digital twin network (DTN) is envisioned to sense, predict, optimize and manage the intricate wireless technologies and architectures brought by 6G.

Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation

1 code implementation • 17 Dec 2024 • Dongyue Wu, Zilin Guo, Li Yu, Nong Sang, Changxin Gao

Within this framework, we introduce a spatial-aware redundancy metric based on feature maps, thus endowing the pruning process with location sensitivity to better adapt to pruning segmentation networks.

Image Classification +2

CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection

no code implementations • 13 Dec 2024 • Qibo Chen, Weizhong Jin, Jianyue Ge, Mengdi Liu, Yuchao Yan, Jian Jiang, Li Yu, Xuanjiang Guo, Shuchang Li, Jianzhong Chen

In addition to text prompts, we have designed two practical concept prompt generation methods, visual prompt and optimized prompt, to extract abstract concepts through concrete visual examples and stably reduce alignment bias in downstream tasks.

Ranked #1 on Zero-Shot Object Detection on LVIS v1.0 val (using extra training data)

object-detection Zero-Shot Object Detection

Multi-Modal Environmental Sensing Based Path Loss Prediction for V2I Communications

no code implementations • 10 Dec 2024 • Kai Wang, Li Yu, Jianhua Zhang, Yixuan Tian, Eryu Guo, Guangyi Liu

The stability and reliability of wireless data transmission in vehicular networks face significant challenges due to the high dynamics of path loss caused by the complexity of rapidly changing environments.

Relevance-guided Audio Visual Fusion for Video Saliency Prediction

no code implementations • 18 Nov 2024 • Li Yu, Xuanzhe Sun, Pan Gao, Moncef Gabbouj

Audio data, often synchronized with video frames, plays a crucial role in guiding the audience's visual attention.

Prediction Saliency Prediction +1

Multi-task Feature Enhancement Network for No-Reference Image Quality Assessment

no code implementations • 12 Nov 2024 • Li Yu

First, existing methods have not explicitly exploited texture details, which significantly influence the image quality.

NR-IQA

High-Frequency Enhanced Hybrid Neural Representation for Video Compression

no code implementations • 11 Nov 2024 • Li Yu, Zhihui Li, Jimin Xiao, Moncef Gabbouj

Next, we design the High-Frequency Feature Modulation (HFM) block, which leverages the extracted high-frequency embeddings to enhance the fitting process of the decoder.

Decoder Video Compression

ChannelGPT: A Large Model to Generate Digital Twin Channel for 6G Environment Intelligence

no code implementations • 17 Oct 2024 • Li Yu, Lianzheng Shi, Jianhua Zhang, Jialin Wang, Zhen Zhang, Yuxiang Zhang, Guangyi Liu

In practice, we also establish a ChannelGPT prototype to generate high-fidelity channel data for varied scenarios to validate the accuracy and generalization ability based on environment intelligence.

Bridging Domain Gap of Point Cloud Representations via Self-Supervised Geometric Augmentation

no code implementations • 11 Sep 2024 • Li Yu, Hongchao Zhong, Longkun Zou, Ke Chen, Pan Gao

Recent progress in semantic point cloud analysis is largely driven by synthetic data (e.g., ModelNet and ShapeNet), which are typically complete, well-aligned and noise-free.

Point Cloud Classification Representation Learning +2

MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation

1 code implementation • 25 Aug 2024 • Chaowei Chen, Li Yu, Shiquan Min, Shunfang Wang

State Space Models (SSMs), especially Mamba, have shown great promise in medical image segmentation due to their ability to model long-range dependencies with linear computational complexity.

Image Segmentation Mamba +4

Can Wireless Environmental Information Decrease Pilot Overhead: A CSI Prediction Example

no code implementations • 13 Aug 2024 • Lianzheng Shi, Jianhua Zhang, Li Yu, Yuxiang Zhang, Zhen Zhang, Yichen Cai, Guangyi Liu

Finally, a CNN-based channel prediction network is designed to predict the complete CSI, using the environmental feature map and partial CSI.

Prediction

PASSION: Towards Effective Incomplete Multi-Modal Medical Image Segmentation with Imbalanced Missing Rates

1 code implementation • 20 Jul 2024 • Junjie Shi, Caozhi Shang, Zhaobin Sun, Li Yu, Xin Yang, Zengqiang Yan

In this paper, we, for the first time, formulate such a challenging setting and propose Preference-Aware Self-diStillatION (PASSION) for incomplete multi-modal medical image segmentation under imbalanced missing rates.

Image Segmentation Medical Image Segmentation +2

FedMLP: Federated Multi-Label Medical Image Classification under Task Heterogeneity

1 code implementation • 27 Jun 2024 • Zhaobin Sun, Nannan Wu, Junjie Shi, Li Yu, Xin Yang, Kwang-Ting Cheng, Zengqiang Yan

Experiments on two publicly available medical datasets validate the superiority of FedMLP over both state-of-the-art federated semi-supervised and noisy-label learning approaches under task heterogeneity.

Federated Learning image-classification +5

Intermittent Encryption Strategies for Anti-Eavesdropping Estimation

no code implementations • 15 Jun 2024 • Zhongyao Hu, Bo Chen, Pindi Weng, Jianzheng Wang, Li Yu

A linear encryption scheme is utilized, which first linearly transforms innovation via an encryption matrix and then encrypts some components of the transformed innovation.

An Enhanced Dynamic Ray Tracing Architecture for Channel Prediction Based on Multipath Bidirectional Geometry and Field Extrapolation

no code implementations • 5 May 2024 • Yinghe Miao, Li Yu, Yuxiang Zhang, Hongbo Xing, Jianhua Zhang

Interestingly, a dynamic ray tracing (DRT) approach for channel prediction has recently been proposed, which utilizes the results of traditional RT to extrapolate the multipath geometry evolution.

Prediction

Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling

no code implementations • 27 Apr 2024 • Di Wu, Shicai Fan, Xue Zhou, Li Yu, Yuzhong Deng, Jianxiao Zou, Baihong Lin

In MDPS, the problem of normal image reconstruction is mathematically modeled as multiple diffusion posterior sampling for normal images based on the devised masked noisy observation model and the diffusion-based normal image prior under Bayesian framework.

Image Reconstruction Unsupervised Anomaly Detection

From Optimization to Generalization: Fair Federated Learning against Quality Shift via Inter-Client Sharpness Matching

1 code implementation • 27 Apr 2024 • Nannan Wu, Zhuo Kuang, Zengqiang Yan, Li Yu

In this study, we pioneer the identification and formulation of this new fairness challenge within the context of the imaging quality shift.

Fairness Federated Learning

ControlMol: Adding Substructure Control To Molecule Diffusion Models

no code implementations • 22 Apr 2024 • Qi Zhengyang, Liu Zijing, Zhang Jiying, Cao He, Li Yu

Due to the vast design space of molecules, generating molecules conditioned on a specific sub-structure relevant to a particular function or therapeutic target is a crucial task in computer-aided drug design.

3D Molecule Generation Drug Design +1

Pointsoup: High-Performance and Extremely Low-Decoding-Latency Learned Geometry Codec for Large-Scale Point Cloud Scenes

1 code implementation • 21 Apr 2024 • Kang You, Kai Liu, Li Yu, Pan Gao, Dandan Ding

Despite considerable progress being achieved in point cloud geometry compression, there still remains a challenge in effectively compressing large-scale scenes with sparse surfaces.

Decoder

SAMCT: Segment Any CT Allowing Labor-Free Task-Indicator Prompts

1 code implementation • 20 Mar 2024 • Xian Lin, Yangyang Xiang, Zhehao Wang, Kwang-Ting Cheng, Zengqiang Yan, Li Yu

Specifically, based on SAM, SAMCT is further equipped with a U-shaped CNN image encoder, a cross-branch interaction module, and a task-indicator prompt encoder.

Digital Twin Channel for 6G: Concepts, Architectures and Potential Applications

no code implementations • 19 Mar 2024 • Heng Wang, Jianhua Zhang, Gaofeng Nie, Li Yu, Zhiqiang Yuan, Tongjie Li, Jialin Wang, Guangyi Liu

Digital twin channel (DTC) is the real-time mapping of a wireless channel from the physical world to the digital world, which is expected to provide significant performance enhancements for the sixth-generation (6G) air-interface design.

Panoramic Image Inpainting With Gated Convolution And Contextual Reconstruction Loss

no code implementations • 5 Feb 2024 • Li Yu, Yanjun Gao, Farhad Pakdaman, Moncef Gabbouj

In response to these challenges, we propose a panoramic image inpainting framework that consists of a Face Generator, a Cube Generator, a side branch, and two discriminators.

Image Inpainting SSIM +1

FedA3I: Annotation Quality-Aware Aggregation for Federated Medical Image Segmentation against Heterogeneous Annotation Noise

1 code implementation • 20 Dec 2023 • Nannan Wu, Zhaobin Sun, Zengqiang Yan, Li Yu

Specifically, noise estimation at each client is accomplished through the Gaussian mixture model and then incorporated into model aggregation in a layer-wise manner to up-weight high-quality clients.

Federated Learning Image Segmentation +4

Boosting Facial Action Unit Detection Through Jointly Learning Facial Landmark Detection and Domain Separation and Reconstruction

no code implementations • 8 Oct 2023 • Ziqiao Shang, Li Yu

Recently, how to introduce large amounts of unlabeled in-the-wild facial images into supervised Facial Action Unit (AU) detection frameworks has become a challenging problem.

Action Unit Detection Contrastive Learning +3

Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting

1 code implementation • 13 Sep 2023 • Xian Lin, Yangyang Xiang, Li Yu, Zengqiang Yan

End-to-end medical image segmentation is of great value for computer-aided diagnosis dominated by task-specific models, usually suffering from poor generalization.

Image Segmentation Medical Image Segmentation +2

Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning

no code implementations • 6 Sep 2023 • Tianchi Cai, Jiyan Jiang, Wenpeng Zhang, Shiji Zhou, Xierui Song, Li Yu, Lihong Gu, Xiaodong Zeng, Jinjie Gu, Guannan Zhang

We further show that this method is guaranteed to converge to the optimal policy, which cannot be achieved by previous value-based reinforcement learning methods for marketing budget allocation.

Deep Reinforcement Learning Marketing +1

Carbon emissions and sustainability of launching 5G mobile networks in China

no code implementations • 14 Jun 2023 • Tong Li, Li Yu, Yibo Ma, Tong Duan, Wenzhen Huang, Yan Zhou, Depeng Jin, Yong Li, Tao Jiang

We show that the decline in carbon efficiency leads to a carbon efficiency trap, estimated to cause additional carbon emissions of 23.82 ± 1.07 megatons in China.

Deep Reinforcement Learning

Spatiotemporally Consistent HDR Indoor Lighting Estimation

no code implementations • 7 May 2023 • Zhengqin Li, Li Yu, Mikhail Okunev, Manmohan Chandraker, Zhao Dong

For training, we significantly enhance the OpenRooms public dataset of photorealistic synthetic indoor scenes with around 360K HDR environment maps of much higher resolution and 38K video sequences, rendered with GPU-based path tracing.

Decoder Lighting Estimation +1

Remote State Estimation with Posterior-Based Stochastic Event-Triggered Schedule

no code implementations • 17 Apr 2023 • Zhongyao Hu, Bo Chen, Rusheng Wang, Li Yu

A posterior-based SET mechanism is proposed, which determines whether to transmit data by the effect of the measurement on the posterior estimate.

State Estimation

Cross-Fusion Rule for Personalized Federated Learning

no code implementations • 6 Feb 2023 • Wangzhuo Yang, Bo Chen, Yijun Shen, Jiong Liu, Li Yu

To overcome these challenges, a multi-layer multi-fusion strategy framework is proposed in this paper, i.e., the server adopts the network layer parameters of each client's uploaded model as the basic unit of fusion for information-sharing calculation.

Personalized Federated Learning

Learning stability of partially observed switched linear systems

no code implementations • 19 Jan 2023 • Zheming Wang, Raphaël M. Jungers, Mihály Petreczky, Bo Chen, Li Yu

In this paper, we propose an algorithm for deciding stability of switched linear systems under arbitrary switching based purely on observed output data.

Learning Theory

Hand Gesture Recognition through Reflected Infrared Light Wave Signals

no code implementations • 14 Jan 2023 • Md Zobaer Islam, Li Yu, Hisham Abuella, John F. O'Hara, Christopher Crick, Sabit Ekin

In this study, we present a wireless (non-contact) gesture recognition method using only incoherent light wave signals reflected from a human subject.

Hand Gesture Recognition

Secure Fusion Estimation Against FDI Sensor Attacks in Cyber-Physical Systems

no code implementations • 30 Dec 2022 • Bo Chen, Pindi Weng, Daniel W. C. Ho, Li Yu

This paper is concerned with the problem of secure multi-sensor fusion estimation for cyber-physical systems, where sensor measurements may be tampered with by false data injection (FDI) attacks.

End-to-end Transformer for Compressed Video Quality Enhancement

no code implementations • 25 Oct 2022 • Li Yu, Wenshuai Chang, Shiyu Wu, Moncef Gabbouj

In this work, we propose a transformer-based compressed video quality enhancement (TVQE) method, consisting of a Swin-AutoEncoder based Spatio-Temporal feature Fusion (SSTF) module and a Channel-wise Attention based Quality Enhancement (CAQE) module.

How to Define the Propagation Environment Semantics and Its Application in Scatterer-Based Beam Prediction

no code implementations • 17 Sep 2022 • Yutong Sun, Jianhua Zhang, Li Yu, Zhen Zhang, Ping Zhang

Inspired by task-oriented semantic communication and machine learning (ML) powered environment-channel mapping methods, this work aims to provide a new view of the environment from the semantic level, which defines the propagation environment semantics (PES) as a limited set of propagation environment semantic symbols (PESS) for diverse application tasks.

Beam Prediction Semantic Communication

Distributed Event-Triggered Nonlinear Fusion Estimation under Resource Constraints

no code implementations • 3 Aug 2022 • Rusheng Wang, Bo Chen, Zhongyao Hu, Li Yu

This paper studies the event-triggered distributed fusion estimation problems for a class of nonlinear networked multisensor fusion systems without noise statistical characteristics.

Dimensionality Reduction

Frequency-Angle Two-Dimensional Reflection Coefficient Modeling Based on Terahertz Channel Measurement

no code implementations • 11 Jul 2022 • Zhaowei Chang, Jianhua Zhang, Pan Tang, Lei Tian, Li Yu, Guangyi Liu, Liang Xia

In this letter, the reflection coefficient of the THz channel is researched based on extensive measurement campaigns.

BATFormer: Towards Boundary-Aware Lightweight Transformer for Efficient Medical Image Segmentation

2 code implementations • 29 Jun 2022 • Xian Lin, Li Yu, Kwang-Ting Cheng, Zengqiang Yan

Specifically, to fully explore the benefits of transformers in long-range dependency establishment, a cross-scale global transformer (CGT) module is introduced to jointly utilize multiple small-scale feature maps for richer global features with lower computational complexity.

Image Segmentation Medical Image Segmentation +3

FedIIC: Towards Robust Federated Learning for Class-Imbalanced Medical Image Classification

1 code implementation • 28 Jun 2022 • Nannan Wu, Li Yu, Xin Yang, Kwang-Ting Cheng, Zengqiang Yan

In this paper, we present a privacy-preserving FL method named FedIIC to combat class imbalance from two perspectives: feature learning and classifier learning.

Contrastive Learning Federated Learning +4

Distributed Estimation for Interconnected Systems with Arbitrary Coupling Structures

no code implementations • 1 Jun 2022 • Yuchen Zhang, Bo Chen, Li Yu, Daniel W. C. Ho

By merging these subsystem-level stability conditions and the optimization-based estimator gain design, the distributed, stable and optimal estimators are proposed.

Adversarial Learning for Incentive Optimization in Mobile Payment Marketing

no code implementations • 28 Dec 2021 • Xuanying Chen, Zhining Liu, Li Yu, Sen Li, Lihong Gu, Xiaodong Zeng, Yize Tan, Jinjie Gu

This bias deteriorates the performance of the response model and misleads the linear programming process, dramatically degrading the performance of the resulting allocation policy.

Marketing

Kalman-Like Filter under Binary Sensors

no code implementations • 27 Oct 2021 • Zhongyao Hu, Bo Chen, Yuchen Zhang, Li Yu

When considering linear dynamic systems, a conservative estimation error covariance with adjustable parameters is constructed by matrix inequality, and then an optimal filter gain is derived by minimizing its trace.

Enhanced Sequential Covariance Intersection Fusion

no code implementations • 13 Oct 2021 • Zhongyao Hu, Bo Chen, Wen-An Zhang, Li Yu

For this criterion, it is proved that the fusion results are not affected by the fusion structure, and thus the fusion performance can be guaranteed.

Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching

no code implementations • CVPR 2021 • ShiYang Yan, Li Yu, Yuan Xie

We propose a novel attention scheme which projects the image and text embedding into a common space and optimises the attention weights directly towards the evaluation metrics.

Image-text matching Text Matching

A generalization of moment-angle manifolds with non-contractible orbit spaces

no code implementations • 20 Nov 2020 • Li Yu

This generalizes Hochster's formula for the moment-angle manifold over a simple convex polytope.

Algebraic Topology Geometric Topology 57S12 (Primary) 57N65, 57S17, 57S25 (Secondary)

Gesture Recognition using Reflected Visible and Infrared Light Wave Signals

no code implementations • 16 Jul 2020 • Li Yu, Hisham Abuella, Md Zobaer Islam, John F. O'Hara, Christopher Crick, Sabit Ekin

In this paper, we demonstrate the ability to recognize hand gestures in a non-contact, wireless fashion using only incoherent light signals reflected from a human subject.

Hand Gesture Recognition

DA4AD: End-to-End Deep Attention-based Visual Localization for Autonomous Driving

no code implementations • ECCV 2020 • Yao Zhou, Guowei Wan, Shenhua Hou, Li Yu, Gang Wang, Xiaofei Rui, Shiyu Song

We present a visual localization framework based on novel deep attention aware features for autonomous driving that achieves centimeter level localization accuracy.

Autonomous Driving Deep Attention +1

Skull-RCNN: A CNN-based network for the skull fracture detection

no code implementations • MIDL 2019 • Zhuo Kuang, Xianbo Deng, Li Yu, Hang Zhang, Xian Lin, Hui Ma

Guided by the morphological features of the skull, a skeleton-based region proposal method is proposed to make candidate boxes more concentrated in key regions and reduce invalid boxes.

Fracture detection Region Proposal

Accelerate CU Partition in HEVC using Large-Scale Convolutional Neural Network

no code implementations • 23 Sep 2018 • Chenying Wang, Li Yu, Shengwei Wang

High efficiency video coding (HEVC) suffers from high encoding computational complexity, partly attributed to the rate-distortion optimization quad-tree search in CU partition decision.

Context-Aware Online Learning for Course Recommendation of MOOC Big Data

no code implementations • 11 Oct 2016 • Yifan Hou, Pan Zhou, Ting Wang, Li Yu, Yuchong Hu, Dapeng Wu

In this respect, the key challenge is how to realize personalized course recommendation as well as to reduce the computing and storage costs for the tremendous course data.

Recommendation Systems

Monocular Urban Localization using Street View

no code implementations • 17 May 2016 • Li Yu, Cyril Joly, Guillaume Bresson, Fabien Moutarde

This paper presents a metric global localization method for urban environments that uses only a monocular camera and the Google Street View database.

Pose Estimation
