Search Results for author: Jun Li

Found 285 papers, 70 papers with code

Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection

no code implementations13 Jan 2025 Zhourui Zhang, Jun Li, JiaYan Li, Zhijian Wu, Jianhua Xu

Different from previous methods in which global masking is performed on single-scale feature maps, we explore the scale-aware feature masking by performing feature distillation across various scales, such that the object-aware locality is encoded for improved feature reconstruction.

object-detection

SFADNet: Spatio-temporal Fused Graph based on Attention Decoupling Network for Traffic Prediction

1 code implementation7 Jan 2025 Mei Wu, Wenchao Weng, Jun Li, Yiqian Lin, Jing Chen, Dewen Seng

In recent years, traffic flow prediction has played a crucial role in the management of intelligent transportation systems.

Management Time Series +1

Unleashing Correlation and Continuity for Hyperspectral Reconstruction from RGB Images

no code implementations2 Jan 2025 Fuxiang Feng, Runmin Cong, Shoushui Wei, YiPeng Zhang, Jun Li, Sam Kwong, Wei zhang

Therefore, we fully explore these inter-spectral relationships and propose a Correlation and Continuity Network (CCNet) for HSI reconstruction from RGB images.

Spectral Reconstruction

Completion as Enhancement: A Degradation-Aware Selective Image Guided Network for Depth Completion

no code implementations26 Dec 2024 Zhiqiang Yan, Zhengxue Wang, Kun Wang, Jun Li, Jian Yang

In this paper, we introduce the Selective Image Guided Network (SigNet), a novel degradation-aware framework that transforms depth completion into depth enhancement for the first time.

Depth Completion Mamba

Multi-View Fusion Neural Network for Traffic Demand Prediction

no code implementations24 Dec 2024 Dongran Zhang, Jun Li

In this approach, spatial local features are extracted through the use of a graph convolutional network (GCN), and spatial global features are extracted using a cosine re-weighting linear attention mechanism (CLA).

Multimodal joint prediction of traffic spatial-temporal data with graph sparse attention mechanism and bidirectional temporal convolutional network

no code implementations24 Dec 2024 Dongran Zhang, Jiangnan Yan, Kemal Polat, Adi Alhudhaif, Jun Li

First, we use a multimodal graph multiplied by self-attention weights to capture spatial local features, and then employ the Top-U sparse attention mechanism to obtain spatial global features.

Feature Correlation

Guided Real Image Dehazing using YCbCr Color Space

1 code implementation23 Dec 2024 Wenxuan Fang, Junkai Fan, Yu Zheng, Jiangwei Weng, Ying Tai, Jun Li

Image dehazing, particularly with learning-based methods, has gained significant attention due to its importance in real-world applications.

Image Dehazing

Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking

1 code implementation20 Dec 2024 Xiantao Hu, Ying Tai, Xu Zhao, Chen Zhao, Zhenyu Zhang, Jun Li, Bineng Zhong, Jian Yang

These temporal information tokens are used to guide the localization of the target in the next time state, establish long-range contextual relationships between video frames, and capture the temporal trajectory of the target.

Mamba Rgb-T Tracking +1

Efficient Self-Supervised Video Hashing with Selective State Spaces

1 code implementation19 Dec 2024 Jinpeng Wang, Niu Lian, Jun Li, Yuting Wang, Yan Feng, Bin Chen, Yongbing Zhang, Shu-Tao Xia

We introduce S5VH, a Mamba-based video hashing model with an improved self-supervised learning paradigm.

Decoder Mamba +1

Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video

no code implementations16 Dec 2024 Junkai Fan, Kun Wang, Zhiqiang Yan, Xiang Chen, Shangbing Gao, Jun Li, Jian Yang

In this paper, we study the challenging problem of simultaneously removing haze and estimating depth from real monocular hazy videos.

Depth Estimation

Neural Garment Dynamic Super-Resolution

1 code implementation9 Dec 2024 Meng Zhang, Jun Li

In this paper, we introduce a lightweight, learning-based method for garment dynamic super-resolution, designed to efficiently enhance high-resolution, high-frequency details in low-resolution garment simulations.

Super-Resolution

IRS Aided Federated Learning: Multiple Access and Fundamental Tradeoff

no code implementations30 Nov 2024 Guangji Chen, Jun Li, Qingqing Wu, Yiyang Ni

Under the three protocols, we minimize the per-round latency subject to a given training loss by jointly optimizing the device scheduling, IRS phase-shifts, and communicationcomputation resource allocation.

Federated Learning Scheduling

Dynamic Trajectory and Power Control in Ultra-Dense UAV Networks: A Mean-Field Reinforcement Learning Approach

no code implementations21 Nov 2024 Fei Song, Zhe Wang, Jun Li, Long Shi, Wen Chen, Shi Jin

In ultra-dense unmanned aerial vehicle (UAV) networks, it is challenging to coordinate the resource allocation and interference management among large-scale UAVs, for providing flexible and efficient service coverage to the ground users (GUs).

V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception

no code implementations17 Nov 2024 Lei Yang, Xinyu Zhang, Jun Li, Chen Wang, Zhiying Song, Tong Zhao, Ziying Song, Li Wang, Mo Zhou, Yang shen, Kai Wu, Chen Lv

Previous studies have demonstrated the effectiveness of cooperative perception in extending the perception range and overcoming occlusions, thereby improving the safety of autonomous driving.

Autonomous Driving

Affine Frequency Division Multiplexing with Index Modulation: Full Diversity Condition, Performance Analysis, and Low-Complexity Detection

no code implementations15 Nov 2024 Yiwei Tao, Miaowen Wen, Yao Ge, Jun Li, Ertugrul Basar, Naofal Al-Dhahir

Finally, BER simulation results confirm that our proposed CDD-AFDM-IM schemes with both the ML and DLMP detections outperform the benchmark schemes over the LTV channels.

Diversity

Try-On-Adapter: A Simple and Flexible Try-On Paradigm

no code implementations15 Nov 2024 Hanzhong Guo, Jianfeng Zhang, Cheng Zou, Jun Li, Meng Wang, Ruxue Wen, Pingzhong Tang, Jingdong Chen, Ming Yang

A key challenge of try-on is to generate realistic images of the model wearing the garments while preserving the details of the garments.

Virtual Try-on

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

1 code implementation10 Nov 2024 Zhennan Chen, Yajie Li, Haofan Wang, Zhibo Chen, Zhengkai Jiang, Jun Li, Qian Wang, Jian Yang, Ying Tai

Regional prompting, or compositional generation, which enables fine-grained spatial control, has gained increasing attention for its practicality in real-world applications.

Attribute RAG +1

Novel Object Synthesis via Adaptive Text-Image Harmony

no code implementations28 Oct 2024 Zeren Xiong, Zedong Zhang, Zikun Chen, Shuo Chen, Xiang Li, Gan Sun, Jian Yang, Jun Li

In this paper, we study an object synthesis task that combines an object text with an object image to create a new object image.

Object

Citywide Electric Vehicle Charging Demand Prediction Approach Considering Urban Region and Dynamic Influences

no code implementations24 Oct 2024 Haoxuan Kuang, Kunxiang Deng, Linlin You, Jun Li

To tackle these issues, we propose a learning approach for citywide electric vehicle charging demand prediction, named CityEVCP.

Graph Attention Variable Selection

DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain

1 code implementation19 Oct 2024 Kun Wang, Zhiqiang Yan, Junkai Fan, Wanlu Zhu, Xiang Li, Jun Li, Jian Yang

In this paper, we introduce DCDepth, a novel framework for the long-standing monocular depth estimation task.

Monocular Depth Estimation

Organizing Unstructured Image Collections using Natural Language

no code implementations7 Oct 2024 Mingxuan Liu, Zhun Zhong, Jun Li, Gianni Franchi, Subhankar Roy, Elisa Ricci

Our framework, Text Driven Semantic Multiple Clustering (TeDeSC), uses text as a proxy to concurrently reason over large image collections, discover partitioning criteria, expressed in natural language, and reveal semantic substructures.

Clustering Deep Clustering

IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification

no code implementations7 Oct 2024 Yan He, Bing Tu, Puzhao Jiang, Bo Liu, Jun Li, Antonio Plaza

In light of this, this paper investigates a lightweight Interval Group Spatial-Spectral Mamba framework (IGroupSS-Mamba) for HSI classification, which allows for multi-directional and multi-scale global spatial-spectral information extraction in a grouping and hierarchical manner.

Computational Efficiency Hyperspectral Image Classification +2

Towards TMA-Based Transmissive RIS Transceiver Enabled Downlink Communication Networks: A Consensus-ADMM Approach

no code implementations4 Oct 2024 Zhendong Li, Wen Chen, Haoran Qin, Qingqing Wu, Xusheng Zhu, Ziheng Zhang, Jun Li

This paper presents a novel multi-stream downlink communication system that utilizes a transmissive reconfigurable intelligent surface (RIS) transceiver.

Fairness

3DGR-CAR: Coronary artery reconstruction from ultra-sparse 2D X-ray views with a 3D Gaussians representation

1 code implementation1 Oct 2024 Xueming Fu, Yingtai Li, Fenghe Tang, Jun Li, Mingyue Zhao, Gao-Jun Teng, S. Kevin Zhou

We leverage 3D Gaussian representation to avoid the inefficiency caused by the extreme sparsity of coronary artery data and propose a Gaussian center predictor to overcome the noisy Gaussian initialization from ultra-sparse view projections.

3D Reconstruction

FMBench: Benchmarking Fairness in Multimodal Large Language Models on Medical Tasks

no code implementations1 Oct 2024 Peiran Wu, Che Liu, Canyu Chen, Jun Li, Cosmin I. Bercea, Rossella Arcucci

In response, we propose FMBench, the first benchmark designed to evaluate the fairness of MLLMs performance across diverse demographic attributes.

Benchmarking Fairness +2

Adversarial Federated Consensus Learning for Surface Defect Classification Under Data Heterogeneity in IIoT

no code implementations24 Sep 2024 Jixuan Cui, Jun Li, Zhen Mei, Yiyang Ni, Wen Chen, Zengxiang Li

In this paper, we propose a novel personalized FL (PFL) approach, named Adversarial Federated Consensus Learning (AFedCL), for the challenge of data heterogeneity across different clients in SDC.

Federated Learning

GMFL-Net: A Global Multi-geometric Feature Learning Network for Repetitive Action Counting

no code implementations31 Aug 2024 Jun Li, Jinying Wu, Qiming Li, Feifei Guo

Then, to improve the feature representation from a global perspective, we also design a GBFL-Module that enhances the inter-dependencies between point-wise and channel-wise elements and combines them with the rich local information generated by the MIA-Module to synthesise a comprehensive and most representative global feature representation.

Pose Estimation Repetitive Action Counting +2

Resource Efficient Asynchronous Federated Learning for Digital Twin Empowered IoT Network

no code implementations26 Aug 2024 Shunfeng Chu, Jun Li, Jianxin Wang, Yiyang Ni, Kang Wei, Wen Chen, Shi Jin

We utilize the Lyapunov method to decouple the formulated problem into a series of one-slot optimization problems and develop a two-stage optimization algorithm to achieve the optimal transmission power control and IoT device scheduling strategies.

Federated Learning Scheduling

Flatten: Video Action Recognition is an Image Classification task

no code implementations17 Aug 2024 Junlin Chen, Chengcheng Xu, Yangfan Xu, Jian Yang, Jun Li, Zhiping Shi

In recent years, video action recognition, as a fundamental task in the field of video understanding, has been deeply explored by numerous researchers. Most traditional video action recognition methods typically involve converting videos into three-dimensional data that encapsulates both spatial and temporal information, subsequently leveraging prevalent image understanding models to model and analyze these data.

Action Recognition Image Classification +2

Garment Animation NeRF with Color Editing

1 code implementation29 Jul 2024 Renke Wang, Meng Zhang, Jun Li, Jian Yan

Our approach infers garment dynamic features from body motion, providing a preliminary overview of garment structure.

Neural Rendering

DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection

no code implementations18 Jul 2024 Zhourui Zhang, Jun Li, Zhijian Wu, Jifeng Shen, Jianhua Xu

In this study, a novel dual feature-masking heterogeneous distillation framework termed DFMSD is proposed for object detection.

Knowledge Distillation Object +2

Unsupervised Domain Adaptive Lane Detection via Contextual Contrast and Aggregation

no code implementations18 Jul 2024 Kunyang Zhou, Yunjian Feng, Jun Li

This paper focuses on two crucial issues in domain-adaptive lane detection, i. e., how to effectively learn discriminative features and transfer knowledge across domains.

Lane Detection

EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing

1 code implementation18 Jul 2024 Wei zhang, Miaoxin Cai, Tong Zhang, Jun Li, Yin Zhuang, Xuerui Mao

Specifically, a shared visual encoding method is developed to establish the spatial pattern interpretation relationships between the multi-scale representations of input images and various visual prompts.

Instruction Following Language Modeling +3

ISPO: An Integrated Ontology of Symptom Phenotypes for Semantic Integration of Traditional Chinese Medical Data

no code implementations8 Jul 2024 Zixin Shu, Rui Hua, Dengying Yan, Chenxia Lu, Ning Xu, Jun Li, Hui Zhu, Jia Zhang, Dan Zhao, Chenyang Hui, Junqiu Ye, Chu Liao, Qi Hao, Wen Ye, Cheng Luo, Xinyan Wang, Chuang Cheng, XiaoDong Li, Baoyan Liu, Xiaji Zhou, Runshun Zhang, Min Xu, Xuezhong Zhou

Methods: To construct an integrated ontology of symptom phenotypes (ISPO), we manually annotated classical TCM textbooks and large-scale Chinese electronic medical records (EMRs) to collect symptom terms with support from a medical text annotation system.

text annotation

Decision Transformer for IRS-Assisted Systems with Diffusion-Driven Generative Channels

no code implementations28 Jun 2024 Jie Zhang, Jun Li, Zhe Wang, Yu Han, Long Shi, Bin Cao

In this paper, we propose a novel diffusion-decision transformer (D2T) architecture to optimize the beamforming strategies for intelligent reflecting surface (IRS)-assisted multiple-input single-output (MISO) communication systems.

Reinforcement Learning (RL)

Deep Imbalanced Regression to Estimate Vascular Age from PPG Data: a Novel Digital Biomarker for Cardiovascular Health

2 code implementations21 Jun 2024 Guangkun Nie, Qinghao Zhao, Gongzheng Tang, Jun Li, Shenda Hong

Photoplethysmography (PPG) is emerging as a crucial tool for monitoring human hemodynamics, with recent studies highlighting its potential in assessing vascular aging through deep learning.

Deep imbalanced regression Photoplethysmography (PPG)

Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance

no code implementations2 Jun 2024 Jun Li, Tongkun Su, Baoliang Zhao, Faqin Lv, Qiong Wang, Nassir Navab, Ying Hu, Zhongliang Jiang

In this work, we propose a novel framework for automatic ultrasound report generation, leveraging a combination of unsupervised and supervised learning methods to aid the report generation process.

Trustworthy DNN Partition for Blockchain-enabled Digital Twin in Wireless IIoT Networks

no code implementations28 May 2024 Xiumei Deng, Jun Li, Long Shi, Kang Wei, Ming Ding, Yumeng Shao, Wen Chen, Shi Jin

To promote the efficiency and trustworthiness of DT for wireless IIoT networks, we propose a blockchain-enabled DT (B-DT) framework that employs deep neural network (DNN) partitioning technique and reputation-based consensus mechanism, wherein the DTs maintained at the gateway side execute DNN inference tasks using the data collected from their associated IIoT devices.

POS Stochastic Optimization

Towards Communication-efficient Federated Learning via Sparse and Aligned Adaptive Optimization

no code implementations28 May 2024 Xiumei Deng, Jun Li, Kang Wei, Long Shi, Zeihui Xiong, Ming Ding, Wen Chen, Shi Jin, H. Vincent Poor

Driven by this issue, we propose a novel sparse FedAdam algorithm called FedAdam-SSM, wherein distributed devices sparsify the updates of local model parameters and moment estimates and subsequently upload the sparse representations to the centralized server.

Federated Learning

MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space

no code implementations25 May 2024 Jiangwei Weng, Zhiqiang Yan, Ying Tai, Jianjun Qian, Jian Yang, Jun Li

In this paper, we introduce MambaLLIE, an implicit Retinex-aware low light enhancer featuring a global-then-local state space design.

Long-range modeling Low-Light Image Enhancement +1

3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification

no code implementations21 May 2024 Yan He, Bing Tu, Bo Liu, Jun Li, Antonio Plaza

To overcome the limitations of traditional Mamba, which is confined to modeling causal sequences and inadaptable to high-dimensional scenarios, a 3D-Spectral-Spatial Selective Scanning (3DSS) mechanism is introduced, which performs pixel-wise selective scanning on 3D hyperspectral tokens along the spectral and spatial dimensions.

Computational Efficiency Hyperspectral Image Classification +1

Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance

no code implementations CVPR 2024 Junkai Fan, Jiangwei Weng, Kun Wang, Yijun Yang, Jianjun Qian, Jun Li, Jian Yang

Firstly, we introduce a non-aligned reference frame matching module, leveraging an adaptive sliding window to match high-quality reference frames from clear videos.

Direct Learning of Mesh and Appearance via 3D Gaussian Splatting

no code implementations11 May 2024 Ancheng Lin, Jun Li

The model creates an effective information pathway to supervise the learning of both 3DGS and mesh.

Deploying Graph Neural Networks in Wireless Networks: A Link Stability Viewpoint

no code implementations9 May 2024 Jun Li, Weiwei Zhang, Kang Wei, Guangji Chen, Long Shi, Wen Chen

In practical wireless systems, the communication links among nodes are usually unreliable due to wireless fading and receiver noise, consequently resulting in performance degradation of GNNs.

Combinatorial Optimization

A Knowledge-driven Memetic Algorithm for the Energy-efficient Distributed Homogeneous Flow Shop Scheduling Problem

no code implementations28 Apr 2024 Yunbao Xu, Xuemei Jiang, Jun Li, Lining Xing, Yanjie Song

Furthermore, several algorithmic improvements including update strategy, local search strategy, and carbon reduction strategy are employed to improve the search performance of the algorithm.

Scheduling

Meply: A Large-scale Dataset and Baseline Evaluations for Metastatic Perirectal Lymph Node Detection and Segmentation

1 code implementation13 Apr 2024 Weidong Guo, Hantao Zhang, Shouhong Wan, Bingbing Zou, Wanqin Wang, Chenyang Qiu, Jun Li, Peiquan Jin

The CoSAM utilizes sequence-based detection to guide the segmentation of metastatic lymph nodes in rectal cancer, contributing to improved localization performance for the segmentation model.

Segmentation

Integrated Sensing and Communication Under DISCO Physical-Layer Jamming Attacks

no code implementations11 Apr 2024 Huan Huang, Hongliang Zhang, Weidong Mei, Jun Li, Yi Cai, A. Lee Swindlehurst, Zhu Han

Moreover, a theoretical analysis is conducted to quantify the impact of DISCO jamming attacks.

Language Models Meet Anomaly Detection for Better Interpretability and Generalizability

1 code implementation11 Apr 2024 Jun Li, Su Hwan Kim, Philip Müller, Lina Felsner, Daniel Rueckert, Benedikt Wiestler, Julia A. Schnabel, Cosmin I. Bercea

This research explores the integration of language models and unsupervised anomaly detection in medical imaging, addressing two key questions: (1) Can language models enhance the interpretability of anomaly detection maps?

Language Modelling Natural Language Inference +3

Decision Transformers for Wireless Communications: A New Paradigm of Resource Management

no code implementations8 Apr 2024 Jie Zhang, Jun Li, Long Shi, Zhe Wang, Shi Jin, Wen Chen, H. Vincent Poor

By leveraging the power of DT models learned over offline datasets, the proposed architecture is expected to achieve rapid convergence with many fewer training epochs and higher performance in new scenarios with different state and action spaces, compared with DRL.

Deep Reinforcement Learning Edge-computing +2

Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training

2 code implementations30 Mar 2024 Tongkun Su, Jun Li, Xi Zhang, Haibo Jin, Hao Chen, Qiong Wang, Faqin Lv, Baoliang Zhao, Yin Hu

We leverage descriptions in medical reports to design multi-granular question-answer pairs associated with different diseases, which assist the framework in pre-training without requiring extra annotations from experts.

Contrastive Learning Question Answering +1

Diff-Reg v1: Diffusion Matching Model for Registration Problem

1 code implementation29 Mar 2024 Qianliang Wu, Haobo Jiang, Lei Luo, Jun Li, Yaqing Ding, Jin Xie, Jian Yang

Establishing reliable correspondences is essential for registration tasks such as 3D and 2D3D registration.

Denoising

Tri-Perspective View Decomposition for Geometry-Aware Depth Completion

no code implementations CVPR 2024 Zhiqiang Yan, Yuankai Lin, Kun Wang, Yupeng Zheng, YuFei Wang, Zhenyu Zhang, Jun Li, Jian Yang

Depth completion is a vital task for autonomous driving, as it involves reconstructing the precise 3D geometry of a scene from sparse and noisy depth measurements.

3D geometry Autonomous Driving +1

Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

no code implementations19 Mar 2024 Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu, Guoqing Zhao

Although some approaches work towards gloss-free SLT through jointly training the visual encoder and translation network, these efforts still suffer from poor performance and inefficient use of the powerful Large Language Model (LLM).

Gloss-free Sign Language Translation Language Modeling +4

Harmonious Group Choreography with Trajectory-Controllable Diffusion

no code implementations10 Mar 2024 Yuqin Dai, Wanlu Zhu, Ronghui Li, Zeping Ren, Xiangzheng Zhou, Xiu Li, Jun Li, Jian Yang

Specifically, to tackle dancer collisions, we introduce a Dance-Beat Navigator capable of generating trajectories for multiple dancers based on the music, complemented by a Distance-Consistency loss to maintain appropriate spacing among trajectories within a reasonable threshold.

Motion Synthesis

On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder

1 code implementation6 Mar 2024 Tingxu Han, Shenghan Huang, Ziqi Ding, Weisong Sun, Yebo Feng, Chunrong Fang, Jun Li, Hanwei Qian, Cong Wu, Quanjun Zhang, Yang Liu, Zhenyu Chen

Distillation aims to distill knowledge from a given model (a. k. a the teacher net) and transfer it to another (a. k. a the student net).

Image Classification

Performance Evaluation and Analysis of Thresholding-based Interference Mitigation for Automotive Radar Systems

no code implementations21 Feb 2024 Jun Li, Jihwan Youn, Ryan Wu, Jeroen Overdevest, Shunqiao Sun

In automotive radar, time-domain thresholding (TD-TH) and time-frequency domain thresholding (TFD-TH) are crucial techniques underpinning numerous interference mitigation methods.

Private Knowledge Sharing in Distributed Learning: A Survey

no code implementations8 Feb 2024 Yasas Supeksala, Dinh C. Nguyen, Ming Ding, Thilina Ranbaduge, Calson Chua, Jun Zhang, Jun Li, H. Vincent Poor

In this light, it is crucial to utilize information in learning processes that are either distributed or owned by different entities.

Survey

PVLR: Prompt-driven Visual-Linguistic Representation Learning for Multi-Label Image Recognition

no code implementations31 Jan 2024 Hao Tan, Zichang Tan, Jun Li, Jun Wan, Zhen Lei

In contrast to the unidirectional fusion in previous works, we introduce a Dual-Modal Attention (DMA) that enables bidirectional interaction between textual and visual features, yielding context-aware label representations and semantic-related visual representations, which are subsequently used to calculate similarities and generate final predictions for all labels.

Multi-Label Image Recognition Representation Learning

SGV3D:Towards Scenario Generalization for Vision-based Roadside 3D Object Detection

1 code implementation29 Jan 2024 Lei Yang, Xinyu Zhang, Jun Li, Li Wang, Chuang Zhang, Li Ju, Zhiwei Li, Yang shen

Our method surpasses all previous methods by a significant margin in new scenes, including +42. 57% for vehicle, +5. 87% for pedestrian, and +14. 89% for cyclist compared to BEVHeight on the DAIR-V2X-I heterologous benchmark.

3D Object Detection Autonomous Vehicles +1

Exploring Multi-Modal Control in Music-Driven Dance Generation

no code implementations1 Jan 2024 Ronghui Li, Yuqin Dai, Yachao Zhang, Jun Li, Jian Yang, Jie Guo, Xiu Li

Existing music-driven 3D dance generation methods mainly concentrate on high-quality dance generation, but lack sufficient control during the generation process.

Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval

1 code implementation19 Dec 2023 Zhihang Liu, Jun Li, Hongtao Xie, Pandeng Li, Jiannan Ge, Sun-Ao Liu, Guoqing Jin

In this paper, we introduce Modal-Enhanced Semantic Modeling (MESM), a novel framework for more balanced alignment through enhancing features at two levels.

cross-modal alignment Moment Retrieval +2

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

1 code implementation11 Dec 2023 Hao Tan, Jun Li, Yizhuang Zhou, Jun Wan, Zhen Lei, Xiangyu Zhang

We introduce text supervision to the optimization of prompts, which enables two benefits: 1) releasing the model reliance on the pre-defined category names during inference, thereby enabling more flexible prompt generation; 2) reducing the number of inputs to the text encoder, which decreases GPU memory consumption significantly.

Domain Generalization

Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS

no code implementations21 Oct 2023 Li Wang, Xinyu Zhang, Fachuan Zhao, Chuze Wu, Yichen Wang, Ziying Song, Lei Yang, Jun Li, Huaping Liu

The proposed Fuzzy-NMS module combines the volume and clustering density of candidate bounding boxes, refining them with a fuzzy classification method and optimizing the appropriate suppression thresholds to reduce uncertainty in the NMS process.

3D Object Detection object-detection

FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer

no code implementations20 Oct 2023 Xinyu Zhang, Li Wang, Zhiqiang Jiang, Kun Dai, Tao Xie, Lei Yang, Wenhao Yu, Yang shen, Jun Li

However, these methods only integrate long-range context information among keypoints with a fixed receptive field, which constrains the network from reconciling the importance of features with different receptive fields to realize complete image perception, hence limiting the matching accuracy.

Homography Estimation Pose Estimation +1

Federated Meta-Learning for Few-Shot Fault Diagnosis with Representation Encoding

no code implementations13 Oct 2023 Jixuan Cui, Jun Li, Zhen Mei, Kang Wei, Sha Wei, Ming Ding, Wen Chen, Song Guo

However, the domain discrepancy and data scarcity problems among clients deteriorate the performance of the global FL model.

Federated Learning Meta-Learning +1

Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving

no code implementations13 Oct 2023 Feng Jiang, Chaoping Tu, Gang Zhang, Jun Li, Hanqing Huang, Junyu Lin, Di Feng, Jian Pu

LiDAR and camera are two critical sensors for multi-modal 3D semantic segmentation and are supposed to be fused efficiently and robustly to promise safety in various real-world scenarios.

3D Semantic Segmentation Autonomous Driving +1

Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous Driving

1 code implementation11 Oct 2023 Xinyu Zhang, Li Wang, Jian Chen, Cheng Fang, Lei Yang, Ziying Song, Guangqi Yang, Yichen Wang, Xiaofei Zhang, Jun Li, Zhiwei Li, Qingshan Yang, Zhenlin Zhang, Shuzhi Sam Ge

Compared with commonly used 3D radars, the latest 4D radars have precise vertical resolution and higher point cloud density, making it a highly promising sensor for autonomous driving in complex environmental perception.

3D Object Detection Autonomous Driving +1

Dynamic Appearance Particle Neural Radiance Field

no code implementations11 Oct 2023 Ancheng Lin, Jun Li

In this work, we propose Dynamic Appearance Particle Neural Radiance Field (DAP-NeRF), which introduces particle-based representation to model the motions of visual elements in a dynamic 3D scene.

Affine Frequency Division Multiplexing With Index Modulation

no code implementations9 Oct 2023 Yiwei Tao, Miaowen Wen, Yao Ge, Jun Li

In the proposed AFDM-IM scheme, the information bits are carried by the activation state of the subsymbols in discrete affine Fourier (DAF) domain in addition to the conventional constellation symbols.

Diversity

TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling

no code implementations3 Oct 2023 Jun Li, Zedong Zhang, Jian Yang

Generating creative combinatorial objects from two seemingly unrelated object texts is a challenging task in text-to-image synthesis, often hindered by a focus on emulating existing data distributions.

Object Text-to-Image Generation

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

no code implementations30 Sep 2023 Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

We discover that most existing monocular 3D object detectors rely on the ego-vehicle prior assumption that the optical axis of the camera is parallel to the ground.

Autonomous Driving Monocular 3D Object Detection +1

BEVHeight++: Toward Robust Visual Centric 3D Object Detection

no code implementations28 Sep 2023 Lei Yang, Tao Tang, Jun Li, Peng Chen, Kun Yuan, Li Wang, Yi Huang, Xinyu Zhang, Kaicheng Yu

In essence, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods.

3D Object Detection Autonomous Driving +2

Federated Learning in Intelligent Transportation Systems: Recent Applications and Open Problems

no code implementations20 Sep 2023 Shiying Zhang, Jun Li, Long Shi, Ming Ding, Dinh C. Nguyen, Wuzheng Tan, Jian Weng, Zhu Han

Intelligent transportation systems (ITSs) have been fueled by the rapid development of communication technologies, sensor technologies, and the Internet of Things (IoT).

Federated Learning Object Recognition

TSSAT: Two-Stage Statistics-Aware Transformation for Artistic Style Transfer

1 code implementation12 Sep 2023 Haibo Chen, Lei Zhao, Jun Li, Jian Yang

To address this issue, we imitate the drawing process of humans and propose a Two-Stage Statistics-Aware Transformation (TSSAT) module, which first builds the global style foundation by aligning the global statistics of content and style features and then further enriches local style details by swapping the local statistics (instead of local features) in a patch-wise manner, significantly improving the stylization effects.

Style Transfer

A physics-informed and attention-based graph learning approach for regional electric vehicle charging demand prediction

1 code implementation11 Sep 2023 Haohao Qu, Haoxuan Kuang, Jun Li, Linlin You

Along with the proliferation of electric vehicles (EVs), optimizing the use of EV charging space can significantly alleviate the growing load on intelligent transportation systems.

Graph Learning Meta-Learning +1

Create Your World: Lifelong Text-to-Image Diffusion

no code implementations8 Sep 2023 Gan Sun, Wenqi Liang, Jiahua Dong, Jun Li, Zhengming Ding, Yang Cong

Text-to-image generative models can produce diverse high-quality images of concepts with a text prompt, which have demonstrated excellent ability in image generation, image translation, etc.

Attribute Image Generation

DMKD: Improving Feature-based Knowledge Distillation for Object Detection Via Dual Masking Augmentation

no code implementations6 Sep 2023 Guang Yang, Yin Tang, Zhijian Wu, Jun Li, Jianhua Xu, Xili Wan

Recent mainstream masked distillation methods function by reconstructing selectively masked areas of a student network from the feature map of its teacher counterpart.

Knowledge Distillation object-detection +1

RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion

no code implementations1 Sep 2023 Zhiqiang Yan, Xiang Li, Le Hui, Zhenyu Zhang, Jun Li, Jian Yang

To tackle these challenges, we explore a repetitive design in our image guided network to gradually and sufficiently recover depth values.

Depth Completion Depth Estimation +1

Fragment and Integrate Network (FIN): A Novel Spatial-Temporal Modeling Based on Long Sequential Behavior for Online Food Ordering Click-Through Rate Prediction

no code implementations30 Aug 2023 Jun Li, Jingjian Wang, Hongwei Wang, Xing Deng, Jielong Chen, Bing Cao, Zekun Wang, Guanjie Xu, Ge Zhang, Feng Shi, Hualei Liu

(ii) Integrate Network (IN) builds a new integrated sequence by utilizing spatial-temporal interaction on MSS and captures the comprehensive spatial-temporal representation by modeling the integrated sequence with a complicated attention.

Click-Through Rate Prediction Recommendation Systems

SkipcrossNets: Adaptive Skip-cross Fusion for Road Detection

no code implementations24 Aug 2023 Yan Gong, Xinyu Zhang, Hao liu, Xinmin Jiang, Zhiwei Li, Xin Gao, Lei Lin, Dafeng Jin, Jun Li, Huaping Liu

Specifically, skip-cross fusion strategy connects each layer to each layer in a feed-forward manner, and for each layer, the feature maps of all previous layers are used as input and its own feature maps are used as input to all subsequent layers for the other modality, enhancing feature propagation and multi-modal features fusion.

Autonomous Driving

Dynamic Dual-Graph Fusion Convolutional Network For Alzheimer's Disease Diagnosis

no code implementations5 Aug 2023 Fanshi Li, Zhihui Wang, Yifan Guo, Congcong Liu, Yanjie Zhu, Yihang Zhou, Jun Li, Dong Liang, Haifeng Wang

In this paper, a dynamic dual-graph fusion convolutional network is proposed to improve Alzheimer's disease (AD) diagnosis performance.

Graph Learning

Analysis and Optimization of Wireless Federated Learning with Data Heterogeneity

no code implementations4 Aug 2023 Xuefeng Han, Jun Li, Wen Chen, Zhen Mei, Kang Wei, Ming Ding, H. Vincent Poor

With the rapid proliferation of smart mobile devices, federated learning (FL) has been widely considered for application in wireless networks for distributed model training.

Federated Learning Scheduling

Aspect based sentimental analysis for travellers' reviews

no code implementations1 Aug 2023 Mohammed Saad M Alaydaa, Jun Li, Karl Jinkins

The results provide tangible reasons to use aspect based sentimental analysis in order to understand more the travellers and spot airport services that are in need for improvement.

Management TAG

UniAP: Unifying Inter- and Intra-Layer Automatic Parallelism by Mixed Integer Quadratic Programming

no code implementations31 Jul 2023 Hao Lin, Ke wu, Jie Li, Jun Li, Wu-Jun Li

To the best of our knowledge, UniAP is the first parallel method that can jointly optimize the two categories of parallel strategies to find an optimal solution.

Creative Birds: Self-Supervised Single-View 3D Style Transfer

2 code implementations ICCV 2023 Renke Wang, Guimin Que, Shuo Chen, Xiang Li, Jun Li, Jian Yang

Our focus lies primarily on birds, a popular subject in 3D reconstruction, for which no existing single-view 3D transfer methods have been developed. The method we propose seeks to generate a 3D mesh shape and texture of a bird from two single-view images.

3D Reconstruction Style Transfer

Bayesian Linear Regression with Cauchy Prior and Its Application in Sparse MIMO Radar

no code implementations20 Jul 2023 Jun Li, Ryan Wu, I-Tai Lu, Dongyin Ren

In this paper, a sparse signal recovery algorithm using Bayesian linear regression with Cauchy prior (BLRC) is proposed.

regression Single Particle Analysis

General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds with One Stone

no code implementations19 Jul 2023 Zenghao Bao, Zichang Tan, Jun Li, Jun Wan, Xibo Ma, Zhen Lei

Driven by this, some works suggest that each class should be treated equally to improve performance in tail classes (with a minority of samples), which can be summarized as Long-tailed Age Estimation.

Age Estimation MORPH

Mobility-Aware Joint User Scheduling and Resource Allocation for Low Latency Federated Learning

no code implementations18 Jul 2023 Kecheng Fan, Wen Chen, Jun Li, Xiumei Deng, Xuefeng Han, Ming Ding

As an efficient distributed machine learning approach, Federated learning (FL) can obtain a shared model by iterative local model training at the user side and global model aggregating at the central server side, thereby protecting privacy of users.

Federated Learning Scheduling

NCL++: Nested Collaborative Learning for Long-Tailed Visual Recognition

1 code implementation29 Jun 2023 Zichang Tan, Jun Li, Jinhao Du, Jun Wan, Zhen Lei, Guodong Guo

To achieve the collaborative learning in long-tailed learning, the balanced online distillation is proposed to force the consistent predictions among different experts and augmented copies, which reduces the learning uncertainties.

Learnable Differencing Center for Nighttime Depth Perception

1 code implementation26 Jun 2023 Zhiqiang Yan, Yupeng Zheng, Chongyi Li, Jun Li, Jian Yang

Depth completion is the task of recovering dense depth maps from sparse ones, usually with the help of color images.

Depth Completion Depth Estimation

Variable Radiance Field for Real-Life Category-Specifc Reconstruction from Single Image

no code implementations8 Jun 2023 Kun Wang, Zhiqiang Yan, Zhenyu Zhang, Xiang Li, Jun Li, Jian Yang

Our key contributions are: (1) We parameterize the geometry and appearance of the object using a multi-scale global feature extractor, which avoids frequent point-wise feature retrieval and camera dependency.

Contrastive Learning Object +1

EfficientSRFace: An Efficient Network with Super-Resolution Enhancement for Accurate Face Detection

no code implementations4 Jun 2023 Guangtao Wang, Jun Li, Jie Xie, Jianhua Xu, Bo Yang

In face detection, low-resolution faces, such as numerous small faces of a human group in a crowded scene, are common in dense face prediction tasks.

Benchmarking Face Detection +1

Self-Aware Trajectory Prediction for Safe Autonomous Driving

no code implementations16 May 2023 Wenbo Shao, Jun Li, Hong Wang

Trajectory prediction is one of the key components of the autonomous driving software stack.

Autonomous Driving Trajectory Prediction

Path Planning for Air-Ground Robot Considering Modal Switching Point Optimization

no code implementations14 May 2023 Xiaoyu Wang, Kangyao Huang, Xinyu Zhang, Honglin Sun, Wenzhuo LIU, Huaping Liu, Jun Li, Pingping Lu

A robot for the field application environment was proposed, and a lightweight global spatial planning technique for the robot based on the graph-search algorithm taking mode switching point optimization into account, with an emphasis on energy efficiency, searching speed, and the viability of real deployment.

Human Machine Co-adaption Interface via Cooperation Markov Decision Process System

no code implementations3 May 2023 Kairui Guo, Adrian Cheng, Yaqi Li, Jun Li, Rob Duffield, Steven W. Su

Based on the proposed co-adaptive MDPs, the simulation study indicates the non-stationary problem can be mitigated using various proposed Policy Improvement approaches.

Model-based Reinforcement Learning Multi-agent Reinforcement Learning +2

ScatterFormer: Locally-Invariant Scattering Transformer for Patient-Independent Multispectral Detection of Epileptiform Discharges

1 code implementation26 Apr 2023 Ruizhe Zheng, Jun Li, Yi Wang, Tian Luo, Yuguo Yu

Patient-independent detection of epileptic activities based on visual spectral representation of continuous EEG (cEEG) has been widely used for diagnosing epilepsy.

EEG Seizure Detection

Informative Data Selection with Uncertainty for Multi-modal Object Detection

no code implementations23 Apr 2023 Xinyu Zhang, Zhiwei Li, Zhenhong Zou, Xin Gao, Yijin Xiong, Dafeng Jin, Jun Li, Huaping Liu

To quantify the correlation in multi-modal information, we model the uncertainty, as the inverse of data information, in different modalities and embed it in the bounding box generation.

Informativeness object-detection +1

Gradient Sparsification for Efficient Wireless Federated Learning with Differential Privacy

no code implementations9 Apr 2023 Kang Wei, Jun Li, Chuan Ma, Ming Ding, Feng Shu, Haitao Zhao, Wen Chen, Hongbo Zhu

Specifically, we first design a random sparsification algorithm to retain a fraction of the gradient elements in each client's local training, thereby mitigating the performance degradation induced by DP and and reducing the number of transmission parameters over wireless channels.

Federated Learning Scheduling +1

Design of Two-Level Incentive Mechanisms for Hierarchical Federated Learning

no code implementations9 Apr 2023 Shunfeng Chu, Jun Li, Kang Wei, Yuwen Qian, Kunlun Wang, Feng Shu, Wen Chen

In this paper, we design two-level incentive mechanisms for the HFL with a two-tiered computing structure to encourage the participation of entities in each tier in the HFL training.

Federated Learning Vocal Bursts Valence Prediction

LogoNet: a fine-grained network for instance-level logo sketch retrieval

1 code implementation5 Apr 2023 Binbin Feng, Jun Li, Jianhua Xu

To our knowledge, this is the first publicly available instance-level logo sketch dataset.

2k Benchmarking +2

Boundary-to-Solution Mapping for Groundwater Flows in a Toth Basin

no code implementations28 Mar 2023 Jingwei Sun, Jun Li, Yonghong Hao, Cuiting Qi, Chunmei Ma, Huazhi Sun, Negash Begashaw, Gurcan Comet, Yi Sun, Qi Wang

In this paper, the authors propose a new approach to solving the groundwater flow equation in the Toth basin of arbitrary top and bottom topographies using deep learning.

Pi-ViMo: Physiology-inspired Robust Vital Sign Monitoring using mmWave Radars

no code implementations24 Mar 2023 Bo Zhang, Boyu Jiang, Rong Zheng, XiaoPing Zhang, Jun Li, Qiang Xu

In this paper, we address these limitations and present "Pi-ViMo", a non-contact Physiology-inspired Robust Vital Sign Monitoring system, using mmWave radars.

Template Matching

Frozen Language Model Helps ECG Zero-Shot Learning

no code implementations22 Mar 2023 Jun Li, Che Liu, Sibo Cheng, Rossella Arcucci, Shenda Hong

In downstream classification tasks, METS achieves around 10% improvement in performance without using any annotated data via zero-shot classification, compared to other supervised and SSL baselines that rely on annotated data.

Language Modeling Language Modelling +2

BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection

1 code implementation CVPR 2023 Lei Yang, Kaicheng Yu, Tao Tang, Jun Li, Kun Yuan, Li Wang, Xinyu Zhang, Peng Chen

In essence, instead of predicting the pixel-wise depth, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods.

3D Object Detection Autonomous Driving +1

O2CTA: Introducing Annotations from OCT to CCTA in Coronary Plaque Analysis

no code implementations11 Mar 2023 Jun Li, Kexin Li, Yafeng Zhou, S. Kevin Zhou

Therefore, it is clinically critical to introduce annotations of plaque tissue and lumen characteristics from OCT to paired CCTA scans, denoted as \textbf{the O2CTA problem} in this paper.

Non-aligned supervision for Real Image Dehazing

no code implementations8 Mar 2023 Junkai Fan, Fei Guo, Jianjun Qian, Xiang Li, Jun Li, Jian Yang

In particular, we explore a non-alignment scenario that a clear reference image, unaligned with the input hazy image, is utilized to supervise the dehazing network.

Image Dehazing

Amplitude-Varying Perturbation for Balancing Privacy and Utility in Federated Learning

no code implementations7 Mar 2023 Xin Yuan, Wei Ni, Ming Ding, Kang Wei, Jun Li, H. Vincent Poor

The contribution of the new DP mechanism to the convergence and accuracy of privacy-preserving FL is corroborated, compared to the state-of-the-art Gaussian noise mechanism with a persistent noise amplitude.

Federated Learning Privacy Preserving

EfficientFace: An Efficient Deep Network with Feature Enhancement for Accurate Face Detection

no code implementations23 Feb 2023 Guangtao Wang, Jun Li, Zhijian Wu, Jianhua Xu, Jifeng Shen, Wankou Yang

Besides, this is conducive to estimating the locations of faces and enhancing the descriptive power of face features.

Descriptive Face Detection

InOR-Net: Incremental 3D Object Recognition Network for Point Cloud Representation

no code implementations20 Feb 2023 Jiahua Dong, Yang Cong, Gan Sun, Lixu Wang, Lingjuan Lyu, Jun Li, Ender Konukoglu

Moreover, they cannot explore which 3D geometric characteristics are essential to alleviate the catastrophic forgetting on old classes of 3D objects.

3D Object Recognition Fairness

Recurrent Structure Attention Guidance for Depth Super-Resolution

no code implementations31 Jan 2023 Jiayi Yuan, Haobo Jiang, Xiang Li, Jianjun Qian, Jun Li, Jian Yang

Second, instead of the coarse concatenation guidance, we propose a recurrent structure attention block, which iteratively utilizes the latest depth estimation and the image features to jointly select clear patterns and boundaries, aiming at providing refined guidance for accurate depth recovery.

Depth Estimation Super-Resolution