no code implementations • 25 Feb 2025 • Weijun Gao, Chong Han, Zhi Chen, Yong Chen, Yuanzhi He, Wenjun Zhang
To achieve ubiquitous connectivity in next-generation networks through aerospace communications while maintaining high data rates, Terahertz (THz) band communications (0. 1-10 THz) with large continuous bandwidths are considered a promising candidate technology.
no code implementations • 25 Feb 2025 • Chong Han, Weijun Gao, Zhepu Yin, Chuang Yang, Mugen Peng, Wenjun Zhang
The transition from isolated systems to integrated solutions has driven the development of space-air-ground integrated networks (SAGIN) as well as the integration of communication and radar sensing functionalities.
1 code implementation • 23 Feb 2025 • Wenjun Zhang, Shakes Chandra, Aaron Nicolson
Medical Phrase Grounding (MPG) maps radiological findings described in medical reports to specific regions in medical images.
no code implementations • 22 Feb 2025 • Junhao Du, Chuqin Zhou, Ning Cao, Gang Chen, Yunuo Chen, Zhengxue Cheng, Li Song, Guo Lu, Wenjun Zhang
However, a significant challenge remains in bridging the gap between the textual prior knowledge within LLMs and lossless image compression.
no code implementations • 11 Feb 2025 • Mingkai Xu, Yongpeng Wu, Yuxuan Shi, Xiang-Gen Xia, Wenjun Zhang, Ping Zhang
A latent denoising semantic communication (SemCom) framework is proposed for robust image transmission over noisy channels.
no code implementations • 2 Feb 2025 • Yunuo Chen, Qian Li, Bing He, Donghui Feng, Ronghua Wu, Qi Wang, Li Song, Guo Lu, Wenjun Zhang
Transformers have achieved significant success in learned image compression (LIC), with Swin Transformers emerging as the mainstream choice for nonlinear transforms.
no code implementations • 21 Jan 2025 • Jethro Odeyemi, Wenjun Zhang
In this paper, we evaluate the performance of four randomized optimization algorithms: Randomized Hill Climbing (RHC), Simulated Annealing (SA), Genetic Algorithms (GA), and MIMIC (Mutual Information Maximizing Input Clustering), across three distinct types of problems: binary, permutation, and combinatorial.
no code implementations • 23 Dec 2024 • Shen Wang, Zhengxue Cheng, Donghui Feng, Guo Lu, Li Song, Wenjun Zhang
Learned image compression (LIC) methods often employ symmetrical encoder and decoder architectures, evitably increasing decoding time.
no code implementations • 16 Dec 2024 • Chuqin Zhou, Guo Lu, Jiangchuan Li, Xiangyu Chen, Zhengxue Cheng, Li Song, Wenjun Zhang
Neural image compression often faces a challenging trade-off among rate, distortion and perception.
no code implementations • 2 Dec 2024 • Zijian Chen, Tingzhu Chen, Wenjun Zhang, Guangtao Zhai
We introduce OBI-Bench, a holistic benchmark crafted to systematically evaluate large multi-modal models (LMMs) on whole-process oracle bone inscriptions (OBI) processing tasks demanding expert-level domain knowledge and deliberate cognition.
no code implementations • 15 Nov 2024 • Jingyi Cao, Xiangyi Chen, Bo Liu, Ming Ding, Rong Xie, Li Song, Zhu Li, Wenjun Zhang
The widespread use of image acquisition technologies, along with advances in facial recognition, has raised serious privacy concerns.
no code implementations • 11 Nov 2024 • Nan Xue, Yaping Sun, Zhiyong Chen, Meixia Tao, Xiaodong Xu, Liang Qian, Shuguang Cui, Wenjun Zhang, Ping Zhang
In this paper, we propose a wireless distributed Mixture of Experts (WDMoE) architecture to enable collaborative deployment of LLMs across edge servers at the base station (BS) and mobile devices in wireless networks.
1 code implementation • 25 Oct 2024 • Jianning Chi, Zelan Li, Huixuan Wu, Wenjun Zhang, Ying Huang
Weakly-supervised methods typically guided the pixel-wise training by comparing the predictions to single-level labels containing diverse segmentation-related information at once, but struggled to represent delicate feature differences between nodule and background regions and confused incorrect information, resulting in underfitting or overfitting in the segmentation predictions.
no code implementations • 16 Oct 2024 • Yujie Cao, Tong Wu, Zhiyong Chen, Yin Xu, Meixia Tao, Wenjun Zhang
In the MTML-RSC scheme, the source node broadcasts a signal using semantic communications, and the relay node forwards the signal to the destination.
no code implementations • 25 Sep 2024 • Tong Wu, Zhiyong Chen, Meixia Tao, Yaping Sun, Xiaodong Xu, Wenjun Zhang, Ping Zhang
Lightweight and efficient neural network models for deep joint source-channel coding (JSCC) are crucial for semantic communications.
no code implementations • 20 Sep 2024 • Cui Zhang, Wenjun Zhang, Qiong Wu, Pingyi Fan, Nan Cheng, Wen Chen, Khaled B. Letaief
The Internet of Vehicles (IoV) network can address the issue of limited computing resources and data processing capabilities of individual vehicles, but it also brings the risk of privacy leakage to vehicle users.
no code implementations • 9 Sep 2024 • Xiongkuo Min, Yixuan Gao, Yuqin Cao, Guangtao Zhai, Wenjun Zhang, Huifang Sun, Chang Wen Chen
RichIQA is characterized by two key novel designs: (1) a three-stage image quality prediction network which exploits the powerful feature representation capability of the Convolutional vision Transformer (CvT) and mimics the short-term and long-term memory mechanisms of human brain; (2) a multi-label training strategy in which rich subjective quality information like MOS, SOS and DOS are concurrently used to train the quality prediction network.
1 code implementation • 23 Aug 2024 • Bing He, Yunuo Chen, Guo Lu, Qi Wang, Qunshan Gu, Rong Xie, Li Song, Wenjun Zhang
To address these challenges, we introduce a novel approach for streaming 4D real-world reconstruction utilizing discrete 3D control points.
1 code implementation • 12 Aug 2024 • Jiameng Li, Yue Shi, JieZhang Cao, Bingbing Ni, Wenjun Zhang, Kai Zhang, Luc van Gool
3D Gaussian Splatting (3DGS) has attracted great attention in novel view synthesis because of its superior rendering efficiency and high fidelity.
no code implementations • 2 Aug 2024 • Zhixin Yan, Yan Zhong, Hongbin Xu, Wenjun Zhang, Shangru Yi, Lin Shu, Wenxiong Kang
Remote Photoplethysmography (rPPG) enables non-contact physiological signal extraction from facial videos, offering applications in psychological state analysis, medical assistance, and anti-face spoofing.
no code implementations • 11 Jul 2024 • Cui Zhang, Wenjun Zhang, Qiong Wu, Pingyi Fan, Qiang Fan, Jiangzhou Wang, Khaled B. Letaief
Federated Learning (FL) can protect the privacy of the vehicles in vehicle edge computing (VEC) to a certain extent through sharing the gradients of vehicles' local models instead of local data.
no code implementations • 4 Jul 2024 • Yuhong Zhang, Hengsheng Zhang, Xinning Chai, Rong Xie, Li Song, Wenjun Zhang
In this work, we delve into the potential of utilizing pre-trained stable diffusion for image restoration and propose MRIR, a diffusion-based restoration method with multimodal insights.
no code implementations • 4 Jul 2024 • Yuhong Zhang, Hengsheng Zhang, Xinning Chai, Zhengxue Cheng, Rong Xie, Li Song, Wenjun Zhang
Image restoration is a classic low-level problem aimed at recovering high-quality images from low-quality images with various degradations such as blur, noise, rain, haze, etc.
no code implementations • 15 Jun 2024 • Tong Wu, Zhiyong Chen, Meixia Tao, Bin Xia, Wenjun Zhang
In the proposed method, the transmitter extracts semantic features for two users separately.
1 code implementation • 10 Jun 2024 • Zijian Chen, Wei Sun, Yuan Tian, Jun Jia, ZiCheng Zhang, Jiarui Wang, Ru Huang, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang
Assessing action quality is both imperative and challenging due to its significant impact on the quality of AI-generated videos, further complicated by the inherently ambiguous nature of actions within AI-generated video (AIGV).
no code implementations • 30 May 2024 • Bingyan Xie, Yongpeng Wu, Yuxuan Shi, Wenjun Zhang, Shuguang Cui, Merouane Debbah
In this paper, we incorporate MIMO CSI into system designs from a new perspective and propose the learnable CSI fusion semantic communication (LCFSC) framework, where CSI is treated as side information by the semantic extractor to enhance the semantic coding.
no code implementations • 15 Apr 2024 • Han Xue, Qianru Sun, Li Song, Wenjun Zhang, Zhiwu Huang
Secondly, it standardizes the training of different tasks into a general in-context learning, where "in-context" means the input comprises an example input-output pair of the target task and a query image.
no code implementations • 31 Mar 2024 • Chen Peng, Zhiqin Qian, Kunyu Wang, Qi Luo, Zhuming Bi, Wenjun Zhang
In the study reported in this paper, based on the well-known hybridization principle, we proposed a method to combine CNN and Transformer to retain the strengths of both, and we applied this method to build a system called MugenNet for colonic polyp image segmentation.
2 code implementations • 26 Feb 2024 • Chunyi Li, Guo Lu, Donghui Feng, HaoNing Wu, ZiCheng Zhang, Xiaohong Liu, Guangtao Zhai, Weisi Lin, Wenjun Zhang
With the evolution of storage and communication protocols, ultra-low bitrate image compression has become a highly demanding topic.
no code implementations • 23 Jan 2024 • Yue Hu, Xianghe Pang, Xiaoqi Qin, Yonina C. Eldar, Siheng Chen, Ping Zhang, Wenjun Zhang
Following this strategy, we first formulate a mathematical optimization framework for the perception-communication trade-off and then propose PragComm, a multi-agent collaborative perception system with two key components: i) single-agent detection and tracking and ii) pragmatic collaboration.
1 code implementation • 9 Dec 2023 • Zijian Chen, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang
In this paper, we take the first step to benchmark and assess the visual naturalness of AI-generated images.
no code implementations • 30 Nov 2023 • Zijian Chen, Wei Sun, ZiCheng Zhang, Ru Huang, Fangfang Lu, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang
Banding artifact, as known as staircase-like contour, is a common quality annoyance that happens in compression, transmission, etc.
no code implementations • 26 Oct 2023 • Runnan Liu, Liang Liu, Yin Xu, Dazhi He, Wenjun Zhang, Chang Wen Chen
We first categorize two types of channel covariance matrix changes based on their impact on system design: Type I change, which denotes the change in the BS receive covariance matrix, and Type II change, which denotes the change in the IRS transmit/receive covariance matrix.
no code implementations • 7 Aug 2023 • Bingyan Xie, Yongpeng Wu, Yuxuan Shi, Derrick Wing Kwan Ng, Wenjun Zhang
Multi-node communication, which refers to the interaction among multiple devices, has attracted lots of attention in many Internet-of-Things (IoT) scenarios.
no code implementations • 28 Jul 2023 • Ziyang Zhou, Yingtie Lei, Xuhang Chen, Shenghong Luo, Wenjun Zhang, Chi-Man Pun, Zhen Wang
Shadows in scanned documents pose significant challenges for document analysis and recognition tasks due to their negative impact on visual quality and readability.
no code implementations • 24 Apr 2023 • Shunli Ren, Zixing Lei, Zi Wang, Mehrdad Dianati, Yafei Wang, Siheng Chen, Wenjun Zhang
To achieve comprehensive recovery, we design a communication-adaptive multi-scale spatial-temporal prediction model to extract multi-scale spatial-temporal features based on V2X communication conditions and capture the most significant information for the prediction of the missing information.
1 code implementation • CVPR 2023 • Yurong Zhang, Liulei Li, Wenguan Wang, Rong Xie, Li Song, Wenjun Zhang
Current top-leading solutions for video object segmentation (VOS) typically follow a matching-based regime: for each query frame, the segmentation mask is inferred according to its correspondence to previously processed and the first annotated frames.
1 code implementation • CVPR 2023 • Han Xue, Zhiwu Huang, Qianru Sun, Li Song, Wenjun Zhang
In this work, we explore the freestyle capability of the model, i. e., how far can it generate unseen semantics (e. g., classes, attributes, and styles) onto a given layout, and call the task Freestyle LIS (FLIS).
1 code implementation • CVPR 2023 • Yi Zhang, Xiaoyang Huang, Bingbing Ni, Teng Li, Wenjun Zhang
We develop an effective point cloud rendering pipeline for novel view synthesis, which enables high fidelity local detail reconstruction, real-time rendering and user-friendly editing.
no code implementations • 21 Feb 2023 • Yue Shi, Yuxuan Xiong, Jingyi Chai, Bingbing Ni, Wenjun Zhang
To address these issues, we propose an unsupervised separated 3D garments and human reconstruction model (USR), which reconstructs the human body and authentic textured clothes in layers without 3D models.
1 code implementation • 30 Jan 2023 • Xiaoyang Huang, Yanjun Wang, Yang Liu, Bingbing Ni, Wenjun Zhang, Jinxian Liu, Teng Li
To this end, we propose to achieve personalized spatial audio by reconstructing 3D human ears with single-view images.
1 code implementation • ICCV 2023 • Xiaoyang Huang, Yi Zhang, Kai Chen, Teng Li, Wenjun Zhang, Bingbing Ni
In this work, a novel regularization term named Implicit Convexity Regularization (ICR) imposed on implicit primitive learning is proposed to tackle this problem.
no code implementations • 5 Dec 2022 • Yue Shi, Dingyi Rong, Bingbing Ni, Chang Chen, Wenjun Zhang
To address these issues, we propose Geometry-Aware Generalized Neural Radiance Field (GARF) with a geometry-aware dynamic sampling (GADS) strategy to perform real-time novel view rendering and unsupervised depth estimation on unseen scenes without per-scene optimization.
1 code implementation • 27 Oct 2022 • Xiaoyang Huang, Yi Zhang, Bingbing Ni, Teng Li, Kai Chen, Wenjun Zhang
In this work, we focus on boosting the image quality of point clouds rendering with a compact model design.
no code implementations • 22 Aug 2022 • Shunli Ren, Siheng Chen, Wenjun Zhang
Perception is one of the crucial module of the autonomous driving system, which has made great progress recently.
1 code implementation • 18 Jul 2022 • Zixing Lei, Shunli Ren, Yue Hu, Wenjun Zhang, Siheng Chen
Collaborative perception has recently shown great potential to improve perception capabilities over single-agent perception.
no code implementations • 5 Jul 2022 • Runnan Liu, Liang Liu, Dazhi He, Wenjun Zhang, Erik G. Larsson
Our paper aims to adopt the classic change detection theory to detect the change in the channel covariance matrix as accurately and quickly as possible such that the new covariance matrix can be re-estimated in time.
no code implementations • 1 May 2022 • Jiling Cao, Jeong-Hoon Kim, Xi Li, Wenjun Zhang
In this paper, we derive closed-form formulas of first-order approximation for down-and-out barrier and floating strike lookback put option prices under a stochastic volatility model, by using an asymptotic approach.
1 code implementation • CVPR 2022 • Chenxin Xu, Weibo Mao, Wenjun Zhang, Siheng Chen
However, in this way, the model parameters come from all seen instances, which means a huge amount of irrelevant seen instances might also involve in predicting the current situation, disturbing the performance.
Ranked #7 on
Trajectory Prediction
on Stanford Drone
no code implementations • 22 Mar 2022 • Kai Song, Biqian Feng, Yongpeng Wu, Wenjun Zhang
Finally, with our work, a PLC network based on E-PMAC mechanism can be realized.
1 code implementation • ICLR 2022 • Xiaoyang Huang, Jiancheng Yang, Yanjun Wang, Ziyu Chen, Linguo Li, Teng Li, Bingbing Ni, Wenjun Zhang
In this study, we present Representation-Agnostic Shape Fields (RASF), a generalizable and computation-efficient shape embedding module for 3D deep learning.
no code implementations • 16 Mar 2022 • Zefan Li, Bingbing Ni, Teng Li, Wenjun Zhang, Wen Gao
GCGD consists of two plug-in modules: 1) inspired by the idea of gradient prediction, we propose a \textbf{GC-W} module for weight gradient correction; 2) based on Neural ODE, we propose a \textbf{GC-ODE} module for hidden states gradient correction.
no code implementations • 26 Feb 2022 • Boxiao Shen, Yongpeng Wu, Jianping An, Chengwen Xing, Lian Zhao, Wenjun Zhang
Next, by exploring the sparsity of channel in the delay-Doppler-angle domain, a two-dimensional pattern coupled hierarchical prior with the sparse Bayesian learning and covariance-free method (TDSBL-CF) is developed for the channel estimation.
2 code implementations • NeurIPS 2021 • Yiming Li, Shunli Ren, Pengxiang Wu, Siheng Chen, Chen Feng, Wenjun Zhang
Our approach is validated on V2X-Sim 1. 0, a large-scale multi-agent perception dataset that we synthesized using CARLA and SUMO co-simulation.
Ranked #3 on
3D Object Detection
on V2X-SIM
1 code implementation • 9 Sep 2021 • Runnan Liu, Liang Liu, Dazhi He, Wenjun Zhang, Erik G. Larsson
This result verifies the possibility to detect the channel covariance change both accurately and quickly in practice.
no code implementations • CVPR 2021 • Zefan Li, Chenxi Liu, Alan Yuille, Bingbing Ni, Wenjun Zhang, Wen Gao
For a given unsupervised task, we design multilevel tasks and define different learning stages for the deep network.
1 code implementation • CVPR 2021 • Linguo Li, Minsi Wang, Bingbing Ni, Hang Wang, Jiancheng Yang, Wenjun Zhang
In this work, we propose a Cross-view Contrastive Learning framework for unsupervised 3D skeleton-based action Representation (CrosSCLR), by leveraging multi-view complementary supervision signal.
5 code implementations • 29 Mar 2021 • Yuncong Li, Fang Wang, Wenjun Zhang, Sheng-hua Zhong, Cunxiang Yin, Yancheng He
Aspect Sentiment Triplet Extraction (ASTE) aims to extract aspect term, sentiment and opinion term triplets from sentences and tries to provide a complete solution for aspect-based sentiment analysis (ABSA).
Ranked #1 on
Aspect-Sentiment-Opinion Triplet Extraction
on Res14
Aspect-Based Sentiment Analysis
Aspect-Sentiment-Opinion Triplet Extraction
+3
no code implementations • ICCV 2021 • Zhenbo Yu, Junjie Wang, Jingwei Xu, Bingbing Ni, Chenglong Zhao, Minsi Wang, Wenjun Zhang
The challenges of the latter task are two folds: (1) pose failure (i. e., pose mismatching -- different skeleton definitions in dataset and SMPL , and pose ambiguity -- endpoints have arbitrary joint angle configurations for the same 3D joint coordinates).
no code implementations • ICCV 2021 • Zhenbo Yu, Bingbing Ni, Jingwei Xu, Junjie Wang, Chenglong Zhao, Wenjun Zhang
Furthermore, two temporal constraints are proposed to alleviate the scale and pose ambiguity respectively.
Monocular 3D Human Pose Estimation
Unsupervised 3D Human Pose Estimation
no code implementations • ICCV 2021 • Yue Shi, Bingbing Ni, Jinxian Liu, Dingyi Rong, Ye Qian, Wenjun Zhang
Pixel-to-mesh has wide applications, especially in virtual or augmented reality, animation and game industry.
no code implementations • 1 Jan 2021 • Minghao Xu, Hang Wang, Bingbing Ni, Wenjun Zhang, Jian Tang
We propose to disentangle graph structure and node attributes into two distinct sets of representations, and such disentanglement can be done in either the input or the embedding space.
1 code implementation • ECCV 2020 • Hang Wang, Minghao Xu, Bingbing Ni, Wenjun Zhang
Transferring knowledges learned from multiple source domains to target domain is a more practical and challenging task than conventional single-source domain adaptation.
Domain Adaptation
Multi-Source Unsupervised Domain Adaptation
1 code implementation • CVPR 2020 • Minghao Xu, Hang Wang, Bingbing Ni, Qi Tian, Wenjun Zhang
To mitigate these problems, we propose a Graph-induced Prototype Alignment (GPA) framework to seek for category-level domain alignment via elaborate prototype representations.
no code implementations • 12 Dec 2019 • Yucheng Zhu, Xiongkuo Min, Dandan Zhu, Ke Gu, Jiantao Zhou, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang
The saliency annotations of head and eye movements for both original and augmented videos are collected and together constitute the ARVR dataset.
1 code implementation • 4 Dec 2019 • Minghao Xu, Jian Zhang, Bingbing Ni, Teng Li, Chengjie Wang, Qi Tian, Wenjun Zhang
In this paper, we present adversarial domain adaptation with domain mixup (DM-ADA), which guarantees domain-invariance in a more continuous latent space and guides the domain discriminator in judging samples' difference relative to source and target domains.
no code implementations • 6 Apr 2019 • Muming Zhao, Jian Zhang, Chongyang Zhang, Wenjun Zhang
Towards this problem, in this paper we propose a constrained multi-stage Convolutional Neural Networks (CNNs) to jointly pursue locally consistent density map from two aspects.
1 code implementation • ECCV 2018 • Ji Zhu, Hua Yang, Nian Liu, Minyoung Kim, Wenjun Zhang, Ming-Hsuan Yang
In this paper, we propose an online Multi-Object Tracking (MOT) approach which integrates the merits of single object tracking and data association methods in a unified framework to handle noisy detections and frequent interactions between targets.
Ranked #5 on
Online Multi-Object Tracking
on MOT16
no code implementations • 20 Apr 2018 • Shiyu Ning, Hongteng Xu, Li Song, Rong Xie, Wenjun Zhang
Transferring a low-dynamic-range (LDR) image to a high-dynamic-range (HDR) image, which is the so-called inverse tone mapping (iTM), is an important imaging technique to improve visual effects of imaging devices.
no code implementations • ECCV 2018 • Yan Wang, Lingxi Xie, Siyuan Qiao, Ya zhang, Wenjun Zhang, Alan L. Yuille
Convolution is spatially-symmetric, i. e., the visual features are independent of its position in the image, which limits its ability to utilize contextual cues for visual recognition.
no code implementations • ICCV 2017 • Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, Wen Gao
Input binarization has shown to be an effective way for network acceleration.
no code implementations • ICCV 2017 • Yan Wang, Lingxi Xie, Chenxi Liu, Ya zhang, Wenjun Zhang, Alan Yuille
In this paper, we reveal the importance and benefits of introducing second-order operations into deep neural networks.
no code implementations • 3 Mar 2017 • Yan Wang, Lingxi Xie, Ya zhang, Wenjun Zhang, Alan Yuille
We formulate the function of a convolutional layer as learning a large visual vocabulary, and propose an alternative way, namely Deep Collaborative Learning (DCL), to reduce the computational complexity.
no code implementations • 30 Oct 2016 • Teh Raihana Nazirah Roslan, Wenjun Zhang, Jiling Cao
This paper considers the case of pricing discretely-sampled variance swaps under the class of equity-interest rate hybridization.
no code implementations • 30 Oct 2016 • Rory P. Bunker, Wenjun Zhang, M. Asif Naeem
In this paper, we investigate the extent to which features derived from bank statements provided by loan applicants, and which are not declared on an application form, can enhance a credit scoring model for a New Zealand lending company.