Search Results for author: Wenjun Zhang

Found 75 papers, 24 papers with code

Terahertz Aerospace Communications: Enabling Technologies and Future Directions

no code implementations25 Feb 2025 Weijun Gao, Chong Han, Zhi Chen, Yong Chen, Yuanzhi He, Wenjun Zhang

To achieve ubiquitous connectivity in next-generation networks through aerospace communications while maintaining high data rates, Terahertz (THz) band communications (0. 1-10 THz) with large continuous bandwidths are considered a promising candidate technology.

Joint Communication and Radar Sensing for Terahertz Space-Air-Ground Integrated Networks (SAGIN)

no code implementations25 Feb 2025 Chong Han, Weijun Gao, Zhepu Yin, Chuang Yang, Mugen Peng, Wenjun Zhang

The transition from isolated systems to integrated solutions has driven the development of space-air-ground integrated networks (SAGIN) as well as the integration of communication and radar sensing functionalities.

Anatomical grounding pre-training for medical phrase grounding

1 code implementation23 Feb 2025 Wenjun Zhang, Shakes Chandra, Aaron Nicolson

Medical Phrase Grounding (MPG) maps radiological findings described in medical reports to specific regions in medical images.

Phrase Grounding Zero-Shot Learning

Large Language Model for Lossless Image Compression with Visual Prompts

no code implementations22 Feb 2025 Junhao Du, Chuqin Zhou, Ning Cao, Gang Chen, Yunuo Chen, Zhengxue Cheng, Li Song, Guo Lu, Wenjun Zhang

However, a significant challenge remains in bridging the gap between the textual prior knowledge within LLMs and lossless image compression.

Image Compression Language Modeling +2

Learnable Residual-based Latent Denoising in Semantic Communication

no code implementations11 Feb 2025 Mingkai Xu, Yongpeng Wu, Yuxuan Shi, Xiang-Gen Xia, Wenjun Zhang, Ping Zhang

A latent denoising semantic communication (SemCom) framework is proposed for robust image transmission over noisy channels.

Denoising Semantic Communication

S2CFormer: Reorienting Learned Image Compression from Spatial Interaction to Channel Aggregation

no code implementations2 Feb 2025 Yunuo Chen, Qian Li, Bing He, Donghui Feng, Ronghua Wu, Qi Wang, Li Song, Guo Lu, Wenjun Zhang

Transformers have achieved significant success in learned image compression (LIC), with Swin Transformers emerging as the mainstream choice for nonlinear transforms.

Image Compression

Benchmarking Randomized Optimization Algorithms on Binary, Permutation, and Combinatorial Problem Landscapes

no code implementations21 Jan 2025 Jethro Odeyemi, Wenjun Zhang

In this paper, we evaluate the performance of four randomized optimization algorithms: Randomized Hill Climbing (RHC), Simulated Annealing (SA), Genetic Algorithms (GA), and MIMIC (Mutual Information Maximizing Input Clustering), across three distinct types of problems: binary, permutation, and combinatorial.

Benchmarking

AsymLLIC: Asymmetric Lightweight Learned Image Compression

no code implementations23 Dec 2024 Shen Wang, Zhengxue Cheng, Donghui Feng, Guo Lu, Li Song, Wenjun Zhang

Learned image compression (LIC) methods often employ symmetrical encoder and decoder architectures, evitably increasing decoding time.

Decoder Image Compression

OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones?

no code implementations2 Dec 2024 Zijian Chen, Tingzhu Chen, Wenjun Zhang, Guangtao Zhai

We introduce OBI-Bench, a holistic benchmark crafted to systematically evaluate large multi-modal models (LMMs) on whole-process oracle bone inscriptions (OBI) processing tasks demanding expert-level domain knowledge and deliberate cognition.

Face De-identification: State-of-the-art Methods and Comparative Studies

no code implementations15 Nov 2024 Jingyi Cao, Xiangyi Chen, Bo Liu, Ming Ding, Rong Xie, Li Song, Zhu Li, Wenjun Zhang

The widespread use of image acquisition technologies, along with advances in facial recognition, has raised serious privacy concerns.

De-identification Survey

WDMoE: Wireless Distributed Mixture of Experts for Large Language Models

no code implementations11 Nov 2024 Nan Xue, Yaping Sun, Zhiyong Chen, Meixia Tao, Xiaodong Xu, Liang Qian, Shuguang Cui, Wenjun Zhang, Ping Zhang

In this paper, we propose a wireless distributed Mixture of Experts (WDMoE) architecture to enable collaborative deployment of LLMs across edge servers at the base station (BS) and mobile devices in wireless networks.

Beyond Point Annotation: A Weakly Supervised Network Guided by Multi-Level Labels Generated from Four-Point Annotation for Thyroid Nodule Segmentation in Ultrasound Image

1 code implementation25 Oct 2024 Jianning Chi, Zelan Li, Huixuan Wu, Wenjun Zhang, Ying Huang

Weakly-supervised methods typically guided the pixel-wise training by comparing the predictions to single-level labels containing diverse segmentation-related information at once, but struggled to represent delicate feature differences between nodule and background regions and confused incorrect information, resulting in underfitting or overfitting in the segmentation predictions.

Segmentation

Two Birds with One Stone: Multi-Task Semantic Communications Systems over Relay Channel

no code implementations16 Oct 2024 Yujie Cao, Tong Wu, Zhiyong Chen, Yin Xu, Meixia Tao, Wenjun Zhang

In the MTML-RSC scheme, the source node broadcasts a signal using semantic communications, and the relay node forwards the signal to the destination.

Classification Image Reconstruction

MambaJSCC: Adaptive Deep Joint Source-Channel Coding with Generalized State Space Model

no code implementations25 Sep 2024 Tong Wu, Zhiyong Chen, Meixia Tao, Yaping Sun, Xiaodong Xu, Wenjun Zhang, Ping Zhang

Lightweight and efficient neural network models for deep joint source-channel coding (JSCC) are crucial for semantic communications.

Efficient Neural Network State Space Models

Blockchain-Enabled Variational Information Bottleneck for Data Extraction Based on Mutual Information in Internet of Vehicles

no code implementations20 Sep 2024 Cui Zhang, Wenjun Zhang, Qiong Wu, Pingyi Fan, Nan Cheng, Wen Chen, Khaled B. Letaief

The Internet of Vehicles (IoV) network can address the issue of limited computing resources and data processing capabilities of individual vehicles, but it also brings the risk of privacy leakage to vehicle users.

Data Compression Data Interaction

Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild

no code implementations9 Sep 2024 Xiongkuo Min, Yixuan Gao, Yuqin Cao, Guangtao Zhai, Wenjun Zhang, Huifang Sun, Chang Wen Chen

RichIQA is characterized by two key novel designs: (1) a three-stage image quality prediction network which exploits the powerful feature representation capability of the Convolutional vision Transformer (CvT) and mimics the short-term and long-term memory mechanisms of human brain; (2) a multi-label training strategy in which rich subjective quality information like MOS, SOS and DOS are concurrently used to train the quality prediction network.

Image Quality Assessment

S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points

1 code implementation23 Aug 2024 Bing He, Yunuo Chen, Guo Lu, Qi Wang, Qunshan Gu, Rong Xie, Li Song, Wenjun Zhang

To address these challenges, we introduce a novel approach for streaming 4D real-world reconstruction utilizing discrete 3D control points.

3D Reconstruction 4D reconstruction

Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering

1 code implementation12 Aug 2024 Jiameng Li, Yue Shi, JieZhang Cao, Bingbing Ni, Wenjun Zhang, Kai Zhang, Luc van Gool

3D Gaussian Splatting (3DGS) has attracted great attention in novel view synthesis because of its superior rendering efficiency and high fidelity.

3DGS NeRF +1

PhysMamba: State Space Duality Model for Remote Physiological Measurement

no code implementations2 Aug 2024 Zhixin Yan, Yan Zhong, Hongbin Xu, Wenjun Zhang, Shangru Yi, Lin Shu, Wenxiong Kang

Remote Photoplethysmography (rPPG) enables non-contact physiological signal extraction from facial videos, offering applications in psychological state analysis, medical assistance, and anti-face spoofing.

State Space Models

Distributed Deep Reinforcement Learning Based Gradient Quantization for Federated Learning Enabled Vehicle Edge Computing

no code implementations11 Jul 2024 Cui Zhang, Wenjun Zhang, Qiong Wu, Pingyi Fan, Qiang Fan, Jiangzhou Wang, Khaled B. Letaief

Federated Learning (FL) can protect the privacy of the vehicles in vehicle edge computing (VEC) to a certain extent through sharing the gradients of vehicles' local models instead of local data.

Deep Reinforcement Learning Edge-computing +2

MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration

no code implementations4 Jul 2024 Yuhong Zhang, Hengsheng Zhang, Xinning Chai, Rong Xie, Li Song, Wenjun Zhang

In this work, we delve into the potential of utilizing pre-trained stable diffusion for image restoration and propose MRIR, a diffusion-based restoration method with multimodal insights.

Denoising Image Restoration +3

Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration

no code implementations4 Jul 2024 Yuhong Zhang, Hengsheng Zhang, Xinning Chai, Zhengxue Cheng, Rong Xie, Li Song, Wenjun Zhang

Image restoration is a classic low-level problem aimed at recovering high-quality images from low-quality images with various degradations such as blur, noise, rain, haze, etc.

Decoder Image Restoration +1

GAIA: Rethinking Action Quality Assessment for AI-Generated Videos

1 code implementation10 Jun 2024 Zijian Chen, Wei Sun, Yuan Tian, Jun Jia, ZiCheng Zhang, Jiarui Wang, Ru Huang, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

Assessing action quality is both imperative and challenging due to its significant impact on the quality of AI-generated videos, further complicated by the inherently ambiguous nature of actions within AI-generated video (AIGV).

Action Quality Assessment

Robust Image Semantic Coding with Learnable CSI Fusion Masking over MIMO Fading Channels

no code implementations30 May 2024 Bingyan Xie, Yongpeng Wu, Yuxuan Shi, Wenjun Zhang, Shuguang Cui, Merouane Debbah

In this paper, we incorporate MIMO CSI into system designs from a new perspective and propose the learnable CSI fusion semantic communication (LCFSC) framework, where CSI is treated as side information by the semantic extractor to enhance the semantic coding.

Semantic Communication

In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation

no code implementations15 Apr 2024 Han Xue, Qianru Sun, Li Song, Wenjun Zhang, Zhiwu Huang

Secondly, it standardizes the training of different tasks into a general in-context learning, where "in-context" means the input comprises an example input-output pair of the target task and a query image.

Conditional Image Generation Denoising +5

MugenNet: A Novel Combined Convolution Neural Network and Transformer Network with its Application for Colonic Polyp Image Segmentation

no code implementations31 Mar 2024 Chen Peng, Zhiqin Qian, Kunyu Wang, Qi Luo, Zhuming Bi, Wenjun Zhang

In the study reported in this paper, based on the well-known hybridization principle, we proposed a method to combine CNN and Transformer to retain the strengths of both, and we applied this method to build a system called MugenNet for colonic polyp image segmentation.

Computational Efficiency Image Segmentation +2

MISC: Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model

2 code implementations26 Feb 2024 Chunyi Li, Guo Lu, Donghui Feng, HaoNing Wu, ZiCheng Zhang, Xiaohong Liu, Guangtao Zhai, Weisi Lin, Wenjun Zhang

With the evolution of storage and communication protocols, ultra-low bitrate image compression has become a highly demanding topic.

Decoder Image Compression +1

Pragmatic Communication in Multi-Agent Collaborative Perception

no code implementations23 Jan 2024 Yue Hu, Xianghe Pang, Xiaoqi Qin, Yonina C. Eldar, Siheng Chen, Ping Zhang, Wenjun Zhang

Following this strategy, we first formulate a mathematical optimization framework for the perception-communication trade-off and then propose PragComm, a multi-agent collaborative perception system with two key components: i) single-agent detection and tracking and ii) pragmatic collaboration.

3D Object Detection object-detection

Exploring the Naturalness of AI-Generated Images

1 code implementation9 Dec 2023 Zijian Chen, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhongpeng Ji, Fengyu Sun, Shangling Jui, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

In this paper, we take the first step to benchmark and assess the visual naturalness of AI-generated images.

FS-BAND: A Frequency-Sensitive Banding Detector

no code implementations30 Nov 2023 Zijian Chen, Wei Sun, ZiCheng Zhang, Ru Huang, Fangfang Lu, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

Banding artifact, as known as staircase-like contour, is a common quality annoyance that happens in compression, transmission, etc.

Image Quality Assessment

Detecting Abrupt Change of Channel Covariance Matrix in IRS-Assisted Communication

no code implementations26 Oct 2023 Runnan Liu, Liang Liu, Yin Xu, Dazhi He, Wenjun Zhang, Chang Wen Chen

We first categorize two types of channel covariance matrix changes based on their impact on system design: Type I change, which denotes the change in the BS receive covariance matrix, and Type II change, which denotes the change in the IRS transmit/receive covariance matrix.

Communication-Efficient Framework for Distributed Image Semantic Wireless Transmission

no code implementations7 Aug 2023 Bingyan Xie, Yongpeng Wu, Yuxuan Shi, Derrick Wing Kwan Ng, Wenjun Zhang

Multi-node communication, which refers to the interaction among multiple devices, has attracted lots of attention in many Internet-of-Things (IoT) scenarios.

Federated Learning Semantic Communication

DocDeshadower: Frequency-Aware Transformer for Document Shadow Removal

no code implementations28 Jul 2023 Ziyang Zhou, Yingtie Lei, Xuhang Chen, Shenghong Luo, Wenjun Zhang, Chi-Man Pun, Zhen Wang

Shadows in scanned documents pose significant challenges for document analysis and recognition tasks due to their negative impact on visual quality and readability.

Document Shadow Removal

Interruption-Aware Cooperative Perception for V2X Communication-Aided Autonomous Driving

no code implementations24 Apr 2023 Shunli Ren, Zixing Lei, Zi Wang, Mehrdad Dianati, Yafei Wang, Siheng Chen, Wenjun Zhang

To achieve comprehensive recovery, we design a communication-adaptive multi-scale spatial-temporal prediction model to extract multi-scale spatial-temporal features based on V2X communication conditions and capture the most significant information for the prediction of the missing information.

Autonomous Driving Knowledge Distillation

Boosting Video Object Segmentation via Space-time Correspondence Learning

1 code implementation CVPR 2023 Yurong Zhang, Liulei Li, Wenguan Wang, Rong Xie, Li Song, Wenjun Zhang

Current top-leading solutions for video object segmentation (VOS) typically follow a matching-based regime: for each query frame, the segmentation mask is inferred according to its correspondence to previously processed and the first annotated frames.

Object Segmentation +3

Freestyle Layout-to-Image Synthesis

1 code implementation CVPR 2023 Han Xue, Zhiwu Huang, Qianru Sun, Li Song, Wenjun Zhang

In this work, we explore the freestyle capability of the model, i. e., how far can it generate unseen semantics (e. g., classes, attributes, and styles) onto a given layout, and call the task Freestyle LIS (FLIS).

Image Classification Layout-to-Image Generation +2

Frequency-Modulated Point Cloud Rendering with Easy Editing

1 code implementation CVPR 2023 Yi Zhang, Xiaoyang Huang, Bingbing Ni, Teng Li, Wenjun Zhang

We develop an effective point cloud rendering pipeline for novel view synthesis, which enables high fidelity local detail reconstruction, real-time rendering and user-friendly editing.

NeRF Novel View Synthesis +1

USR: Unsupervised Separated 3D Garment and Human Reconstruction via Geometry and Semantic Consistency

no code implementations21 Feb 2023 Yue Shi, Yuxuan Xiong, Jingyi Chai, Bingbing Ni, Wenjun Zhang

To address these issues, we propose an unsupervised separated 3D garments and human reconstruction model (USR), which reconstructs the human body and authentic textured clothes in layers without 3D models.

3D geometry Virtual Try-on

AudioEar: Single-View Ear Reconstruction for Personalized Spatial Audio

1 code implementation30 Jan 2023 Xiaoyang Huang, Yanjun Wang, Yang Liu, Bingbing Ni, Wenjun Zhang, Jinxian Liu, Teng Li

To this end, we propose to achieve personalized spatial audio by reconstructing 3D human ears with single-view images.

Depth Estimation

Learning Shape Primitives via Implicit Convexity Regularization

1 code implementation ICCV 2023 Xiaoyang Huang, Yi Zhang, Kai Chen, Teng Li, Wenjun Zhang, Bingbing Ni

In this work, a novel regularization term named Implicit Convexity Regularization (ICR) imposed on implicit primitive learning is proposed to tackle this problem.

GARF:Geometry-Aware Generalized Neural Radiance Field

no code implementations5 Dec 2022 Yue Shi, Dingyi Rong, Bingbing Ni, Chang Chen, Wenjun Zhang

To address these issues, we propose Geometry-Aware Generalized Neural Radiance Field (GARF) with a geometry-aware dynamic sampling (GADS) strategy to perform real-time novel view rendering and unsupervised depth estimation on unseen scenes without per-scene optimization.

3D geometry Decoder +4

Boosting Point Clouds Rendering via Radiance Mapping

1 code implementation27 Oct 2022 Xiaoyang Huang, Yi Zhang, Bingbing Ni, Teng Li, Kai Chen, Wenjun Zhang

In this work, we focus on boosting the image quality of point clouds rendering with a compact model design.

NeRF

Collaborative Perception for Autonomous Driving: Current Status and Future Trend

no code implementations22 Aug 2022 Shunli Ren, Siheng Chen, Wenjun Zhang

Perception is one of the crucial module of the autonomous driving system, which has made great progress recently.

Autonomous Driving

Latency-Aware Collaborative Perception

1 code implementation18 Jul 2022 Zixing Lei, Shunli Ren, Yue Hu, Wenjun Zhang, Siheng Chen

Collaborative perception has recently shown great potential to improve perception capabilities over single-agent perception.

Autonomous Driving

Detecting Abrupt Changes in Channel Covariance Matrix for MIMO Communication

no code implementations5 Jul 2022 Runnan Liu, Liang Liu, Dazhi He, Wenjun Zhang, Erik G. Larsson

Our paper aims to adopt the classic change detection theory to detect the change in the channel covariance matrix as accurately and quickly as possible such that the new covariance matrix can be re-estimated in time.

Change Detection

Pricing Path-dependent Options under Stochastic Volatility via Mellin Transform

no code implementations1 May 2022 Jiling Cao, Jeong-Hoon Kim, Xi Li, Wenjun Zhang

In this paper, we derive closed-form formulas of first-order approximation for down-and-out barrier and floating strike lookback put option prices under a stochastic volatility model, by using an asymptotic approach.

Form

Remember Intentions: Retrospective-Memory-based Trajectory Prediction

1 code implementation CVPR 2022 Chenxin Xu, Weibo Mao, Wenjun Zhang, Siheng Chen

However, in this way, the model parameters come from all seen instances, which means a huge amount of irrelevant seen instances might also involve in predicting the current situation, disturbing the performance.

Prediction Trajectory Prediction

Enhanced Preamble Based MAC Mechanism for IIoT-oriented PLC Network

no code implementations22 Mar 2022 Kai Song, Biqian Feng, Yongpeng Wu, Wenjun Zhang

Finally, with our work, a PLC network based on E-PMAC mechanism can be realized.

Representation-Agnostic Shape Fields

1 code implementation ICLR 2022 Xiaoyang Huang, Jiancheng Yang, Yanjun Wang, Ziyu Chen, Linguo Li, Teng Li, Bingbing Ni, Wenjun Zhang

In this study, we present Representation-Agnostic Shape Fields (RASF), a generalizable and computation-efficient shape embedding module for 3D deep learning.

Deep Learning

Gradient Correction beyond Gradient Descent

no code implementations16 Mar 2022 Zefan Li, Bingbing Ni, Teng Li, Wenjun Zhang, Wen Gao

GCGD consists of two plug-in modules: 1) inspired by the idea of gradient prediction, we propose a \textbf{GC-W} module for weight gradient correction; 2) based on Neural ODE, we propose a \textbf{GC-ODE} module for hidden states gradient correction.

Random Access with Massive MIMO-OTFS in LEO Satellite Communications

no code implementations26 Feb 2022 Boxiao Shen, Yongpeng Wu, Jianping An, Chengwen Xing, Lian Zhao, Wenjun Zhang

Next, by exploring the sparsity of channel in the delay-Doppler-angle domain, a two-dimensional pattern coupled hierarchical prior with the sparse Bayesian learning and covariance-free method (TDSBL-CF) is developed for the channel estimation.

Action Detection Activity Detection

Learning Distilled Collaboration Graph for Multi-Agent Perception

2 code implementations NeurIPS 2021 Yiming Li, Shunli Ren, Pengxiang Wu, Siheng Chen, Chen Feng, Wenjun Zhang

Our approach is validated on V2X-Sim 1. 0, a large-scale multi-agent perception dataset that we synthesized using CARLA and SUMO co-simulation.

3D Object Detection Knowledge Distillation +1

Detection of Abrupt Change in Channel Covariance Matrix for Multi-Antenna Communication

1 code implementation9 Sep 2021 Runnan Liu, Liang Liu, Dazhi He, Wenjun Zhang, Erik G. Larsson

This result verifies the possibility to detect the channel covariance change both accurately and quickly in practice.

Change Detection

Progressive Stage-wise Learning for Unsupervised Feature Representation Enhancement

no code implementations CVPR 2021 Zefan Li, Chenxi Liu, Alan Yuille, Bingbing Ni, Wenjun Zhang, Wen Gao

For a given unsupervised task, we design multilevel tasks and define different learning stages for the deep network.

3D Human Action Representation Learning via Cross-View Consistency Pursuit

1 code implementation CVPR 2021 Linguo Li, Minsi Wang, Bingbing Ni, Hang Wang, Jiancheng Yang, Wenjun Zhang

In this work, we propose a Cross-view Contrastive Learning framework for unsupervised 3D skeleton-based action Representation (CrosSCLR), by leveraging multi-view complementary supervision signal.

Action Recognition Contrastive Learning +2

A More Fine-Grained Aspect-Sentiment-Opinion Triplet Extraction Task

5 code implementations29 Mar 2021 Yuncong Li, Fang Wang, Wenjun Zhang, Sheng-hua Zhong, Cunxiang Yin, Yancheng He

Aspect Sentiment Triplet Extraction (ASTE) aims to extract aspect term, sentiment and opinion term triplets from sentences and tries to provide a complete solution for aspect-based sentiment analysis (ABSA).

Aspect-Based Sentiment Analysis Aspect-Sentiment-Opinion Triplet Extraction +3

Skeleton2Mesh: Kinematics Prior Injected Unsupervised Human Mesh Recovery

no code implementations ICCV 2021 Zhenbo Yu, Junjie Wang, Jingwei Xu, Bingbing Ni, Chenglong Zhao, Minsi Wang, Wenjun Zhang

The challenges of the latter task are two folds: (1) pose failure (i. e., pose mismatching -- different skeleton definitions in dataset and SMPL , and pose ambiguity -- endpoints have arbitrary joint angle configurations for the same 3D joint coordinates).

3D Pose Estimation Human Mesh Recovery

Geometric Granularity Aware Pixel-To-Mesh

no code implementations ICCV 2021 Yue Shi, Bingbing Ni, Jinxian Liu, Dingyi Rong, Ye Qian, Wenjun Zhang

Pixel-to-mesh has wide applications, especially in virtual or augmented reality, animation and game industry.

GraphSAD: Learning Graph Representations with Structure-Attribute Disentanglement

no code implementations1 Jan 2021 Minghao Xu, Hang Wang, Bingbing Ni, Wenjun Zhang, Jian Tang

We propose to disentangle graph structure and node attributes into two distinct sets of representations, and such disentanglement can be done in either the input or the embedding space.

Attribute Disentanglement +1

Learning to Combine: Knowledge Aggregation for Multi-Source Domain Adaptation

1 code implementation ECCV 2020 Hang Wang, Minghao Xu, Bingbing Ni, Wenjun Zhang

Transferring knowledges learned from multiple source domains to target domain is a more practical and challenging task than conventional single-source domain adaptation.

Domain Adaptation Multi-Source Unsupervised Domain Adaptation

Cross-domain Detection via Graph-induced Prototype Alignment

1 code implementation CVPR 2020 Minghao Xu, Hang Wang, Bingbing Ni, Qi Tian, Wenjun Zhang

To mitigate these problems, we propose a Graph-induced Prototype Alignment (GPA) framework to seek for category-level domain alignment via elaborate prototype representations.

Domain Adaptation object-detection +1

Toward Better Understanding of Saliency Prediction in Augmented 360 Degree Videos

no code implementations12 Dec 2019 Yucheng Zhu, Xiongkuo Min, Dandan Zhu, Ke Gu, Jiantao Zhou, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang

The saliency annotations of head and eye movements for both original and augmented videos are collected and together constitute the ARVR dataset.

Object Recognition Optical Flow Estimation +1

Adversarial Domain Adaptation with Domain Mixup

1 code implementation4 Dec 2019 Minghao Xu, Jian Zhang, Bingbing Ni, Teng Li, Chengjie Wang, Qi Tian, Wenjun Zhang

In this paper, we present adversarial domain adaptation with domain mixup (DM-ADA), which guarantees domain-invariance in a more continuous latent space and guides the domain discriminator in judging samples' difference relative to source and target domains.

Domain Adaptation

Towards Locally Consistent Object Counting with Constrained Multi-stage Convolutional Neural Networks

no code implementations6 Apr 2019 Muming Zhao, Jian Zhang, Chongyang Zhang, Wenjun Zhang

Towards this problem, in this paper we propose a constrained multi-stage Convolutional Neural Networks (CNNs) to jointly pursue locally consistent density map from two aspects.

Object Object Counting

Online Multi-Object Tracking with Dual Matching Attention Networks

1 code implementation ECCV 2018 Ji Zhu, Hua Yang, Nian Liu, Minyoung Kim, Wenjun Zhang, Ming-Hsuan Yang

In this paper, we propose an online Multi-Object Tracking (MOT) approach which integrates the merits of single object tracking and data association methods in a unified framework to handle noisy detections and frequent interactions between targets.

Multi-Object Tracking Object +1

Learning an Inverse Tone Mapping Network with a Generative Adversarial Regularizer

no code implementations20 Apr 2018 Shiyu Ning, Hongteng Xu, Li Song, Rong Xie, Wenjun Zhang

Transferring a low-dynamic-range (LDR) image to a high-dynamic-range (HDR) image, which is the so-called inverse tone mapping (iTM), is an important imaging technique to improve visual effects of imaging devices.

Tone Mapping

Multi-Scale Spatially-Asymmetric Recalibration for Image Classification

no code implementations ECCV 2018 Yan Wang, Lingxi Xie, Siyuan Qiao, Ya zhang, Wenjun Zhang, Alan L. Yuille

Convolution is spatially-symmetric, i. e., the visual features are independent of its position in the image, which limits its ability to utilize contextual cues for visual recognition.

Classification General Classification +2

SORT: Second-Order Response Transform for Visual Recognition

no code implementations ICCV 2017 Yan Wang, Lingxi Xie, Chenxi Liu, Ya zhang, Wenjun Zhang, Alan Yuille

In this paper, we reveal the importance and benefits of introducing second-order operations into deep neural networks.

Deep Collaborative Learning for Visual Recognition

no code implementations3 Mar 2017 Yan Wang, Lingxi Xie, Ya zhang, Wenjun Zhang, Alan Yuille

We formulate the function of a convolutional layer as learning a large visual vocabulary, and propose an alternative way, namely Deep Collaborative Learning (DCL), to reduce the computational complexity.

General Classification Image Classification

Pricing variance swaps with stochastic volatility and stochastic interest rate under full correlation structure

no code implementations30 Oct 2016 Teh Raihana Nazirah Roslan, Wenjun Zhang, Jiling Cao

This paper considers the case of pricing discretely-sampled variance swaps under the class of equity-interest rate hybridization.

Improving a Credit Scoring Model by Incorporating Bank Statement Derived Features

no code implementations30 Oct 2016 Rory P. Bunker, Wenjun Zhang, M. Asif Naeem

In this paper, we investigate the extent to which features derived from bank statements provided by loan applicants, and which are not declared on an application form, can enhance a credit scoring model for a New Zealand lending company.

Form

Cannot find the paper you are looking for? You can Submit a new open access paper.