Search Results for author: Xiao Wu

Found 29 papers, 8 papers with code

Neural Shrödinger Bridge Matching for Pansharpening

no code implementations17 Apr 2024 ZiHan Cao, Xiao Wu, Liang-Jian Deng

In this paper, we identify shortcomings in directly applying DPMs to the task of pansharpening as an inverse problem: 1) initiating sampling directly from Gaussian noise neglects the low-resolution multispectral image (LRMS) as a prior; 2) low sampling efficiency often necessitates a higher number of sampling steps.

Pansharpening

SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening

no code implementations17 Apr 2024 Yu Zhong, Xiao Wu, Liang-Jian Deng, ZiHan Cao

Pansharpening is a significant image fusion technique that merges the spatial content and spectral characteristics of remote sensing images to generate high-resolution multispectral images.

Denoising Image Generation +1

A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion

no code implementations14 Apr 2024 ZiHan Cao, Xiao Wu, Liang-Jian Deng, Yu Zhong

However, due to the nature of images different from casual language sequences, the limited state capacity of Mamba weakens its ability to model image information.

Pansharpening

Content-Adaptive Non-Local Convolution for Remote Sensing Pansharpening

2 code implementations11 Apr 2024 Yule Duan, Xiao Wu, Haoyu Deng, Liang-Jian Deng

In this paper, we introduce a so-called content-adaptive non-local convolution (CANConv), a novel method tailored for remote sensing image pansharpening.

Pansharpening

CMT: Cross Modulation Transformer with Hybrid Loss for Pansharpening

no code implementations1 Apr 2024 Wen-Jie Shu, Hong-Xia Dou, Rui Wen, Xiao Wu, Liang-Jian Deng

In response, we present the Cross Modulation Transformer (CMT), a pioneering method that modifies the attention mechanism.

Pansharpening

An FPGA-Based Accelerator Enabling Efficient Support for CNNs with Arbitrary Kernel Sizes

no code implementations22 Feb 2024 Miaoxin Wang, Xiao Wu, Jun Lin, Zhongfeng Wang

Particularly, it demonstrates efficient support for large-kernel CNNs, achieving throughputs of 169. 68 GOPS and 244. 55 GOPS for RepLKNet-31 and PyConvResNet-50, respectively, both of which are implemented on hardware for the first time.

Computational Efficiency

Improving Anomaly Segmentation with Multi-Granularity Cross-Domain Alignment

no code implementations16 Aug 2023 Ji Zhang, Xiao Wu, Zhi-Qi Cheng, Qi He, Wei Li

Anomaly segmentation plays a pivotal role in identifying atypical objects in images, crucial for hazard detection in autonomous driving systems.

Autonomous Driving Contrastive Learning

DDRF: Denoising Diffusion Model for Remote Sensing Image Fusion

no code implementations10 Apr 2023 ZiHan Cao, ShiQi Cao, Xiao Wu, JunMing Hou, Ran Ran, Liang-Jian Deng

Denosing diffusion model, as a generative model, has received a lot of attention in the field of image generation recently, thanks to its powerful generation capability.

Denoising Image-to-Image Translation +1

PPCR: Learning Pyramid Pixel Context Recalibration Module for Medical Image Classification

no code implementations3 Mar 2023 Xiaoqing Zhang, Zunjie Xiao, Xiao Wu, Jiansheng Fang, Junyong Shen, Yan Hu, Risa Higashita, Jiang Liu

Spatial attention mechanism has been widely incorporated into deep convolutional neural networks (CNNs) via long-range dependency capturing, significantly lifting the performance in computer vision, but it may perform poorly in medical imaging.

Decision Making Image Classification +1

U2Net: A General Framework with Spatial-Spectral-Integrated Double U-Net for Image Fusion

1 code implementation13 Dec 2022 Siran Peng, Chenhao Guo, Xiao Wu, Liang-Jian Deng

The U2Net utilizes a spatial U-Net and a spectral U-Net to extract spatial details and spectral characteristics, which allows for the discriminative and hierarchical learning of features from diverse images.

Hyperspectral Image Super-Resolution Image Super-Resolution +1

SIT: A Bionic and Non-Linear Neuron for Spiking Neural Network

no code implementations30 Mar 2022 Cheng Jin, Rui-Jie Zhu, Xiao Wu, Liang-Jian Deng

Spiking Neural Networks (SNNs) have piqued researchers' interest because of their capacity to process temporal information and low power consumption.

Image Classification

Solve routing problems with a residual edge-graph attention neural network

1 code implementation6 May 2021 Kun Lei, Peng Guo, Yi Wang, Xiao Wu, Wenchao Zhao

In this paper, an end-to-end deep reinforcement learning framework is proposed to solve this type of combinatorial optimization problems.

Combinatorial Optimization Graph Attention +1

Dynamic Cross Feature Fusion for Remote Sensing Pansharpening

no code implementations ICCV 2021 Xiao Wu, Ting-Zhu Huang, Liang-Jian Deng, Tian-Jing Zhang

In order to enhance the relationships of inter-branches, dynamic cross feature transfers are embedded into multiple branches to obtain high-resolution representations.

Pansharpening

Improving the Learning of Multi-column Convolutional Neural Network for Crowd Counting

no code implementations17 Sep 2019 Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Jun-Yan He, Alexander Hauptmann

By minimizing the mutual information, each column is guided to learn features with different image scales.

Crowd Counting

Learning Spatial Awareness to Improve Crowd Counting

no code implementations ICCV 2019 Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander Hauptmann

Although the Maximum Excess over SubArrays (MESA) loss has been previously proposed to address the above issues by finding the rectangular subregion whose predicted density map has the maximum difference from the ground truth, it cannot be solved by gradient descent, thus can hardly be integrated into the deep learning framework.

Crowd Counting Weakly-supervised Learning

Adversarial Multimodal Network for Movie Question Answering

no code implementations24 Jun 2019 Zhaoquan Yuan, Siyuan Sun, Lixin Duan, Xiao Wu, Changsheng Xu

In AMN, as inspired by generative adversarial networks, we propose to learn multimodal feature representations by finding a more coherent subspace for video clips and the corresponding texts (e. g., subtitles and questions).

Question Answering Video Question Answering +1

Optimizing Interim Analysis Timing for Bayesian Adaptive Commensurate Designs

1 code implementation17 May 2019 Xiao Wu, Yi Xu, Bradley P. Carlin

In developing products for rare diseases, statistical challenges arise due to the limited number of patients available for participation in drug trials and other clinical research.

Applications Computation Methodology

Matching on Generalized Propensity Scores with Continuous Exposures

1 code implementation17 Dec 2018 Xiao Wu, Fabrizia Mealli, Marianthi-Anna Kioumourtzoglou, Francesca Dominici, Danielle Braun

We apply our proposed method to estimate the average causal exposure-response function between long-term PM$_{2. 5}$ exposure and all-cause mortality among 68. 5 million Medicare enrollees, 2000-2016.

Methodology Applications

Perceiving Physical Equation by Observing Visual Scenarios

no code implementations29 Nov 2018 Siyu Huang, Zhi-Qi Cheng, Xi Li, Xiao Wu, Zhongfei Zhang, Alexander Hauptmann

To tackle this challenge, we present a novel pipeline comprised of an Observer Engine and a Physicist Engine by respectively imitating the actions of an observer and a physicist in the real world.

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

2 code implementations CVPR 2017 Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua

For the video side, deep visual features are extracted from detected object regions in each frame, and further fed into a Long Short-Term Memory (LSTM) framework for sequence modeling, which captures the temporal dynamics in videos.

On the Selection of Anchors and Targets for Video Hyperlinking

no code implementations14 Apr 2018 Zhi-Qi Cheng, Hao Zhang, Xiao Wu, Chong-Wah Ngo

A principle way of hyperlinking can be carried out by picking centers of clusters as anchors and from there reach out to targets within or outside of clusters with consideration of neighborhood complexity.

Causal inference in the context of an error prone exposure: air pollution and mortality

1 code implementation2 Dec 2017 Xiao Wu, Danielle Braun, Marianthi-Anna Kioumourtzoglou, Christine Choirat, Qian Di, Francesca Dominici

We propose a new approach for estimating causal effects when the exposure is measured with error and confounding adjustment is performed via a generalized propensity score (GPS).

Methodology Applications

Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search

no code implementations CVPR 2017 Bo Zhao, Jiashi Feng, Xiao Wu, Shuicheng Yan

We introduce a new fashion search protocol where attribute manipulation is allowed within the interaction between users and search engines, e. g. manipulating the color attribute of the clothing from red to blue.

Attribute Representation Learning

Multi-View Image Generation from a Single-View

no code implementations17 Apr 2017 Bo Zhao, Xiao Wu, Zhi-Qi Cheng, Hao liu, Zequn Jie, Jiashi Feng

This paper addresses a challenging problem -- how to generate multi-view cloth images from only a single view input.

Image Generation Variational Inference

Diversified Visual Attention Networks for Fine-Grained Object Classification

no code implementations28 Jun 2016 Bo Zhao, Xiao Wu, Jiashi Feng, Qiang Peng, Shuicheng Yan

Fine-grained object classification is a challenging task due to the subtle inter-class difference and large intra-class variation.

Classification General Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.