Search Results for author: Xuan Dong

Found 14 papers, 4 papers with code

Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration

no code implementations17 Dec 2024 Xinlong Cheng, Tiantian Cao, Guoan Cheng, BangXuan Huang, Xinghan Tian, Ye Wang, Xiaoyu He, Weixin Li, Tianfan Xue, Xuan Dong

In this work, we address the limitations of denoising diffusion models (DDMs) in image restoration tasks, particularly the shape and color distortions that can compromise image quality.

Denoising Image Generation +1

PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models

no code implementations12 Dec 2024 Chenyu Yang, Xuan Dong, Xizhou Zhu, Weijie Su, Jiahao Wang, Hao Tian, Zhe Chen, Wenhai Wang, Lewei Lu, Jifeng Dai

To this end, we extend each image into a "static" video and introduce a unified token compression strategy called Progressive Visual Token Compression (PVC), where the tokens of each frame are progressively encoded and adaptively compressed to supplement the information not extracted from previous frames.

Video Understanding

Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

1 code implementation11 Jun 2024 Chenyu Yang, Xizhou Zhu, Jinguo Zhu, Weijie Su, Junjie Wang, Xuan Dong, Wenhai Wang, Lewei Lu, Bin Li, Jie zhou, Yu Qiao, Jifeng Dai

Recently, vision model pre-training has evolved from relying on manually annotated datasets to leveraging large-scale, web-crawled image-text data.

Contrastive Learning

ReWiTe: Realistic Wide-angle and Telephoto Dual Camera Fusion Dataset via Beam Splitter Camera Rig

no code implementations16 Apr 2024 Chunli Peng, Xuan Dong, Tiantian Cao, Zhengqing Li, Kun Dong, Weixin Li

The fusion of images from dual camera systems featuring a wide-angle and a telephoto camera has become a hotspot problem recently.

View Transition based Dual Camera Image Fusion

no code implementations18 Dec 2023 Tiantian Cao, Xuan Dong, Chunli Peng, Zhengqing Li, Xinyu Guo, Weixin Li

Our insight is to minimize the occlusion area and thus maximize the use of pixels from $\bf{T}$ images.

CUR Transformer: A Convolutional Unbiased Regional Transformer for Image Denoising

1 code implementation journal 2023 Kang Xu, Weixin Li, Xia Wang, Xiaoyan Hu, Ke Yan, Xiaojie Wang, Xuan Dong

Based on the prior that, for each pixel, its similar pixels are usually spatially close, our insights are that (1) we partition the image into non-overlapped windows and perform regional self-attention to reduce the search range of each pixel, and (2) we encourage pixels across different windows to communicate with each other.

Image Denoising Jpeg Compression Artifact Reduction +1

MIEHDR CNN: Main Image Enhancement based Ghost-Free High Dynamic Range Imaging using Dual-Lens Systems

no code implementations AAAI Technical Track on Computer Vision I 2021 Xuan Dong, Xiaoyan Hu, Weixin Li, Xiaojie Wang;Yunhong Wang

In most of the related HDR imaging methods, the problem is usually solved by Multiple Images Merging, i. e. the final HDR image is fused from pixels of all the input LDR images.

Denoising Image Enhancement

A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals

no code implementations31 Jul 2020 Xuan Dong, Donald S. Williamson

The real-world capabilities of objective speech quality measures are limited since current measures (1) are developed from simulated data that does not adequately model real environments; or they (2) predict objective scores that are not always strongly correlated with subjective ratings.

Cycle-CNN for Colorization towards Real Monochrome-Color Camera Systems

1 code implementation AAAI Technical Track: Vision 2020 Xuan Dong, Weixin Li, Xiaojie Wang, Yunhong Wang

We present a new CNN model, named cycle CNN, which can directly use the real data from monochrome-color camera systems for training.

Colorization

Chain of Reasoning for Visual Question Answering

no code implementations NeurIPS 2018 Chenfei Wu, Jinlai Liu, Xiaojie Wang, Xuan Dong

A chain of reasoning (CoR) is constructed for supporting multi-step and dynamic reasoning on changed relations and objects.

Object Question Answering +3

Ground-truth dataset and baseline evaluations for image base-detail separation algorithms

no code implementations21 Nov 2015 Xuan Dong, Boyan Bonev, Weixin Li, Weichao Qiu, Xianjie Chen, Alan Yuille

Base-detail separation is a fundamental computer vision problem consisting of modeling a smooth base layer with the coarse structures, and a detail layer containing the texture-like structures.

Fidelity-Naturalness Evaluation of Single Image Super Resolution

no code implementations21 Nov 2015 Xuan Dong, Yu Zhu, Weixin Li, Lingxi Xie, Alex Wong, Alan Yuille

In this paper, we proposed to use both fidelity (the difference with original images) and naturalness (human visual perception of super resolved images) for evaluation.

Image Quality Assessment Image Super-Resolution

Cannot find the paper you are looking for? You can Submit a new open access paper.