Search Results for author: Mai Xu

Found 46 papers, 33 papers with code

MarsQE: Semantic-Informed Quality Enhancement for Compressed Martian Image

no code implementations15 Apr 2024 Chengfeng Liu, Mai Xu, Qunliang Xing, Xin Zou

Lossy image compression is essential for Mars exploration missions, due to the limited bandwidth between Earth and Mars.

Image Compression

Enhancing Quality of Compressed Images by Mitigating Enhancement Bias Towards Compression Domain

no code implementations27 Feb 2024 Qunliang Xing, Mai Xu, Shengxi Li, Xin Deng, Meisong Zheng, Huaida Liu, Ying Chen

However, these methods exhibit a pervasive enhancement bias towards the compression domain, inadvertently regarding it as more realistic than the raw domain.

Blind Multimodal Quality Assessment of Low-light Images

no code implementations18 Mar 2023 Miaohui Wang, Zhuowei Xu, Mai Xu, Weisi Lin

Qualitative and quantitative results on Dark-4K show that BMQA achieves superior performance to existing BIQA approaches as long as a pre-trained model is provided to generate text description.

4k Autonomous Driving +1

Neural Characteristic Function Learning for Conditional Image Generation

1 code implementation ICCV 2023 Shengxi Li, Jialu Zhang, Yifei Li, Mai Xu, Xin Deng, Li Li

The emergence of conditional generative adversarial networks (cGANs) has revolutionised the way we approach and control the generation, by means of adversarially learning joint distributions of data and auxiliary information.

Conditional Image Generation Generative Adversarial Network

PIRNet: Privacy-Preserving Image Restoration Network via Wavelet Lifting

no code implementations ICCV 2023 Xin Deng, Chao GAO, Mai Xu

In this paper, we propose a novel method namely PIRNet, which operates privacy-preserving image restoration in the steganographic domain.

Deblurring Image Denoising +3

DINN360: Deformable Invertible Neural Network for Latitude-Aware 360deg Image Rescaling

1 code implementation CVPR 2023 Yichen Guo, Mai Xu, Lai Jiang, Leonid Sigal, Yunjin Chen

To alleviate this issue, we propose the first attempt at 360deg image rescaling, which refers to downscaling a 360deg image to a visually valid low-resolution (LR) counterpart and then upscaling to a high-resolution (HR) 360deg image given the LR variant.

valid

DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus

1 code implementation20 Nov 2022 Qunliang Xing, Mai Xu, Xin Deng, Yichen Guo

Image defocus is inherent in the physics of image formation caused by the optical aberration of lenses, providing plentiful information on image quality.

Progressive Training of A Two-Stage Framework for Video Restoration

2 code implementations21 Apr 2022 Meisong Zheng, Qunliang Xing, Minglang Qiao, Mai Xu, Lai Jiang, Huaida Liu, Ying Chen

As a widely studied task, video restoration aims to enhance the quality of the videos with multiple potential degradations, such as noises, blurs and compression artifacts.

Transfer Learning Video Restoration +2

Does Text Attract Attention on E-Commerce Images: A Novel Saliency Prediction Dataset and Method

1 code implementation CVPR 2022 Lai Jiang, Yifei Li, Shengxi Li, Mai Xu, Se Lei, Yichen Guo, Bo Huang

E-commerce images are playing a central role in attracting people's attention when retailing and shopping online, and an accurate attention prediction is of significant importance for both customers and retailers, where its research is yet to start.

Multi-Task Learning Saliency Prediction +1

Blind VQA on 360° Video via Progressively Learning from Pixels, Frames and Video

1 code implementation18 Nov 2021 Li Yang, Mai Xu, Shengxi Li, Yichen Guo, Zulin Wang

When assessing the quality of 360{\textdegree} video, human tends to perceive its quality degradation from the viewport-based spatial distortion of each spherical frame to motion artifact across adjacent frames, ending with the video-level quality score, i. e., a progressive quality assessment paradigm.

Visual Question Answering (VQA)

Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos

1 code implementation5 Nov 2021 Minglang Qiao, Yufan Liu, Mai Xu, Xin Deng, Bing Li, Weiming Hu, Ali Borji

In this paper, we propose a multitask learning method for visual-audio saliency prediction and sound source localization on multi-face video by leveraging visual, audio and face information.

Saliency Prediction

Domain Adaptation for Underwater Image Enhancement

1 code implementation22 Aug 2021 Zhengyong Wang, Liquan Shen, Mei Yu, Kun Wang, Yufei Lin, Mai Xu

However, these methods ignore the significant domain gap between the synthetic and real data (i. e., interdomain gap), and thus the models trained on synthetic data often fail to generalize well to real underwater scenarios.

Domain Adaptation Image Enhancement

Patch-Wise Spatial-Temporal Quality Enhancement for HEVC Compressed Video

1 code implementation journal 2021 Qing Ding, Liquan Shen, Liangwei Yu, Hao Yang, Mai Xu

To overcome these limitations, we propose a patch-wise spatial-temporal quality enhancement network which firstly extracts spatial and temporal features, then recalibrates and fuses the obtained spatial and temporal features.

Quantization Video Enhancement

Saliency-Guided Image Translation

no code implementations CVPR 2021 Lai Jiang, Mai Xu, Xiaofei Wang, Leonid Sigal

In this paper, we propose a novel task for saliency-guided image translation, with the goal of image-to-image translation conditioned on the user specified saliency map.

Generative Adversarial Network Image-to-Image Translation +1

Deep Homography for Efficient Stereo Image Compression

1 code implementation CVPR 2021 Xin Deng, Wenzhe Yang, Ren Yang, Mai Xu, Enpeng Liu, Qianhan Feng, Radu Timofte

To fully explore the mutual information across two stereo images, we use a deep regression model to estimate the homography matrix, i. e., H matrix.

Image Compression

LAU-Net: Latitude Adaptive Upscaling Network for Omnidirectional Image Super-Resolution

no code implementations CVPR 2021 Xin Deng, Hao Wang, Mai Xu, Yichen Guo, Yuhang Song, Li Yang

In addition, we propose a deep reinforcement learning scheme with a latitude adaptive reward, in order to automatically select optimal upscaling factors for different latitude bands.

Image Super-Resolution

Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model

1 code implementation ECCV 2020 Yufan Liu, Minglang Qiao, Mai Xu, Bing Li, Weiming Hu, Ali Borji

Inspired by the findings of our investigation, we propose a novel multi-modal video saliency model consisting of three branches: visual, audio and face.

Saliency Prediction

Spatial Attention-based Non-reference Perceptual Quality Prediction Network for Omnidirectional Images

1 code implementation10 Mar 2021 Li Yang, Mai Xu, Deng Xin, Bo Feng

To alleviate this issue, we propose a spatial attention-based perceptual quality prediction network for non-reference quality assessment on ODIs (SAP-net).

Image Quality Assessment

Early Exit or Not: Resource-Efficient Blind Quality Enhancement for Compressed Images

1 code implementation ECCV 2020 Qunliang Xing, Mai Xu, Tianyi Li, Zhenyu Guan

Recently, extensive approaches have been proposed to reduce image compression artifacts at the decoder side; however, they require a series of architecture-identical models to process images with different quality, which are inefficient and resource-consuming.

Image Enhancement Image Restoration

DeepQTMT: A Deep Learning Approach for Fast QTMT-based CU Partition of Intra-mode VVC

1 code implementation23 Jun 2020 Tianyi Li, Mai Xu, Runzhi Tang, Ying Chen, Qunliang Xing

In VVC, the quad-tree plus multi-type tree (QTMT) structure of coding unit (CU) partition accounts for over 97% of the encoding time, due to the brute-force search for recursive rate-distortion (RD) optimization.

Removing Rain in Videos: A Large-scale Database and A Two-stream ConvLSTM Approach

no code implementations6 Jun 2019 Tie Liu, Mai Xu, Zulin Wang

In this paper, we establish a large-scale video database for rain removal (LasVR), which consists of 316 rain videos.

Rain Removal

Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

1 code implementation12 May 2019 Yuhang Song, Jianyi Wang, Thomas Lukasiewicz, Zhenghua Xu, Shangtong Zhang, Andrzej Wojcicki, Mai Xu

Intrinsic rewards were introduced to simulate how human intelligence works; they are usually evaluated by intrinsically-motivated play, i. e., playing games without extrinsic rewards but evaluated with extrinsic rewards.

Saliency Prediction on Omnidirectional Images with Generative Adversarial Imitation Learning

no code implementations15 Apr 2019 Mai Xu, Li Yang, Xiaoming Tao, Yiping Duan, Zulin Wang

According to these findings, our SalGAIL approach applies deep reinforcement learning (DRL) to predict the head fixations of one subject, in which GAIL learns the reward of DRL, rather than the traditional human-designed reward.

Imitation Learning Saliency Prediction

Attention Based Glaucoma Detection: A Large-scale Database and CNN Model

1 code implementation CVPR 2019 Liu Li, Mai Xu, Xiaofei Wang, Lai Jiang, Hanruo Liu

The attention maps of the ophthalmologists are also collected in LAG database through a simulated eye-tracking experiment.

Quality-Gated Convolutional LSTM for Enhancing Compressed Video

1 code implementation11 Mar 2019 Ren Yang, Xiaoyan Sun, Mai Xu, Wen-Jun Zeng

The past decade has witnessed great success in applying deep learning to enhance the quality of compressed video.

MFQE 2.0: A New Approach for Multi-frame Quality Enhancement on Compressed Video

1 code implementation26 Feb 2019 Qunliang Xing, Zhenyu Guan, Mai Xu, Ren Yang, Tie Liu, Zulin Wang

Finally, experiments validate the effectiveness and generalization ability of our MFQE approach in advancing the state-of-the-art quality enhancement of compressed video.

Video Enhancement Video Restoration

Diversity-Driven Extensible Hierarchical Reinforcement Learning

1 code implementation10 Nov 2018 Yuhang Song, Jianyi Wang, Thomas Lukasiewicz, Zhenghua Xu, Mai Xu

However, HRL with multiple levels is usually needed in many real-world scenarios, whose ultimate goals are highly abstract, while their actions are very primitive.

Hierarchical Reinforcement Learning reinforcement-learning +1

Understanding and Predicting the Memorability of Outdoor Natural Scenes

2 code implementations9 Oct 2018 Jiaxin Lu, Mai Xu, Ren Yang, Zulin Wang

In particular, we find that the high-level feature of scene category is rather correlated with outdoor natural scene memorability, and the deep features learnt by deep neural network (DNN) are also effective in predicting the memorability scores.

DeepVS: A Deep Learning Based Video Saliency Prediction Approach

1 code implementation ECCV 2018 Lai Jiang, Mai Xu, Tie Liu, Minglang Qiao, Zulin Wang

Hence, an object-to-motion convolutional neural network (OM-CNN) is developed to predict the intra-frame saliency for DeepVS, which is composed of the objectness and motion subnets.

Saliency Prediction Video Saliency Detection +1

What Makes Natural Scene Memorable?

no code implementations27 Aug 2018 Jiaxin Lu, Mai Xu, Ren Yang, Zulin Wang

Recent studies on image memorability have shed light on the visual features that make generic images, object images or face photographs memorable.

Bridge the Gap Between VQA and Human Behavior on Omnidirectional Video: A Large-Scale Dataset and a Deep Learning Model

1 code implementation29 Jul 2018 Chen Li, Mai Xu, Xinzhe Du, Zulin Wang

To fill in the gap between subjective quality and human behavior, this paper proposes a large-scale visual quality assessment (VQA) dataset of omnidirectional video, called VQA-OV, which collects 60 reference sequences and 540 impaired sequences.

Visual Question Answering (VQA)

Multi-Frame Quality Enhancement for Compressed Video

1 code implementation CVPR 2018 Ren Yang, Mai Xu, Zulin Wang, Tianyi Li

In this paper, we investigate that heavy quality fluctuation exists across compressed video frames, and thus low quality frames can be enhanced using the neighboring high quality frames, seen as Multi-Frame Quality Enhancement (MFQE).

Motion Compensation Video Enhancement

Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach

1 code implementation30 Oct 2017 Yuhang Song, Mai Xu, Jianyi Wang, Minglang Qiao, Liangyu Huo, Zulin Wang

Finally, the experiments validate that our approach is effective in both offline and online prediction of HM positions for panoramic video, and that the learned offline-DHP model can improve the performance of online-DHP.

Position reinforcement-learning +1

Enhancing Quality for HEVC Compressed Videos

no code implementations20 Sep 2017 Ren Yang, Mai Xu, Tie Liu, Zulin Wang, Zhenyu Guan

Our experimental results validate that our QE-CNN method is effective in enhancing quality for both I and P frames of HEVC videos.

Multimedia

Reducing Complexity of HEVC: A Deep Learning Approach

1 code implementation19 Sep 2017 Mai Xu, Tianyi Li, Zulin Wang, Xin Deng, Ren Yang, Zhenyu Guan

Therefore, this paper proposes a deep learning approach to predict the CU partition for reducing the HEVC complexity at both intra- and inter-modes, which is based on convolutional neural network (CNN) and long- and short-term memory (LSTM) network.

Predicting Video Saliency with Object-to-Motion CNN and Two-layer Convolutional LSTM

1 code implementation19 Sep 2017 Lai Jiang, Mai Xu, Zulin Wang

We further find from our database that there exists a temporal correlation of human attention with a smooth saliency transition across video frames.

Saliency Prediction Video Saliency Prediction

Predicting Salient Face in Multiple-Face Videos

1 code implementation CVPR 2017 Yufan Liu, Songyang Zhang, Mai Xu, Xuming He

On the other hand, we find that the attention of different subjects consistently focuses on a single face in each frame of videos involving multiple faces.

Saliency Prediction

Learning to Predict Saliency on Face Images

no code implementations ICCV 2015 Mai Xu, Yun Ren, Zulin Wang

For modeling attention on faces and facial features, the proposed method learns the Gaussian mixture model (GMM) distribution from the fixations of eye tracking data as the top-down features for saliency detection of face images.

Saliency Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.