Search Results for author: Rong Xie

Found 23 papers, 9 papers with code

NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results

1 code implementation17 Apr 2025 Xin Li, Kun Yuan, Bingchen Li, Fengbin Guan, Yizhen Shao, Zihao Yu, Xijun Wang, Yiting Lu, Wei Luo, Suhang Yao, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Yabin Zhang, Ao-Xiang Zhang, Tianwu Zhi, Jianzhao Liu, Yang Li, Jingwen Xu, Yiting Liao, Yushen Zuo, Mingyang Wu, Renjie Li, Shengyun Zhong, Zhengzhong Tu, Yufan Liu, Xiangguang Chen, Zuowei Cao, Minhao Tang, Shan Liu, Kexin Zhang, Jingfen Xie, Yan Wang, Kai Chen, Shijie Zhao, Yunchen Zhang, Xiangkai Xu, Hong Gao, Ji Shi, Yiming Bao, Xiugang Dong, Xiangsheng Zhou, Yaofeng Tu, Ying Liang, Yiwen Wang, Xinning Chai, Yuxuan Zhang, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Rong Xie, Li Song, Wei Sun, Kang Fu, Linhan Cao, Dandan Zhu, Kaiwei Zhang, Yucheng Zhu, ZiCheng Zhang, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Zhi Jin, Jiawei Wu, Wei Wang, Wenjian Zhang, Yuhai Lan, Gaoxiong Yi, Hengyuan Na, Wang Luo, Di wu, MingYin Bai, Jiawang Du, Zilong Lu, Zhenyu Jiang, Hui Zeng, Ziguan Cui, Zongliang Gan, Guijin Tang, Xinglin Xie, Kehuan Song, Xiaoqiang Lu, Licheng Jiao, Fang Liu, Xu Liu, Puhua Chen, Ha Thu Nguyen, Katrien De Moor, Seyed Ali Amirshahi, Mohamed-Chaker Larabi, Qi Tang, Linfeng He, Zhiyong Gao, Zixuan Gao, Guohua Zhang, Zhiye Huang, Yi Deng, Qingmiao Jiang, Lu Chen, Yi Yang, Xi Liao, Nourine Mohammed Nadir, YuXuan Jiang, Qiang Zhu, Siyue Teng, Fan Zhang, Shuyuan Zhu, Bing Zeng, David Bull, Meiqin Liu, Chao Yao, Yao Zhao

This paper presents a review for the NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement.

Form Image Super-Resolution +3

Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution

1 code implementation14 Apr 2025 Yiwen Wang, Ying Liang, Yuxuan Zhang, Xinning Chai, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Rong Xie, Li Song

Due to the disparity between real-world degradations in user-generated content(UGC) images and synthetic degradations, traditional super-resolution methods struggle to generalize effectively, necessitating a more robust approach to model real-world distortions.

Image Super-Resolution

Face De-identification: State-of-the-art Methods and Comparative Studies

no code implementations15 Nov 2024 Jingyi Cao, Xiangyi Chen, Bo Liu, Ming Ding, Rong Xie, Li Song, Zhu Li, Wenjun Zhang

The widespread use of image acquisition technologies, along with advances in facial recognition, has raised serious privacy concerns.

De-identification Survey

PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation

no code implementations4 Sep 2024 Jun Ling, Yiwen Wang, Han Xue, Rong Xie, Li Song

First, we propose to generate poses from both audio and text prompts, where the audio offers short-term variations and rhythm correspondence of the head movements and the text prompts describe the long-term semantics of head motions.

Pose Prediction Rhythm +1

S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points

1 code implementation23 Aug 2024 Bing He, Yunuo Chen, Guo Lu, Qi Wang, Qunshan Gu, Rong Xie, Li Song, Wenjun Zhang

To address these challenges, we introduce a novel approach for streaming 4D real-world reconstruction utilizing discrete 3D control points.

3D Reconstruction 4D reconstruction

Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration

no code implementations4 Jul 2024 Yuhong Zhang, Hengsheng Zhang, Xinning Chai, Zhengxue Cheng, Rong Xie, Li Song, Wenjun Zhang

Image restoration is a classic low-level problem aimed at recovering high-quality images from low-quality images with various degradations such as blur, noise, rain, haze, etc.

Decoder Image Restoration +1

MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration

no code implementations4 Jul 2024 Yuhong Zhang, Hengsheng Zhang, Xinning Chai, Rong Xie, Li Song, Wenjun Zhang

In this work, we delve into the potential of utilizing pre-trained stable diffusion for image restoration and propose MRIR, a diffusion-based restoration method with multimodal insights.

Denoising Image Restoration +3

Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior

no code implementations25 Apr 2024 Han Wang, Xinning Chai, Yiwen Wang, Yuhong Zhang, Rong Xie, Li Song

Existing automatic colorization methods often fail to generate satisfactory results due to incorrect semantic colors and unsaturated colors.

Colorization Decoder +1

Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views

no code implementations4 Mar 2024 Shuai Guo, Qiuwen Wang, Yijie Gao, Rong Xie, Li Song

A point cloud is constructed for each input view, characterized within the voxel grid using matrices and vectors.

Autonomous Driving NeRF +1

Disentangled Clothed Avatar Generation from Text Descriptions

no code implementations8 Dec 2023 Jionghao Wang, YuAn Liu, Zhiyang Dou, Zhengming Yu, Yongqing Liang, Cheng Lin, Xin Li, Wenping Wang, Rong Xie, Li Song

In this paper, we introduce a novel text-to-avatar generation method that separately generates the human body and the clothes and allows high-quality animation on the generated avatar.

Virtual Try-on

Implicit-explicit Integrated Representations for Multi-view Video Compression

1 code implementation29 Nov 2023 Chen Zhu, Guo Lu, Bing He, Rong Xie, Li Song

To further enhance the reconstruction quality from the INR codec, we leverage the high-quality reconstructed frames from the explicit codec to achieve inter-view compensation.

Video Compression

360-Degree Panorama Generation from Few Unregistered NFoV Images

1 code implementation28 Aug 2023 Jionghao Wang, Ziyu Chen, Jun Ling, Rong Xie, Li Song

360$^\circ$ panoramas are extensively utilized as environmental light sources in computer graphics.

Learning Dense UV Completion for Human Mesh Recovery

no code implementations20 Jul 2023 Yanjun Wang, Qingping Sun, Wenjia Wang, Jun Ling, Zhongang Cai, Rong Xie, Li Song

Our method utilizes a dense correspondence map to separate visible human features and completes human features on a structured UV map dense human with an attention-based feature completion module.

Human Mesh Recovery

Boosting Video Object Segmentation via Space-time Correspondence Learning

1 code implementation CVPR 2023 Yurong Zhang, Liulei Li, Wenguan Wang, Rong Xie, Li Song, Wenjun Zhang

Current top-leading solutions for video object segmentation (VOS) typically follow a matching-based regime: for each query frame, the segmentation mask is inferred according to its correspondence to previously processed and the first annotated frames.

Object Segmentation +3

Divide and Conquer: a Two-Step Method for High Quality Face De-identification with Model Explainability

no code implementations ICCV 2023 Yunqian Wen, Bo Liu, Jingyi Cao, Rong Xie, Li Song

To address these issues, we propose IDeudemon, which employs a "divide and conquer" strategy to protect identity and preserve utility step by step while maintaining good explainability.

De-identification NeRF

A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution

no code implementations15 Oct 2022 Hengsheng Zhang, Xueyi Zou, Jiaming Guo, Youliang Yan, Rong Xie, Li Song

In this paper, considering the characteristics of compressed videos, we propose a Codec Information Assisted Framework (CIAF) to boost and accelerate recurrent VSR models for compressed videos.

Motion Estimation Optical Flow Estimation +1

PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection

1 code implementation6 Sep 2022 Han Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song

The temporal information is introduced by the temporal feature aggregation model (TFAM), by conducting an attention mechanism between the context frames and the target frame (i. e., the frame to be detected).

object-detection Video Object Detection

Generative Compression for Face Video: A Hybrid Scheme

no code implementations21 Apr 2022 Anni Tang, Yan Huang, Jun Ling, ZhiYu Zhang, Yiwei Zhang, Rong Xie, Li Song

As the latest video coding standard, versatile video coding (VVC) has shown its ability in retaining pixel quality.

Region-aware Adaptive Instance Normalization for Image Harmonization

1 code implementation CVPR 2021 Jun Ling, Han Xue, Li Song, Rong Xie, Xiao Gu

To ensure the visual style consistency between the foreground and the background, in this paper, we treat image harmonization as a style transfer problem.

Image Harmonization Style Transfer

IdentityDP: Differential Private Identification Protection for Face Images

no code implementations2 Mar 2021 Yunqian Wen, Li Song, Bo Liu, Ming Ding, Rong Xie

We propose IdentityDP, a face anonymization framework that combines a data-driven deep neural network with a differential privacy (DP) mechanism.

De-identification Disentanglement +2

Personalized and Invertible Face De-Identification by Disentangled Identity Information Manipulation

no code implementations ICCV 2021 Jingyi Cao, Bo Liu, Yunqian Wen, Rong Xie, Li Song

The popularization of intelligent devices including smartphones and surveillance cameras results in more serious privacy issues.

De-identification

Toward Fine-grained Facial Expression Manipulation

1 code implementation ECCV 2020 Jun Ling, Han Xue, Li Song, Shuhui Yang, Rong Xie, Xiao Gu

Previous methods edit an input image under the guidance of a discrete emotion label or absolute condition (e. g., facial action units) to possess the desired expression.

Facial Expression Translation Image-to-Image Translation

Learning an Inverse Tone Mapping Network with a Generative Adversarial Regularizer

no code implementations20 Apr 2018 Shiyu Ning, Hongteng Xu, Li Song, Rong Xie, Wenjun Zhang

Transferring a low-dynamic-range (LDR) image to a high-dynamic-range (HDR) image, which is the so-called inverse tone mapping (iTM), is an important imaging technique to improve visual effects of imaging devices.

Tone Mapping

Cannot find the paper you are looking for? You can Submit a new open access paper.