1 code implementation • 17 Apr 2025 • Xin Li, Kun Yuan, Bingchen Li, Fengbin Guan, Yizhen Shao, Zihao Yu, Xijun Wang, Yiting Lu, Wei Luo, Suhang Yao, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Yabin Zhang, Ao-Xiang Zhang, Tianwu Zhi, Jianzhao Liu, Yang Li, Jingwen Xu, Yiting Liao, Yushen Zuo, Mingyang Wu, Renjie Li, Shengyun Zhong, Zhengzhong Tu, Yufan Liu, Xiangguang Chen, Zuowei Cao, Minhao Tang, Shan Liu, Kexin Zhang, Jingfen Xie, Yan Wang, Kai Chen, Shijie Zhao, Yunchen Zhang, Xiangkai Xu, Hong Gao, Ji Shi, Yiming Bao, Xiugang Dong, Xiangsheng Zhou, Yaofeng Tu, Ying Liang, Yiwen Wang, Xinning Chai, Yuxuan Zhang, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Rong Xie, Li Song, Wei Sun, Kang Fu, Linhan Cao, Dandan Zhu, Kaiwei Zhang, Yucheng Zhu, ZiCheng Zhang, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Zhi Jin, Jiawei Wu, Wei Wang, Wenjian Zhang, Yuhai Lan, Gaoxiong Yi, Hengyuan Na, Wang Luo, Di wu, MingYin Bai, Jiawang Du, Zilong Lu, Zhenyu Jiang, Hui Zeng, Ziguan Cui, Zongliang Gan, Guijin Tang, Xinglin Xie, Kehuan Song, Xiaoqiang Lu, Licheng Jiao, Fang Liu, Xu Liu, Puhua Chen, Ha Thu Nguyen, Katrien De Moor, Seyed Ali Amirshahi, Mohamed-Chaker Larabi, Qi Tang, Linfeng He, Zhiyong Gao, Zixuan Gao, Guohua Zhang, Zhiye Huang, Yi Deng, Qingmiao Jiang, Lu Chen, Yi Yang, Xi Liao, Nourine Mohammed Nadir, YuXuan Jiang, Qiang Zhu, Siyue Teng, Fan Zhang, Shuyuan Zhu, Bing Zeng, David Bull, Meiqin Liu, Chao Yao, Yao Zhao
This paper presents a review of the NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement.
1 code implementation • 14 Apr 2025 • Yiwen Wang, Ying Liang, Yuxuan Zhang, Xinning Chai, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Rong Xie, Li Song
Due to the disparity between real-world degradations in user-generated content (UGC) images and synthetic degradations, traditional super-resolution methods struggle to generalize effectively, necessitating a more robust approach to model real-world distortions.
no code implementations • 15 Nov 2024 • Jingyi Cao, Xiangyi Chen, Bo Liu, Ming Ding, Rong Xie, Li Song, Zhu Li, Wenjun Zhang
The widespread use of image acquisition technologies, along with advances in facial recognition, has raised serious privacy concerns.
no code implementations • 4 Sep 2024 • Jun Ling, Yiwen Wang, Han Xue, Rong Xie, Li Song
First, we propose to generate poses from both audio and text prompts, where the audio offers short-term variations and rhythm correspondence of the head movements and the text prompts describe the long-term semantics of head motions.
1 code implementation • 23 Aug 2024 • Bing He, Yunuo Chen, Guo Lu, Qi Wang, Qunshan Gu, Rong Xie, Li Song, Wenjun Zhang
To address these challenges, we introduce a novel approach for streaming 4D real-world reconstruction utilizing discrete 3D control points.
no code implementations • 4 Jul 2024 • Yuhong Zhang, Hengsheng Zhang, Xinning Chai, Zhengxue Cheng, Rong Xie, Li Song, Wenjun Zhang
Image restoration is a classic low-level problem aimed at recovering high-quality images from low-quality images with various degradations such as blur, noise, rain, haze, etc.
no code implementations • 4 Jul 2024 • Yuhong Zhang, Hengsheng Zhang, Xinning Chai, Rong Xie, Li Song, Wenjun Zhang
In this work, we delve into the potential of utilizing pre-trained stable diffusion for image restoration and propose MRIR, a diffusion-based restoration method with multimodal insights.
no code implementations • 25 Apr 2024 • Han Wang, Xinning Chai, Yiwen Wang, Yuhong Zhang, Rong Xie, Li Song
Existing automatic colorization methods often fail to generate satisfactory results due to incorrect semantic colors and unsaturated colors.
no code implementations • 4 Mar 2024 • Shuai Guo, Qiuwen Wang, Yijie Gao, Rong Xie, Li Song
A point cloud is constructed for each input view, characterized within the voxel grid using matrices and vectors.
no code implementations • 8 Dec 2023 • Jionghao Wang, YuAn Liu, Zhiyang Dou, Zhengming Yu, Yongqing Liang, Cheng Lin, Xin Li, Wenping Wang, Rong Xie, Li Song
In this paper, we introduce a novel text-to-avatar generation method that separately generates the human body and the clothes and allows high-quality animation on the generated avatar.
1 code implementation • 29 Nov 2023 • Chen Zhu, Guo Lu, Bing He, Rong Xie, Li Song
To further enhance the reconstruction quality from the INR codec, we leverage the high-quality reconstructed frames from the explicit codec to achieve inter-view compensation.
1 code implementation • 28 Aug 2023 • Jionghao Wang, Ziyu Chen, Jun Ling, Rong Xie, Li Song
360$^\circ$ panoramas are extensively utilized as environmental light sources in computer graphics.
no code implementations • 20 Jul 2023 • Yanjun Wang, Qingping Sun, Wenjia Wang, Jun Ling, Zhongang Cai, Rong Xie, Li Song
Our method utilizes a dense correspondence map to separate visible human features and completes them on a structured dense human UV map with an attention-based feature completion module.
1 code implementation • CVPR 2023 • Yurong Zhang, Liulei Li, Wenguan Wang, Rong Xie, Li Song, Wenjun Zhang
Current top-leading solutions for video object segmentation (VOS) typically follow a matching-based regime: for each query frame, the segmentation mask is inferred according to its correspondence to previously processed and the first annotated frames.
no code implementations • ICCV 2023 • Yunqian Wen, Bo Liu, Jingyi Cao, Rong Xie, Li Song
To address these issues, we propose IDeudemon, which employs a "divide and conquer" strategy to protect identity and preserve utility step by step while maintaining good explainability.
no code implementations • 15 Oct 2022 • Hengsheng Zhang, Xueyi Zou, Jiaming Guo, Youliang Yan, Rong Xie, Li Song
In this paper, considering the characteristics of compressed videos, we propose a Codec Information Assisted Framework (CIAF) to boost and accelerate recurrent VSR models for compressed videos.
1 code implementation • 6 Sep 2022 • Han Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song
The temporal information is introduced by the temporal feature aggregation model (TFAM), which applies an attention mechanism between the context frames and the target frame (i.e., the frame to be detected).
Ranked #7 on Video Object Detection on ImageNet VID
no code implementations • 21 Apr 2022 • Anni Tang, Yan Huang, Jun Ling, ZhiYu Zhang, Yiwei Zhang, Rong Xie, Li Song
As the latest video coding standard, versatile video coding (VVC) has shown its ability in retaining pixel quality.
1 code implementation • CVPR 2021 • Jun Ling, Han Xue, Li Song, Rong Xie, Xiao Gu
To ensure the visual style consistency between the foreground and the background, in this paper, we treat image harmonization as a style transfer problem.
Ranked #4 on Image Harmonization on HAdobe5k (1024$\times$1024)
no code implementations • 2 Mar 2021 • Yunqian Wen, Li Song, Bo Liu, Ming Ding, Rong Xie
We propose IdentityDP, a face anonymization framework that combines a data-driven deep neural network with a differential privacy (DP) mechanism.
no code implementations • ICCV 2021 • Jingyi Cao, Bo Liu, Yunqian Wen, Rong Xie, Li Song
The popularization of intelligent devices including smartphones and surveillance cameras results in more serious privacy issues.
1 code implementation • ECCV 2020 • Jun Ling, Han Xue, Li Song, Shuhui Yang, Rong Xie, Xiao Gu
Previous methods edit an input image under the guidance of a discrete emotion label or absolute condition (e.g., facial action units) to possess the desired expression.
no code implementations • 20 Apr 2018 • Shiyu Ning, Hongteng Xu, Li Song, Rong Xie, Wenjun Zhang
Transferring a low-dynamic-range (LDR) image to a high-dynamic-range (HDR) image, which is the so-called inverse tone mapping (iTM), is an important imaging technique to improve visual effects of imaging devices.