Search Results for author: Zhenzhong Chen

Found 46 papers, 15 papers with code

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

1 code implementation • 17 Apr 2024 • Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie Zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, WangMeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, Huimin Zheng, JunHao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu

This paper reviews the NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment (S-UGC VQA), in which a variety of solutions were submitted and evaluated on KVQ, a dataset collected from a popular short-form video platform, i.e., the Kuaishou/Kwai platform.

Video Quality Assessment +1

Space-Time Video Super-resolution with Neural Operator

no code implementations • 9 Apr 2024 • Yuantong Zhang, Hanyou Zheng, Daiqin Yang, Zhenzhong Chen, Haichuan Ma, Wenpeng Ding

This paper addresses the task of space-time video super-resolution (ST-VSR).

Motion Compensation Motion Estimation +2

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control

no code implementations • 28 Mar 2024 • Binyuan Huang, Yuqing Wen, Yucheng Zhao, Yaosi Hu, Yingfei Liu, Fan Jia, Weixin Mao, Tiancai Wang, Chi Zhang, Chang Wen Chen, Zhenzhong Chen, Xiangyu Zhang

Autonomous driving progress relies on large-scale annotated datasets.

Autonomous Driving

Transferable Learned Image Compression-Resistant Adversarial Perturbations

no code implementations • 6 Jan 2024 • Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen

Adversarial attacks can readily disrupt the image classification system, revealing the vulnerability of DNN-based recognition tasks.

Adversarial Attack Autonomous Driving +4

Corner-to-Center Long-range Context Model for Efficient Learned Image Compression

no code implementations • 29 Nov 2023 • Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen

To tackle this issue, we conduct an in-depth analysis of the performance degradation observed in existing parallel context models, focusing on two aspects: the Quantity and Quality of information utilized for context prediction and decoding.

Image Compression

UnifiedSSR: A Unified Framework of Sequential Search and Recommendation

1 code implementation • 21 Oct 2023 • Jiayi Xie, Shang Liu, Gao Cong, Zhenzhong Chen

In this work, we propose a Unified framework of Sequential Search and Recommendation (UnifiedSSR) for joint learning of user behavior history in both search and recommendation scenarios.

Self-Supervised Learning

Learning Many-to-Many Mapping for Unpaired Real-World Image Super-resolution and Downscaling

no code implementations • 8 Oct 2023 • Wanjie Sun, Zhenzhong Chen

However, in this strategy the image degradation and SR models are trained separately, ignoring the inherent mutual dependency between downscaling and its inverse upscaling process.

Image Super-Resolution

JPEG Quantized Coefficient Recovery via DCT Domain Spatial-Frequential Transformer

no code implementations • 17 Aug 2023 • Mingyu Ouyang, Zhenzhong Chen

However, current DCT domain methods typically suffer from limited effectiveness in handling a wide range of compression quality factors, or fall short in recovering sparse quantized coefficients and the components across different color spaces.

JPEG Artifact Removal Quantization

Dynamic Kernel-Based Adaptive Spatial Aggregation for Learned Image Compression

no code implementations • 17 Aug 2023 • Huairui Wang, Nianxiang Fu, Zhenzhong Chen, Shan Liu

In this paper, we focus on extending spatial aggregation capability and propose a dynamic kernel-based transform coding.

Image Compression

Improving Generalization of Image Captioning with Unsupervised Prompt Learning

no code implementations • 5 Aug 2023 • Hongchen Wei, Zhenzhong Chen

By exploring the variable and invariant features in the original images and attribute-transferred images, attribute consistency constrains the attribute change direction of both images and sentences to learn domain-specific knowledge.

Attribute Image Captioning +2

Reconstruction Distortion of Learned Image Compression with Imperceptible Perturbations

no code implementations • 1 Jun 2023 • Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen

Learned Image Compression (LIC) has recently become the trending technique for image transmission due to its notable performance.

Image Compression Image Reconstruction

Learning a Single Convolutional Layer Model for Low Light Image Enhancement

no code implementations • 23 May 2023 • Yuantong Zhang, Baoxin Teng, Daiqin Yang, Zhenzhong Chen, Haichuan Ma, Gang Li, Wenpeng Ding

Low-light image enhancement (LLIE) aims to improve the illuminance of images degraded by insufficient light exposure.

Low-Light Image Enhancement

LaMD: Latent Motion Diffusion for Video Generation

no code implementations • 23 Apr 2023 • Yaosi Hu, Zhenzhong Chen, Chong Luo

We present a latent motion diffusion (LaMD) framework, which consists of a motion-decomposed video autoencoder and a diffusion-based motion generator, to implement this idea.

Video Generation Video Reconstruction

Continuous Space-Time Video Super-Resolution Utilizing Long-Range Temporal Information

no code implementations • 26 Feb 2023 • Yuantong Zhang, Daiqin Yang, Zhenzhong Chen, Wenpeng Ding

To address these problems, we propose a continuous ST-VSR (C-STVSR) method that can convert the given video to any frame rate and spatial resolution.

Optical Flow Estimation Space-time Video Super-resolution +1

Mutually-Regularized Dual Collaborative Variational Auto-encoder for Recommendation Systems

1 code implementation • 21 Nov 2022 • Yaochen Zhu, Zhenzhong Chen

However, since latent item variables are not modeled in UAE, it is difficult to utilize the widely available item content information when ratings are sparse.

Recommendation Systems

Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

2 code implementations • 7 Nov 2022 • Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, Dan Zhu, Mengdi Sun, Ran Duan, Yan Gao, Lingshun Kong, Long Sun, Xiang Li, Xingdong Zhang, Jiawei Zhang, Yaqi Wu, Jinshan Pan, Gaocheng Yu, Jin Zhang, Feng Zhang, Zhe Ma, Hongbin Wang, Hojin Cho, Steve Kim, Huaen Li, Yanbo Ma, Ziwei Luo, Youwei Li, Lei Yu, Zhihong Wen, Qi Wu, Haoqiang Fan, Shuaicheng Liu, Lize Zhang, Zhikai Zong, Jeremy Kwon, Junxi Zhang, Mengyuan Li, Nianxiang Fu, Guanchen Ding, Han Zhu, Zhenzhong Chen, Gen Li, Yuanfan Zhang, Lei Sun, Dafeng Zhang, Neo Yang, Fitz Liu, Jerry Zhao, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Shota Hirose, Kasidis Arunruangsirilert, Luo Ao, Ho Chun Leung, Andrew Wei, Jie Liu, Qiang Liu, Dahai Yu, Ao Li, Lei Luo, Ce Zhu, Seongmin Hong, Dongwon Park, Joonhee Lee, Byeong Hyun Lee, Seunggyu Lee, Se Young Chun, Ruiyuan He, Xuhao Jiang, Haihang Ruan, Xinjian Zhang, Jing Liu, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He

While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints.

Image Super-Resolution

Hierarchical Transformer with Spatio-Temporal Context Aggregation for Next Point-of-Interest Recommendation

1 code implementation • 4 Sep 2022 • Jiayi Xie, Zhenzhong Chen

The stacking of encoders captures the latent hierarchical structure of the check-in sequence, which is used to predict the next visiting POI.

Exploring Long- and Short-Range Temporal Information for Learned Video Compression

1 code implementation • 7 Aug 2022 • Huairui Wang, Zhenzhong Chen

Learned video compression methods have attracted considerable interest in the video coding community since they have matched or even exceeded the rate-distortion (RD) performance of traditional video codecs.

Motion Compensation Optical Flow Estimation +1

Learning Human Cognitive Appraisal Through Reinforcement Memory Unit

no code implementations • 6 Aug 2022 • Yaosi Hu, Zhenzhong Chen

We conceptualize the memory-enhancing mechanism as Reinforcement Memory Unit (RMU) that contains an appraisal state together with two positive and negative reinforcement memories.

Video Quality Assessment

Learning Knowledge Representation with Meta Knowledge Distillation for Single Image Super-Resolution

no code implementations • 18 Jul 2022 • Han Zhu, Zhenzhong Chen, Shan Liu

In addition, the KRNets are optimized in a meta-learning manner to ensure that both the knowledge transfer and the student learning improve the reconstruction quality of the student.

Image Super-Resolution Knowledge Distillation +1

Learned Video Compression via Heterogeneous Deformable Compensation Network

no code implementations • 11 Jul 2022 • Huairui Wang, Zhenzhong Chen, Chang Wen Chen

In this paper, we propose a learned video compression framework via a heterogeneous deformable compensation strategy (HDCVC) to tackle the problem of unstable compression performance caused by single-size deformable kernels in the downsampled feature domain.

Motion Compensation Optical Flow Estimation +1

Deep Deconfounded Content-based Tag Recommendation for UGC with Causal Intervention

1 code implementation • 28 May 2022 • Yaochen Zhu, Xubin Ren, Jing Yi, Zhenzhong Chen

We first establish a causal graph to represent the relations among uploader, UGC, and tag, where the uploaders are identified as confounders that spuriously correlate UGC and tag selections.

Recommendation Systems TAG

Multi-Auxiliary Augmented Collaborative Variational Auto-encoder for Tag Recommendation

no code implementations • 20 Apr 2022 • Jing Yi, Xubin Ren, Zhenzhong Chen

Recommending appropriate tags to items can facilitate content organization, retrieval, consumption and other applications, where hybrid tag recommender systems have been utilized to integrate collaborative information and content information for better recommendations.

Recommendation Systems Retrieval +1

Pyramid Feature Alignment Network for Video Deblurring

no code implementations • 28 Mar 2022 • Leitian Tao, Zhenzhong Chen

To better handle the challenges of complex and large motions, instead of aligning features at each scale separately, lower-scale motion information is used to guide the higher-scale motion estimation.

Deblurring Motion Estimation

Deep Causal Reasoning for Recommendations

1 code implementation • 6 Jan 2022 • Yaochen Zhu, Jing Yi, Jiayi Xie, Zhenzhong Chen

As with all observational studies, hidden confounders, which are factors that affect both item exposures and user ratings, lead to a systematic bias in the estimation.

Recommendation Systems Variational Inference

Object-Relation Reasoning Graph for Action Recognition

no code implementations • CVPR 2022 • Yangjun Ou, Li Mi, Zhenzhong Chen

By combining an object-level graph (OG) and a relation-level graph (RG), the proposed OR2G captures the attribute transitions of objects and reasons about the relationship transitions between objects simultaneously.

Action Recognition Attribute +3

Make It Move: Controllable Image-to-Video Generation with Text Descriptions

1 code implementation • CVPR 2022 • Yaosi Hu, Chong Luo, Zhenzhong Chen

With both controllable appearance and motion, TI2V aims at generating videos from a static image and a text description.

Image to Video Generation

Optical Flow Reusing for High-Efficiency Space-Time Video Super Resolution

no code implementations • 13 Oct 2021 • Yuantong Zhang, Huairui Wang, Han Zhu, Zhenzhong Chen

In this paper, we consider the task of space-time video super-resolution (ST-VSR), which can increase the spatial resolution and frame rate for a given video simultaneously.

Optical Flow Estimation Space-time Video Super-resolution +2

Cross-modal Variational Auto-encoder for Content-based Micro-video Background Music Recommendation

no code implementations • 15 Jul 2021 • Jing Yi, Yaochen Zhu, Jiayi Xie, Zhenzhong Chen

Moreover, the multimodal information is fused by the product-of-experts (PoE) principle, where the semantic information in the visual and textual modalities of the micro-video is weighted according to their variance estimations such that the modality with a lower noise level is given more weight.

Music Recommendation

Predicate correlation learning for scene graph generation

no code implementations • 6 Jul 2021 • Leitian Tao, Li Mi, Nannan Li, Xianhang Cheng, Yaosi Hu, Zhenzhong Chen

For a typical Scene Graph Generation (SGG) method, there is often a large gap in the performance of the predicates' head classes and tail classes.

Graph Generation Scene Graph Generation

Visual Relationship Forecasting in Videos

no code implementations • 2 Jul 2021 • Li Mi, Yangjun Ou, Zhenzhong Chen

To evaluate the VRF task, we introduce two video datasets named VRF-AG and VRF-VidOR, with a series of spatio-temporally localized visual relation annotations in a video.

Decision Making Object

Dual-Modality Vehicle Anomaly Detection via Bilateral Trajectory Tracing

1 code implementation • 9 Jun 2021 • Jingyuan Chen, Guanchen Ding, Yuchen Yang, Wenwei Han, Kangmin Xu, Tianyi Gao, Zhe Zhang, Wanping Ouyang, Hao Cai, Zhenzhong Chen

For the vehicle detection and tracking module, we adopted YOLOv5 and multi-scale tracking to localize the anomalies.

Anomaly Detection

Variational Bandwidth Auto-encoder for Hybrid Recommender Systems

1 code implementation • 17 May 2021 • Yaochen Zhu, Zhenzhong Chen

Moreover, by considering the fusion of collaborative and feature variables as a virtual communication channel from an information-theoretic perspective, we introduce a user-dependent channel to dynamically control the information allowed to be accessed from the feature embeddings.

Recommendation Systems

Towards Visual Distortion in Black-Box Attacks

1 code implementation • 21 Jul 2020 • Nannan Li, Zhenzhong Chen

Constructing adversarial examples in a black-box threat model injures the original images by introducing visual distortion.

Perceptual Distance

Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution

1 code implementation • 15 Jun 2020 • Xianhang Cheng, Zhenzhong Chen

During the learning process, different intermediate time steps can be involved as a control variable by means of an extension of the coord-conv trick, allowing the estimated components to vary with different input temporal information.

Motion Estimation Optical Flow Estimation +1

Towards Mesh Saliency Detection in 6 Degrees of Freedom

no code implementations • 27 May 2020 • Xiaoying Ding, Zhenzhong Chen

Traditional 3D mesh saliency detection algorithms and corresponding databases were proposed under several constraints such as providing limited viewing directions and not taking the subject's movement into consideration.

Saliency Detection

Predicting the Popularity of Micro-videos with Multimodal Variational Encoder-Decoder Framework

1 code implementation • 28 Mar 2020 • Yaochen Zhu, Jiayi Xie, Zhenzhong Chen

As an emerging type of user-generated content, micro-video drastically enriches people's entertainment experiences and social interactions.

Learning Compact Reward for Image Captioning

no code implementations • 24 Mar 2020 • Nannan Li, Zhenzhong Chen

Adversarial learning has shown promise in generating natural and diverse descriptions in image captioning.

Image Captioning Reinforcement Learning (RL) +1

Learned Image Downscaling for Upscaling using Content Adaptive Resampler

5 code implementations • 22 Jul 2019 • Wanjie Sun, Zhenzhong Chen

The proposed resampler network generates content adaptive image resampling kernels that are applied to the original HR input to generate pixels on the downscaled image.

Ranked #1 on Image Super-Resolution on DIV2K val - 2x upscaling (using extra training data)

Image Super-Resolution

Obj-GloVe: Scene-Based Contextual Object Embedding

no code implementations • 2 Jul 2019 • Canwen Xu, Zhenzhong Chen, Chenliang Li

Recently, with the prevalence of large-scale image datasets, the co-occurrence information among classes has become rich, calling for a new way to exploit it to facilitate inference.

Dimensionality Reduction Image Generation +3

A Review-Driven Neural Model for Sequential Recommendation

no code implementations • 1 Jul 2019 • Chenliang Li, Xichuan Niu, Xiangyang Luo, Zhenzhong Chen, Cong Quan

Given a sequence of items historically purchased by a user, we devise a novel hierarchical attention-over-attention mechanism to capture sequential patterns at both union-level and individual-level.

Collaborative Filtering Sequential Recommendation

Multi-Level Fusion Based 3D Object Detection From Monocular Images

no code implementations • CVPR 2018 • Bin Xu, Zhenzhong Chen

In this paper, we present an end-to-end deep learning based framework for 3D object detection from a single monocular image.

Ranked #12 on Vehicle Pose Estimation on KITTI Cars Hard

3D Object Detection 3D Object Detection From Monocular Images +4

Person Re-Identification With Cascaded Pairwise Convolutions

no code implementations • CVPR 2018 • Yicheng Wang, Zhenzhong Chen, Feng Wu, Gang Wang

In this paper, a novel deep architecture named BraidNet is proposed for person re-identification.

Person Re-Identification

Macroblock Classification Method for Video Applications Involving Motions

no code implementations • 28 Feb 2015 • Weiyao Lin, Ming-Ting Sun, Hongxiang Li, Zhenzhong Chen, Wei Li, Bing Zhou

We demonstrate that this low-computation-complexity method can efficiently capture the characteristics of the frame.

Change Detection Classification +2

Intra-and-Inter-Constraint-based Video Enhancement based on Piecewise Tone Mapping

no code implementations • 21 Feb 2015 • Yuanzhe Chen, Weiyao Lin, Chongyang Zhang, Zhenzhong Chen, Ning Xu, Jun Xie

In this paper, we propose a new intra-and-inter-constraint-based video enhancement approach aiming to 1) achieve high intra-frame quality of the entire picture where multiple regions of interest (ROIs) can be adaptively and simultaneously enhanced, and 2) guarantee inter-frame quality consistency among video frames.

Tone Mapping Video Enhancement

A Heat-Map-based Algorithm for Recognizing Group Activities in Videos

no code implementations • 21 Feb 2015 • Weiyao Lin, Hang Chu, Jianxin Wu, Bin Sheng, Zhenzhong Chen

In this paper, a new heat-map-based (HMB) algorithm is proposed for group activity recognition.

Group Activity Recognition
