1 code implementation • 17 Apr 2024 • Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, HaoNing Wu, ZiCheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei LI, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, WangMeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, huimin zheng, JunHao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu
This paper reviews the NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions were submitted and evaluated on the collected dataset KVQ from a popular short-form video platform, i.e., the Kuaishou/Kwai Platform.
no code implementations • 9 Apr 2024 • Yuantong Zhang, Hanyou Zheng, Daiqin Yang, Zhenzhong Chen, Haichuan Ma, Wenpeng Ding
This paper addresses the task of space-time video super-resolution (ST-VSR).
no code implementations • 28 Mar 2024 • Binyuan Huang, Yuqing Wen, Yucheng Zhao, Yaosi Hu, Yingfei Liu, Fan Jia, Weixin Mao, Tiancai Wang, Chi Zhang, Chang Wen Chen, Zhenzhong Chen, Xiangyu Zhang
Autonomous driving progress relies on large-scale annotated datasets.
no code implementations • 6 Jan 2024 • Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen
Adversarial attacks can readily disrupt the image classification system, revealing the vulnerability of DNN-based recognition tasks.
no code implementations • 29 Nov 2023 • Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen
To tackle this issue, we conduct an in-depth analysis of the performance degradation observed in existing parallel context models, focusing on two aspects: the Quantity and Quality of information utilized for context prediction and decoding.
1 code implementation • 21 Oct 2023 • Jiayi Xie, Shang Liu, Gao Cong, Zhenzhong Chen
In this work, we propose a Unified framework of Sequential Search and Recommendation (UnifiedSSR) for joint learning of user behavior history in both search and recommendation scenarios.
no code implementations • 8 Oct 2023 • Wanjie Sun, Zhenzhong Chen
However, in this strategy the image degradation and SR models are trained separately, ignoring the inherent mutual dependency between downscaling and its inverse upscaling process.
no code implementations • 17 Aug 2023 • Mingyu Ouyang, Zhenzhong Chen
However, current DCT-domain methods typically suffer from limited effectiveness in handling a wide range of compression quality factors, or fall short in recovering sparse quantized coefficients and the components across different color spaces.
no code implementations • 17 Aug 2023 • Huairui Wang, Nianxiang Fu, Zhenzhong Chen, Shan Liu
In this paper, we focus on extending spatial aggregation capability and propose a dynamic kernel-based transform coding.
no code implementations • 5 Aug 2023 • Hongchen Wei, Zhenzhong Chen
By exploring the variable and invariant features in the original images and attribute-transferred images, attribute consistency constrains the attribute change direction of both images and sentences to learn domain-specific knowledge.
no code implementations • 1 Jun 2023 • Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen
Learned Image Compression (LIC) has recently become the trending technique for image transmission due to its notable performance.
no code implementations • 23 May 2023 • Yuantong Zhang, Baoxin Teng, Daiqin Yang, Zhenzhong Chen, Haichuan Ma, Gang Li, Wenpeng Ding
Low-light image enhancement (LLIE) aims to improve the illuminance of images degraded by insufficient light exposure.
no code implementations • 23 Apr 2023 • Yaosi Hu, Zhenzhong Chen, Chong Luo
We present a latent motion diffusion (LaMD) framework, which consists of a motion-decomposed video autoencoder and a diffusion-based motion generator, to implement this idea.
no code implementations • 26 Feb 2023 • Yuantong Zhang, Daiqin Yang, Zhenzhong Chen, Wenpeng Ding
To address these problems, we propose a continuous ST-VSR (C-STVSR) method that can convert the given video to any frame rate and spatial resolution.
Optical Flow Estimation • Space-time Video Super-resolution
1 code implementation • 21 Nov 2022 • Yaochen Zhu, Zhenzhong Chen
However, since latent item variables are not modeled in UAE, it is difficult to utilize the widely available item content information when ratings are sparse.
2 code implementations • 7 Nov 2022 • Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, Jingang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, Jinwoo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li, Dan Zhu, Mengdi Sun, Ran Duan, Yan Gao, Lingshun Kong, Long Sun, Xiang Li, Xingdong Zhang, Jiawei Zhang, Yaqi Wu, Jinshan Pan, Gaocheng Yu, Jin Zhang, Feng Zhang, Zhe Ma, Hongbin Wang, Hojin Cho, Steve Kim, Huaen Li, Yanbo Ma, Ziwei Luo, Youwei Li, Lei Yu, Zhihong Wen, Qi Wu, Haoqiang Fan, Shuaicheng Liu, Lize Zhang, Zhikai Zong, Jeremy Kwon, Junxi Zhang, Mengyuan Li, Nianxiang Fu, Guanchen Ding, Han Zhu, Zhenzhong Chen, Gen Li, Yuanfan Zhang, Lei Sun, Dafeng Zhang, Neo Yang, Fitz Liu, Jerry Zhao, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Shota Hirose, Kasidis Arunruangsirilert, Luo Ao, Ho Chun Leung, Andrew Wei, Jie Liu, Qiang Liu, Dahai Yu, Ao Li, Lei Luo, Ce Zhu, Seongmin Hong, Dongwon Park, Joonhee Lee, Byeong Hyun Lee, Seunggyu Lee, Se Young Chun, Ruiyuan He, Xuhao Jiang, Haihang Ruan, Xinjian Zhang, Jing Liu, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He
While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints.
1 code implementation • 4 Sep 2022 • Jiayi Xie, Zhenzhong Chen
The stacking of encoders captures the latent hierarchical structure of the check-in sequence, which is used to predict the next visiting POI.
1 code implementation • 7 Aug 2022 • Huairui Wang, Zhenzhong Chen
Learned video compression methods have attracted considerable interest in the video coding community since they have matched or even exceeded the rate-distortion (RD) performance of traditional video codecs.
no code implementations • 6 Aug 2022 • Yaosi Hu, Zhenzhong Chen
We conceptualize the memory-enhancing mechanism as Reinforcement Memory Unit (RMU) that contains an appraisal state together with two positive and negative reinforcement memories.
no code implementations • 18 Jul 2022 • Han Zhu, Zhenzhong Chen, Shan Liu
In addition, the KRNets are optimized in a meta-learning manner to ensure that knowledge transfer and student learning both improve the reconstruction quality of the student.
no code implementations • 11 Jul 2022 • Huairui Wang, Zhenzhong Chen, Chang Wen Chen
In this paper, we propose a learned video compression framework via a heterogeneous deformable compensation strategy (HDCVC) to tackle the problem of unstable compression performance caused by single-size deformable kernels in the downsampled feature domain.
1 code implementation • 28 May 2022 • Yaochen Zhu, Xubin Ren, Jing Yi, Zhenzhong Chen
We first establish a causal graph to represent the relations among uploader, UGC, and tag, where the uploaders are identified as confounders that spuriously correlate UGC and tag selections.
no code implementations • 20 Apr 2022 • Jing Yi, Xubin Ren, Zhenzhong Chen
Recommending appropriate tags to items can facilitate content organization, retrieval, consumption and other applications, where hybrid tag recommender systems have been utilized to integrate collaborative information and content information for better recommendations.
no code implementations • 28 Mar 2022 • Leitian Tao, Zhenzhong Chen
To better handle the challenges of complex and large motions, instead of aligning features at each scale separately, lower-scale motion information is used to guide the higher-scale motion estimation.
1 code implementation • 6 Jan 2022 • Yaochen Zhu, Jing Yi, Jiayi Xie, Zhenzhong Chen
As with all observational studies, hidden confounders, which are factors that affect both item exposures and user ratings, lead to a systematic bias in the estimation.
no code implementations • CVPR 2022 • Yangjun Ou, Li Mi, Zhenzhong Chen
By combining an object-level graph (OG) and a relation-level graph (RG), the proposed OR2G captures the attribute transitions of objects and reasons about the relationship transitions between objects simultaneously.
1 code implementation • CVPR 2022 • Yaosi Hu, Chong Luo, Zhenzhong Chen
With both controllable appearance and motion, TI2V aims at generating videos from a static image and a text description.
no code implementations • 13 Oct 2021 • Yuantong Zhang, Huairui Wang, Han Zhu, Zhenzhong Chen
In this paper, we consider the task of space-time video super-resolution (ST-VSR), which can increase the spatial resolution and frame rate for a given video simultaneously.
Optical Flow Estimation • Space-time Video Super-resolution
no code implementations • 15 Jul 2021 • Jing Yi, Yaochen Zhu, Jiayi Xie, Zhenzhong Chen
Moreover, the multimodal information is fused by the product-of-experts (PoE) principle, where the semantic information in the visual and textual modalities of the micro-video is weighted according to its variance estimates, such that the modality with a lower noise level is given more weight.
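The variance-based weighting in the entry above can be illustrated with the standard closed form for a product of Gaussian experts, where each modality contributes in proportion to its precision (inverse variance). This is a minimal sketch under that assumption, not the authors' implementation: the function name `poe_fuse` and the scalar example values are hypothetical, and the paper's model works on learned latent vectors rather than single numbers.

```python
from typing import Sequence, Tuple


def poe_fuse(means: Sequence[float], variances: Sequence[float]) -> Tuple[float, float]:
    """Fuse 1-D Gaussian experts N(mean_i, var_i) with a product of experts.

    Hypothetical helper for illustration. The product of Gaussians is again
    Gaussian, with precision equal to the sum of the expert precisions and a
    precision-weighted mean.
    """
    if len(means) != len(variances) or not means:
        raise ValueError("means and variances must be non-empty and equal length")
    if any(v <= 0 for v in variances):
        raise ValueError("variances must be positive")

    precisions = [1.0 / v for v in variances]        # low variance -> high weight
    total_precision = sum(precisions)
    fused_mean = sum(m * p for m, p in zip(means, precisions)) / total_precision
    fused_var = 1.0 / total_precision                # never larger than any expert's variance
    return fused_mean, fused_var


# Assumed toy values: a noisy "visual" expert and a more confident "textual" expert.
visual_mean, visual_var = 2.0, 4.0
textual_mean, textual_var = 0.0, 1.0

fused_mean, fused_var = poe_fuse([visual_mean, textual_mean], [visual_var, textual_var])
print(fused_mean, fused_var)
```

In this toy case the fused mean lies closer to the textual expert because its variance is lower, which is the behavior the snippet describes: the less noisy modality receives more weight.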
no code implementations • 6 Jul 2021 • Leitian Tao, Li Mi, Nannan Li, Xianhang Cheng, Yaosi Hu, Zhenzhong Chen
For a typical Scene Graph Generation (SGG) method, there is often a large gap in the performance of the predicates' head classes and tail classes.
no code implementations • 2 Jul 2021 • Li Mi, Yangjun Ou, Zhenzhong Chen
To evaluate the VRF task, we introduce two video datasets named VRF-AG and VRF-VidOR, with a series of spatio-temporally localized visual relation annotations in a video.
1 code implementation • 9 Jun 2021 • Jingyuan Chen, Guanchen Ding, Yuchen Yang, Wenwei Han, Kangmin Xu, Tianyi Gao, Zhe Zhang, Wanping Ouyang, Hao Cai, Zhenzhong Chen
For the vehicle detection and tracking module, we adopted YOLOv5 and multi-scale tracking to localize the anomalies.
1 code implementation • 17 May 2021 • Yaochen Zhu, Zhenzhong Chen
Moreover, by considering the fusion of collaborative and feature variables as a virtual communication channel from an information-theoretic perspective, we introduce a user-dependent channel to dynamically control the information allowed to be accessed from the feature embeddings.
1 code implementation • 21 Jul 2020 • Nannan Li, Zhenzhong Chen
Constructing adversarial examples in a black-box threat model injures the original images by introducing visual distortion.
1 code implementation • 15 Jun 2020 • Xianhang Cheng, Zhenzhong Chen
During the learning process, different intermediate time steps can be used as a control variable by means of an extension of the coord-conv trick, allowing the estimated components to vary with different input temporal information.
no code implementations • 27 May 2020 • Xiaoying Ding, Zhenzhong Chen
Traditional 3D mesh saliency detection algorithms and corresponding databases were proposed under several constraints such as providing limited viewing directions and not taking the subject's movement into consideration.
1 code implementation • 28 Mar 2020 • Yaochen Zhu, Jiayi Xie, Zhenzhong Chen
As an emerging type of user-generated content, micro-video drastically enriches people's entertainment experiences and social interactions.
no code implementations • 24 Mar 2020 • Nannan Li, Zhenzhong Chen
Adversarial learning has shown advances in generating natural and diverse descriptions in image captioning.
5 code implementations • 22 Jul 2019 • Wanjie Sun, Zhenzhong Chen
The proposed resampler network generates content adaptive image resampling kernels that are applied to the original HR input to generate pixels on the downscaled image.
Ranked #1 on Image Super-Resolution on DIV2K val - 2x upscaling (using extra training data)
no code implementations • 2 Jul 2019 • Canwen Xu, Zhenzhong Chen, Chenliang Li
Recently, with the prevalence of large-scale image datasets, the co-occurrence information among classes has become rich, calling for a new way to exploit it to facilitate inference.
no code implementations • 1 Jul 2019 • Chenliang Li, Xichuan Niu, Xiangyang Luo, Zhenzhong Chen, Cong Quan
Given a sequence of items historically purchased by a user, we devise a novel hierarchical attention-over-attention mechanism to capture sequential patterns at both the union level and the individual level.
no code implementations • CVPR 2018 • Bin Xu, Zhenzhong Chen
In this paper, we present an end-to-end deep learning based framework for 3D object detection from a single monocular image.
Ranked #12 on Vehicle Pose Estimation on KITTI Cars Hard
3D Object Detection • 3D Object Detection From Monocular Images
no code implementations • CVPR 2018 • Yicheng Wang, Zhenzhong Chen, Feng Wu, Gang Wang
In this paper, a novel deep architecture named BraidNet is proposed for person re-identification.
no code implementations • 28 Feb 2015 • Weiyao Lin, Ming-Ting Sun, Hongxiang Li, Zhenzhong Chen, Wei Li, Bing Zhou
We demonstrate that this low-computation-complexity method can efficiently capture the characteristics of the frame.
no code implementations • 21 Feb 2015 • Yuanzhe Chen, Weiyao Lin, Chongyang Zhang, Zhenzhong Chen, Ning Xu, Jun Xie
In this paper, we propose a new intra-and-inter-constraint-based video enhancement approach aiming to 1) achieve high intra-frame quality of the entire picture, where multiple regions of interest (ROIs) can be adaptively and simultaneously enhanced, and 2) guarantee inter-frame quality consistency among video frames.
no code implementations • 21 Feb 2015 • Weiyao Lin, Hang Chu, Jianxin Wu, Bin Sheng, Zhenzhong Chen
In this paper, a new heat-map-based (HMB) algorithm is proposed for group activity recognition.