no code implementations • CCL 2020 • Li Song, Ying Liu
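Whether Water Margin (《水浒传》) was written by a single author or jointly, and how Shi Nai'an and Luo Guanzhong were related, has long been disputed. This paper roughly groups the authorship hypotheses into five cases: written by Shi Nai'an; written by Luo Guanzhong; written by Shi and continued by Luo; written by Luo and continued by others; and written by Shi and revised by Luo. Taking Luo Guanzhong's Ping Yao Zhuan (《平妖传》) as a reference, we use hypothesis testing, text clustering, text classification, and fluctuation-style stylometry, combined with analysis of the text content, to examine the writing style of Water Margin and to provide a reference for determining its authorship. The results show that only the "written by Luo and continued by others" case is highly likely, i.e., the first 70 chapters were written by Luo Guanzhong and the rest was continued by others; the other four cases are all less likely.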
no code implementations • LILT 2019 • Bin Li, Yuan Wen, Li Song, Weiguang Qu, Nianwen Xue
One significant change we have made to the AMR annotation methodology is the inclusion of the alignment between word tokens in the sentence and the concepts/relations in the CAMR annotation to make it easier for automatic parsers to model the correspondence between a sentence and its meaning representation.
1 code implementation • CVPR 2023 • Yurong Zhang, Liulei Li, Wenguan Wang, Rong Xie, Li Song, Wenjun Zhang
Current top-leading solutions for video object segmentation (VOS) typically follow a matching-based regime: for each query frame, the segmentation mask is inferred according to its correspondence to previously processed and the first annotated frames.
1 code implementation • CVPR 2023 • Han Xue, Zhiwu Huang, Qianru Sun, Li Song, Wenjun Zhang
In this work, we explore the freestyle capability of the model, i.e., how far can it generate unseen semantics (e.g., classes, attributes, and styles) onto a given layout, and call the task Freestyle LIS (FLIS).
no code implementations • 9 Dec 2022 • Anni Tang, Tianyu He, Xu Tan, Jun Ling, Runnan Li, Sheng Zhao, Li Song, Jiang Bian
More specifically, the implicit memory is employed in the audio-to-expression model to capture high-level semantics in the audio-expression shared space, while the explicit memory is employed in the neural-rendering model to help synthesize pixel-level details.
no code implementations • 15 Oct 2022 • Hengsheng Zhang, Xueyi Zou, Jiaming Guo, Youliang Yan, Rong Xie, Li Song
In this paper, considering the characteristics of compressed videos, we propose a Codec Information Assisted Framework (CIAF) to boost and accelerate recurrent VSR models for compressed videos.
1 code implementation • 6 Sep 2022 • Han Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song
The temporal information is introduced by the temporal feature aggregation model (TFAM), by conducting an attention mechanism between the context frames and the target frame (i.e., the frame to be detected).
Ranked #3 on Video Object Detection on ImageNet VID
no code implementations • 29 Aug 2022 • Jun Ling, Xu Tan, Liyang Chen, Runnan Li, Yuchao Zhang, Sheng Zhao, Li Song
In this paper, we conduct systematic analyses on the motion jittering problem based on a state-of-the-art pipeline that uses 3D face representations to bridge the input audio and output video, and improve the motion stability with a series of effective designs.
no code implementations • 13 Jun 2022 • Yan Huang, Jizheng Xu, Li Zhang, Yan Zhao, Li Song
Inspired by rate control algorithms, we propose a scheme to precisely control the intra encoding complexity of VVC.
no code implementations • 21 Apr 2022 • Anni Tang, Yan Huang, Jun Ling, ZhiYu Zhang, Yiwei Zhang, Rong Xie, Li Song
As the latest video coding standard, versatile video coding (VVC) has shown its ability in retaining pixel quality.
no code implementations • 23 Dec 2021 • Hongcheng Zhong, Jun Xu, Chen Zhu, Donghui Feng, Li Song
Current per-shot encoding schemes aim to improve the compression efficiency by shot-level optimization.
no code implementations • 20 Dec 2021 • Jingchuan Hu, Shuai Guo, Kai Zhou, Yu Dong, Jun Xu, Li Song
As an important application form of immersive multimedia services, free-viewpoint video (FVV) offers users a highly immersive experience through strong interaction.
1 code implementation • CVPR 2021 • Yi Fang, Jiapeng Tang, Wang Shen, Wei Shen, Xiao Gu, Li Song, Guangtao Zhai
In the third stage, we use the generated dual attention as guidance to perform two sub-tasks: (1) identifying whether the gaze target is inside or out of the image; (2) locating the target if inside.
1 code implementation • CVPR 2021 • Jun Ling, Han Xue, Li Song, Rong Xie, Xiao Gu
To ensure the visual style consistency between the foreground and the background, in this paper, we treat image harmonization as a style transfer problem.
Ranked #3 on Image Harmonization on HAdobe5k (1024×1024)
no code implementations • 12 Mar 2021 • Bo Liu, Ming Ding, Hanyu Xue, Tianqing Zhu, Dayong Ye, Li Song, Wanlei Zhou
The excessive use of images in social networks, government databases, and industrial applications has posed great privacy risks and raised serious concerns from the public.
no code implementations • 2 Mar 2021 • Yunqian Wen, Li Song, Bo Liu, Ming Ding, Rong Xie
We propose IdentityDP, a face anonymization framework that combines a data-driven deep neural network with a differential privacy (DP) mechanism.
no code implementations • ICCV 2021 • Jingyi Cao, Bo Liu, Yunqian Wen, Rong Xie, Li Song
The popularization of intelligent devices including smartphones and surveillance cameras results in more serious privacy issues.
no code implementations • LREC 2020 • Li Song, Yuling Dai, Yihuan Liu, Bin Li, Weiguang Qu
Existing lexicons blur the senses and frames of predicates and need to be refined to support tasks such as word sense disambiguation and event extraction.
1 code implementation • ECCV 2020 • Jun Ling, Han Xue, Li Song, Shuhui Yang, Rong Xie, Xiao Gu
Previous methods edit an input image under the guidance of a discrete emotion label or absolute condition (e.g., facial action units) to possess the desired expression.
no code implementations • 14 Sep 2019 • Chuan Shi, Xiaotian Han, Li Song, Xiao Wang, Senzhang Wang, Junping Du, Philip S. Yu
However, the characteristics of users and the properties of items may stem from different aspects, e.g., the brand-aspect and category-aspect of items.
no code implementations • WS 2019 • Yihuan Liu, Bin Li, Peiyi Yan, Li Song, Weiguang Qu
We find that 54.98% of sentences have ellipses.
2 code implementations • 4 Apr 2019 • Xinyuan Chen, Chang Xu, Xiaokang Yang, Li Song, DaCheng Tao
We propose adversarial gated networks (Gated GAN) to transfer multiple styles in a single model.
no code implementations • 20 Apr 2018 • Shiyu Ning, Hongteng Xu, Li Song, Rong Xie, Wenjun Zhang
Transferring a low-dynamic-range (LDR) image to a high-dynamic-range (HDR) image, which is the so-called inverse tone mapping (iTM), is an important imaging technique to improve visual effects of imaging devices.