Search Results for author: Mengyang Liu

Found 8 papers, 5 papers with code

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

no code implementations • 25 Sep 2023 • Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang

To address this issue, we introduce an automatic in-the-wild speech data preprocessing framework (AutoPrep) in this paper, which is designed to enhance speech quality, generate speaker labels, and produce transcriptions automatically.

Automatic Speech Recognition, Speech Enhancement, +3

PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas

1 code implementation • 26 Jun 2023 • Chen Li, Xutan Peng, Teng Wang, Yixiao Ge, Mengyang Liu, Xuyuan Xu, Yexin Wang, Ying Shan

Art forms such as movies and television (TV) dramas are reflections of the real world, which have attracted much attention from the multimodal learning community recently.

Genre classification, Retrieval, +1

SVCNet: Scribble-based Video Colorization Network with Temporal Aggregation

1 code implementation • 21 Mar 2023 • Yuzhi Zhao, Lai-Man Po, Kangcheng Liu, Xuehui Wang, Wing-Yin Yu, Pengfei Xian, Yujia Zhang, Mengyang Liu

It addresses three common issues in the scribble-based video colorization area: colorization vividness, temporal consistency, and color bleeding.

Colorization, Super-Resolution

Deep Image Style Transfer from Freeform Text

no code implementations • 13 Dec 2022 • Tejas Santanam, Mengyang Liu, Jiangyue Yu, Zhaodong Yang

The language model returns a closely matching image given a style text and description input, which is then passed to the style transfer model with an input content image to create a final output.

Language Modelling, Style Transfer

SciAnnotate: A Tool for Integrating Weak Labeling Sources for Sequence Labeling

1 code implementation • 7 Aug 2022 • Mengyang Liu, Haozheng Luo, Leonard Thong, Yinghao Li, Chao Zhang, Le Song

Compared to frequently used text annotation tools, our annotation tool allows for the development of weak labels in addition to providing a manual annotation experience.

Denoising, named-entity-recognition, +3

Graph Condensation via Receptive Field Distribution Matching

no code implementations • 28 Jun 2022 • Mengyang Liu, Shanchuan Li, Xinshi Chen, Le Song

Thus, we propose Graph Condensation via Receptive Field Distribution Matching (GCDM), which optimizes the synthetic graph using a distribution matching loss quantified by maximum mean discrepancy (MMD).

Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation

1 code implementation • 16 Dec 2021 • Yujia Zhang, Lai-Man Po, Xuyuan Xu, Mengyang Liu, Yexin Wang, Weifeng Ou, Yuzhi Zhao, Wing-Yin Yu

Moreover, we employ a joint optimization combining pretext tasks with contrastive learning to further enhance the spatio-temporal representation learning.

Contrastive Learning, Representation Learning, +1

VCGAN: Video Colorization with Hybrid Generative Adversarial Network

1 code implementation • 26 Apr 2021 • Yuzhi Zhao, Lai-Man Po, Wing-Yin Yu, Yasar Abbas Ur Rehman, Mengyang Liu, Yujia Zhang, Weifeng Ou

We propose a hybrid recurrent Video Colorization with Hybrid Generative Adversarial Network (VCGAN), an improved approach to video colorization using end-to-end learning.

Colorization, Generative Adversarial Network, +1
