Search Results for author: Mengyang Liu

Found 8 papers, 5 papers with code

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

no code implementations • 25 Sep 2023 • Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang

To address this issue, we introduce an automatic in-the-wild speech data preprocessing framework (AutoPrep) in this paper, which is designed to enhance speech quality, generate speaker labels, and produce transcriptions automatically.

Automatic Speech Recognition Speech Enhancement +3

Paper
Add Code

PTVD: A Large-Scale Plot-Oriented Multimodal Dataset Based on Television Dramas

1 code implementation • 26 Jun 2023 • Chen Li, Xutan Peng, Teng Wang, Yixiao Ge, Mengyang Liu, Xuyuan Xu, Yexin Wang, Ying Shan

Art forms such as movies and television (TV) dramas are reflections of the real world, which have attracted much attention from the multimodal learning community recently.

Genre classification Retrieval +1

Paper
Code

SVCNet: Scribble-based Video Colorization Network with Temporal Aggregation

1 code implementation • 21 Mar 2023 • Yuzhi Zhao, Lai-Man Po, Kangcheng Liu, Xuehui Wang, Wing-Yin Yu, Pengfei Xian, Yujia Zhang, Mengyang Liu

It addresses three common issues in the scribble-based video colorization area: colorization vividness, temporal consistency, and color bleeding.

Colorization Super-Resolution

Paper
Code

Deep Image Style Transfer from Freeform Text

no code implementations • 13 Dec 2022 • Tejas Santanam, Mengyang Liu, Jiangyue Yu, Zhaodong Yang

The language model returns a closely matching image given a style text and description input, which is then passed to the style transfer model with an input content image to create a final output.

Language Modelling Style Transfer

Paper
Add Code

SciAnnotate: A Tool for Integrating Weak Labeling Sources for Sequence Labeling

1 code implementation • 7 Aug 2022 • Mengyang Liu, Haozheng Luo, Leonard Thong, Yinghao Li, Chao Zhang, Le Song

Compared to frequently used text annotation tools, our annotation tool allows for the development of weak labels in addition to providing a manual annotation experience.

Denoising named-entity-recognition +3

Paper
Code

Graph Condensation via Receptive Field Distribution Matching

no code implementations • 28 Jun 2022 • Mengyang Liu, Shanchuan Li, Xinshi Chen, Le Song

Thus, we propose Graph Condesation via Receptive Field Distribution Matching (GCDM), which is accomplished by optimizing the synthetic graph through the use of a distribution matching loss quantified by maximum mean discrepancy (MMD).

Paper
Add Code

Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation

1 code implementation • 16 Dec 2021 • Yujia Zhang, Lai-Man Po, Xuyuan Xu, Mengyang Liu, Yexin Wang, Weifeng Ou, Yuzhi Zhao, Wing-Yin Yu

Moreover, we employ a joint optimization combining pretext tasks with contrastive learning to further enhance the spatio-temporal representation learning.

Contrastive Learning Representation Learning +1

Paper
Code

VCGAN: Video Colorization with Hybrid Generative Adversarial Network

1 code implementation • 26 Apr 2021 • Yuzhi Zhao, Lai-Man Po, Wing-Yin Yu, Yasar Abbas Ur Rehman, Mengyang Liu, Yujia Zhang, Weifeng Ou

We propose a hybrid recurrent Video Colorization with Hybrid Generative Adversarial Network (VCGAN), an improved approach to video colorization using end-to-end learning.

Colorization Generative Adversarial Network +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.