Search Results for author: Yu-Hua Chen

Found 15 papers, 5 papers with code

Source Separation-based Data Augmentation for Improved Joint Beat and Downbeat Tracking

1 code implementation16 Jun 2021 Ching-Yu Chiu, Joann Ching, Wen-Yi Hsiao, Yu-Hua Chen, Alvin Wen-Yu Su, Yi-Hsuan Yang

Due to advances in deep learning, the performance of automatic beat and downbeat tracking in musical audio signals has seen great improvement in recent years.

Data Augmentation

Automatic Composition of Guitar Tabs by Transformers and Groove Modeling

no code implementations4 Aug 2020 Yu-Hua Chen, Yu-Hsiang Huang, Wen-Yi Hsiao, Yi-Hsuan Yang

Deep learning algorithms are increasingly developed for learning to compose music in the form of MIDI files.

Sound Audio and Speech Processing

Analogical Image Translation for Fog Generation

no code implementations28 Jun 2020 Rui Gong, Dengxin Dai, Yu-Hua Chen, Wen Li, Luc van Gool

AIT achieves this zero-shot image translation capability by coupling a supervised training scheme in the synthetic domain, a cycle consistency strategy in the real domain, an adversarial training scheme between the two domains, and a novel network design.

Image-to-Image Translation Scene Understanding +1

Unconditional Audio Generation with Generative Adversarial Networks and Cycle Regularization

1 code implementation18 May 2020 Jen-Yu Liu, Yu-Hua Chen, Yin-Cheng Yeh, Yi-Hsuan Yang

Audio examples, as well as the code for implementing our model, will be publicly available online upon paper publication.

Audio Generation

Unbiased Mean Teacher for Cross-domain Object Detection

1 code implementation CVPR 2021 Jinhong Deng, Wen Li, Yu-Hua Chen, Lixin Duan

We reveal that there often exists a considerable model bias for the simple mean teacher (MT) model in cross-domain scenarios, and eliminate the model bias with several simple yet highly effective strategies.

object-detection Object Detection +1

Score and Lyrics-Free Singing Voice Generation

1 code implementation26 Dec 2019 Jen-Yu Liu, Yu-Hua Chen, Yin-Cheng Yeh, Yi-Hsuan Yang

Generative models for singing voice have been mostly concerned with the task of ``singing voice synthesis,'' i. e., to produce singing voice waveforms given musical scores and text lyrics.

Audio Generation Singing Voice Synthesis

Domain Agnostic Feature Learning for Image and Video Based Face Anti-spoofing

no code implementations15 Dec 2019 Suman Saha, Wen-Hao Xu, Menelaos Kanakis, Stamatios Georgoulis, Yu-Hua Chen, Danda Pani Paudel, Luc van Gool

Face anti-spoofing is a measure towards this direction for bio-metric user authentication, and in particular face recognition, that tries to prevent spoof attacks.

Face Anti-Spoofing Face Recognition

Enhanced generative adversarial network for 3D brain MRI super-resolution

no code implementations10 Jul 2019 Jiancong Wang, Yu-Hua Chen, Yifan Wu, Jianbo Shi, James Gee

Single image super-resolution (SISR) reconstruction for magnetic resonance imaging (MRI) has generated significant interest because of its potential to not only speed up imaging but to improve quantitative processing and analysis of available image data.

Image Super-Resolution SSIM

Dixit: Interactive Visual Storytelling via Term Manipulation

no code implementations6 Mar 2019 Chao-Chun Hsu, Yu-Hua Chen, Zi-Yuan Chen, Hsin-Yu Lin, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

In this paper, we introduce Dixit, an interactive visual storytelling system that the user interacts with iteratively to compose a short story for a photo sequence.

Visual Storytelling

DLOW: Domain Flow for Adaptation and Generalization

1 code implementation CVPR 2019 Rui Gong, Wen Li, Yu-Hua Chen, Luc van Gool

In this work, we present a domain flow generation(DLOW) model to bridge two different domains by generating a continuous sequence of intermediate domains flowing from one domain to the other.

Domain Adaptation Semantic Segmentation +1

The 2018 DAVIS Challenge on Video Object Segmentation

no code implementations1 Mar 2018 Sergi Caelles, Alberto Montes, Kevis-Kokitsi Maninis, Yu-Hua Chen, Luc van Gool, Federico Perazzi, Jordi Pont-Tuset

Motivated by the analysis of the results of the 2017 edition, the main track of the competition will be the same than in the previous edition (segmentation given the full mask of the objects in the first frame -- semi-supervised scenario).

Interactive Segmentation Semantic Segmentation +2

Calcium Removal From Cardiac CT Images Using Deep Convolutional Neural Network

no code implementations20 Feb 2018 Siming Yan, Feng Shi, Yu-Hua Chen, Damini Dey, Sang-Eun Lee, Hyuk-Jae Chang, Debiao Li, Yibin Xie

Coronary calcium causes beam hardening and blooming artifacts on cardiac computed tomography angiography (CTA) images, which lead to overestimation of lumen stenosis and reduction of diagnostic specificity.

BIG-bench Machine Learning Specificity

Video Object Segmentation Without Temporal Information

no code implementations18 Sep 2017 Kevis-Kokitsi Maninis, Sergi Caelles, Yu-Hua Chen, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc van Gool

Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames.

Foreground Segmentation Semantic Segmentation +3

Semantically-Guided Video Object Segmentation

no code implementations6 Apr 2017 Sergi Caelles, Yu-Hua Chen, Jordi Pont-Tuset, Luc van Gool

This paper tackles the problem of semi-supervised video object segmentation, that is, segmenting an object in a sequence given its mask in the first frame.

Semantic Segmentation Semi-Supervised Video Object Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.