Search Results for author: Suhua Tang

Found 8 papers, 3 papers with code

Deep Attention-Based Alignment Network for Melody Generation from Incomplete Lyrics

no code implementations • 23 Jan 2023 • Gurunath Reddy M, Zhe Zhang, Yi Yu, Florian Harscoet, Simon Canales, Suhua Tang

We propose a deep attention-based alignment network that automatically predicts lyrics and melody from incomplete lyrics given as input, in a way similar to how humans create music.

Deep Attention

Variational Autoencoder with CCA for Audio-Visual Cross-Modal Retrieval

no code implementations • 5 Dec 2021 • Jiwei Zhang, Yi Yu, Suhua Tang, Jianming Wu, Wei Li

On the one hand, the audio encoder and the visual encoder separately encode audio data and visual data into two different latent spaces.

Cross-Modal Retrieval • Information Retrieval • +1

Automatic Neural Lyrics and Melody Composition

no code implementations • 12 Nov 2020 • Gurunath Reddy Madhumani, Yi Yu, Florian Harscoët, Simon Canales, Suhua Tang

In this paper, we propose a technique to address the most challenging aspect of the algorithmic songwriting process, enabling the human community to discover original lyrics and melodies suitable for the generated lyrics.

Sentence

Ensemble Super-Resolution with A Reference Dataset

1 code implementation • 12 May 2019 • Junjun Jiang, Yi Yu, Zheng Wang, Suhua Tang, Ruimin Hu, Jiayi Ma

In this paper, we present a simple but effective single-image SR method based on ensemble learning, which can outperform any of the SR methods being ensembled (the component super-resolvers).

Ensemble Learning • Image Super-Resolution

Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning

2 code implementations • 3 Sep 2018 • Junjun Jiang, Yi Yu, Suhua Tang, Jiayi Ma, Akiko Aizawa, Kiyoharu Aizawa

To this end, this study incorporates the contextual information of image patches and proposes a powerful and efficient context-patch based face hallucination approach, namely Thresholding Locality-constrained Representation and Reproducing Learning (TLcR-RL).

Face Hallucination • Hallucination • +1

Deep CNN Denoiser and Multi-layer Neighbor Component Embedding for Face Hallucination

1 code implementation • 28 Jun 2018 • Junjun Jiang, Yi Yu, Jinhui Hu, Suhua Tang, Jiayi Ma

Most current face hallucination methods, whether shallow learning-based or deep learning-based, try to learn a relationship model between the Low-Resolution (LR) and High-Resolution (HR) spaces with the help of a training set.

Face Hallucination • Hallucination • +1

Category-Based Deep CCA for Fine-Grained Venue Discovery from Multimodal Data

no code implementations • 8 May 2018 • Yi Yu, Suhua Tang, Kiyoharu Aizawa, Akiko Aizawa

Given a photo as input, this model performs (i) exact venue search (finding the venue where the photo was taken) and (ii) group venue search (finding relevant venues in the same category as the photo), using the cross-modal correlation between the input photo and the textual descriptions of venues.

Cross-Modal Retrieval • Retrieval

Towards Deep Modeling of Music Semantics using EEG Regularizers

no code implementations • 14 Dec 2017 • Francisco Raposo, David Martins de Matos, Ricardo Ribeiro, Suhua Tang, Yi Yu

Modeling of music audio semantics has previously been tackled by learning mappings from audio data to high-level tags or latent unsupervised spaces.

Cross-Modal Retrieval • EEG • +2
