Search Results for author: Takashi Shibuya

Found 10 papers, 6 papers with code

BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network

2 code implementations • 6 Sep 2023 • Takashi Shibuya, Yuhta Takida, Yuki Mitsufuji

In the literature, it has been demonstrated that slicing adversarial network (SAN), an improved GAN training framework that can find the optimal projection, is effective in the image generation task.

Ranked #2 on Speech Synthesis on LibriTTS

Generative Adversarial Network, Speech Synthesis



SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

1 code implementation • 16 May 2022 • Yuhta Takida, Takashi Shibuya, Wei-Hsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji

In this paper, we propose a new training scheme that extends the standard VAE via novel stochastic dequantization and quantization, called stochastically quantized variational autoencoder (SQ-VAE).

Quantization



Nested Named Entity Recognition via Second-best Sequence Learning and Decoding

3 code implementations • 5 Sep 2019 • Takashi Shibuya, Eduard Hovy

When an entity name contains other names within it, the identification of all combinations of names can become difficult and expensive.

Ranked #2 on Nested Mention Recognition on ACE 2005

Named Entity Recognition +3



Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement

1 code implementation • 27 Oct 2022 • Ryosuke Sawata, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji

Although deep neural network (DNN)-based speech enhancement (SE) methods outperform the previous non-DNN-based ones, they often degrade the perceptual quality of generated outputs.

Denoising, Speech Enhancement


Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER

1 code implementation • ACL 2022 • Dong-Ho Lee, Akshen Kadakia, Kangmin Tan, Mahak Agarwal, Xinyu Feng, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, Xiang Ren

We also find that good demonstrations can save many labeled examples and that consistency in demonstrations contributes to better performance.

Domain Adaptation, Few-Shot Text Classification +6


SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer

1 code implementation • 30 Jan 2023 • Yuhta Takida, Masaaki Imaizumi, Takashi Shibuya, Chieh-Hsin Lai, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji

Generative adversarial networks (GANs) learn a target probability distribution by optimizing a generator and a discriminator with minimax objectives.

Ranked #1 on Image Generation on FFHQ 1024 x 1024

Image Generation


XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models

no code implementations • 30 Oct 2022 • Dong-Ho Lee, Akshen Kadakia, Brihi Joshi, Aaron Chan, Ziyi Liu, Kiran Narahari, Takashi Shibuya, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, Xiang Ren

Explanation-based model debugging aims to resolve spurious biases by showing human users explanations of model behavior, asking users to give feedback on the behavior, then using the feedback to update the model.

Text Classification


Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders

no code implementations • 18 May 2023 • Hao Shi, Kazuki Shimada, Masato Hirano, Takashi Shibuya, Yuichiro Koyama, Zhi Zhong, Shusuke Takahashi, Tatsuya Kawahara, Yuki Mitsufuji

At the decoded feature level, we fuse the two features decoded by the generative and predictive decoders.

Speech Enhancement


On the Language Encoder of Contrastive Cross-modal Models

no code implementations • 20 Oct 2023 • Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji

Contrastive cross-modal models such as CLIP and CLAP aid various vision-language (VL) and audio-language (AL) tasks.

Sentence, Sentence Embedding +1


HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes

no code implementations • 31 Dec 2023 • Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji

Vector quantization (VQ) is a technique to deterministically learn features with discrete codebook representations.

Quantization, Representation Learning

