Search Results for author: Jaeyeon Kim

Found 6 papers, 3 papers with code

Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations

no code implementations • 2 Feb 2024 • Jaeyeon Kim, Injune Hwang, Kyogu Lee

We propose a framework to learn semantics from raw audio signals using two types of representations, encoding contextual and phonetic information respectively.

Language Modelling, Spoken Language Understanding

EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning

1 code implementation • 31 Jan 2024 • Jaeyeon Kim, JaeYoon Jung, Jinjoo Lee, Sang Hoon Woo

We also introduce a new training objective called masked codec modeling that improves acoustic awareness of the pretrained language model.

AudioCaps, Audio captioning +1

Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates

1 code implementation • 20 Sep 2023 • Ka Chun Shum, Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

Specifically, to insert a new foreground object represented by a set of multi-view images into a background radiance field, we use a text-to-image diffusion model to learn and generate combined images that fuse the object of interest into the given background across views.

3D Reconstruction, Object +1

PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS

2 code implementations • 24 Feb 2023 • Junhyeok Lee, Wonbin Jung, Hyunjae Cho, Jaeyeon Kim, Jaehwan Kim

Previous pitch-controllable text-to-speech (TTS) models rely on directly modeling fundamental frequency, leading to low variance in synthesized speech.

Variational Inference

Minimal Adversarial Examples for Deep Learning on 3D Point Clouds

no code implementations • ICCV 2021 • Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

With recent developments of convolutional neural networks, deep learning for 3D point clouds has shown significant progress in various 3D scene understanding tasks, e.g., object recognition, semantic segmentation.

3D Object Recognition, Object Detection +3
