1 code implementation • CVPR 2023 • Jihyun Lee, Minhyuk Sung, Honggyu Choi, Tae-Kyun Kim
To handle the shape complexity and interaction context between two hands, Im2Hands models the occupancy volume of two hands, conditioned on an RGB image and coarse 3D keypoints, via two novel attention-based modules responsible for (1) initial occupancy estimation and (2) context-aware occupancy refinement, respectively.
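The two-stage design above can be sketched as follows. This is purely an illustration, not the paper's architecture: the single-head scaled dot-product attention, the linear output head, and all dimensions are assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, keys, values):
    # Scaled dot-product attention: each query attends over all keys.
    d = queries.shape[-1]
    weights = softmax(queries @ keys.T / np.sqrt(d))
    return weights @ values

rng = np.random.default_rng(0)
n_points, d = 8, 16                      # query points per hand, feature dim (assumed)
img_feats = rng.normal(size=(32, d))     # image feature tokens (hypothetical)
left_q = rng.normal(size=(n_points, d))  # per-point queries, left hand
right_q = rng.normal(size=(n_points, d)) # per-point queries, right hand

# (1) Initial occupancy estimation: each hand's query points attend
#     to the image features independently.
left_init = cross_attention(left_q, img_feats, img_feats)
right_init = cross_attention(right_q, img_feats, img_feats)

# (2) Context-aware refinement: each hand's features attend to the
#     other hand's features, modeling the two-hand interaction context.
left_refined = left_init + cross_attention(left_init, right_init, right_init)
right_refined = right_init + cross_attention(right_init, left_init, left_init)

# A final projection maps refined features to an occupancy value in (0, 1).
w_out = rng.normal(size=(d, 1))
left_occ = 1 / (1 + np.exp(-(left_refined @ w_out)))
print(left_occ.shape)  # one occupancy value per query point
```

The refinement stage is the part that distinguishes this setup from per-hand prediction: each hand's occupancy features are updated with attention over the other hand's features before the final occupancy readout.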
1 code implementation • 4 Dec 2023 • Jihyun Lee, Yejin Jeon, Wonjun Lee, Yunsu Kim, Gary Geunbae Lee
We address this by investigating synthetic audio data for audio-based DST.
no code implementations • 1 Jan 2021 • Tae Gyoon Kang, Ho-Gyeong Kim, Min-Joong Lee, Jihyun Lee, Seongmin Ok, Hoshik Lee, Young Sang Choi
Transformers with soft attention have been widely adopted to various sequence-to-sequence tasks.
no code implementations • 1 Apr 2021 • Minsu Kang, Jihyun Lee, Simin Kim, Injung Kim
We propose an end-to-end speech synthesizer, Fast DCTTS, that synthesizes speech in real time on a single CPU thread.
no code implementations • 26 Jul 2021 • Se-Yun Um, Jihyun Kim, Jihyun Lee, Hong-Goo Kang
In this paper, we propose a multi-speaker face-to-speech waveform generation model that also works for unseen speaker conditions.
no code implementations • CVPR 2022 • Jihyun Lee, Minhyuk Sung, HyunJin Kim, Tae-Kyun Kim
We propose a framework that can deform an object in a 2D image as if it existed in 3D space.
no code implementations • 16 Sep 2022 • Jihyun Lee, Gary Geunbae Lee
A few-shot dialogue state tracking (DST) model tracks user requests in dialogue with reliable accuracy even from a small amount of data.
no code implementations • 17 Nov 2022 • Jihyun Lee, Chaebin Lee, Yunsu Kim, Gary Geunbae Lee
In dialogue state tracking (DST), labeling the dataset involves considerable human labor.
no code implementations • 17 Mar 2023 • Jihyun Lee, Seungyeon Seo, Yunsu Kim, Gary Geunbae Lee
We present our work on Track 2 in the Dialog System Technology Challenges 11 (DSTC11).
no code implementations • 21 Dec 2023 • Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang
Specifically, we train an encoder module to map ECoG signals to latent embeddings that match Wav2Vec 2.0 representations of the corresponding spoken speech.
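The embedding-matching idea can be sketched with a toy training loop. Everything here is an illustrative assumption: a plain linear encoder, random stand-in targets instead of real Wav2Vec 2.0 representations, an MSE matching loss, and vanilla gradient descent.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d_ecog, d_latent = 64, 32, 16  # frames, ECoG channels, embedding dim (assumed)

ecog = rng.normal(size=(n, d_ecog))       # synthetic ECoG frames (stand-in data)
targets = rng.normal(size=(n, d_latent))  # stand-in for Wav2Vec 2.0 embeddings

W = np.zeros((d_ecog, d_latent))          # weights of a linear "encoder"
lr = 0.01

def mse(a, b):
    return ((a - b) ** 2).mean()

loss_before = mse(ecog @ W, targets)
for _ in range(200):
    pred = ecog @ W                            # encode ECoG into the latent space
    grad = 2 * ecog.T @ (pred - targets) / n   # gradient of the MSE matching loss
    W -= lr * grad                             # gradient-descent update
loss_after = mse(ecog @ W, targets)
print(loss_after < loss_before)  # the encoder's outputs move toward the targets
```

The point of the sketch is only the training signal: the encoder is optimized so its outputs align with fixed target embeddings from a pretrained speech model, rather than being trained to reconstruct the waveform directly.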
no code implementations • 21 Dec 2023 • Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang
In this paper, we propose a neural articulation-to-speech (ATS) framework that synthesizes high-quality speech from articulatory signals in a multi-speaker setting.
no code implementations • 26 Mar 2024 • Jihyun Lee, Shunsuke Saito, Giljoo Nam, Minhyuk Sung, Tae-Kyun Kim
Sampling from our model yields plausible and diverse two-hand shapes in close interaction with or without an object.