Search Results for author: Taehwan Kim

Found 9 papers, 2 papers with code

Sparse Coding for Learning Interpretable Spatio-Temporal Primitives

no code implementations NeurIPS 2010 Taehwan Kim, Gregory Shakhnarovich, Raquel Urtasun

Sparse coding has recently become a popular approach in computer vision to learn dictionaries of natural images.

Signer-independent Fingerspelling Recognition with Deep Neural Network Adaptation

no code implementations 13 Feb 2016 Taehwan Kim, Weiran Wang, Hao Tang, Karen Livescu

Previous work has shown that it is possible to achieve almost 90% accuracies on fingerspelling recognition in a signer-dependent setting.

Automatic Speech Recognition (ASR) +1

American Sign Language fingerspelling recognition from video: Methods for unrestricted recognition and signer-independence

no code implementations 30 Aug 2016 Taehwan Kim

In this thesis, we study the problem of recognizing video sequences of fingerspelled letters in American Sign Language (ASL).

Lexicon-Free Fingerspelling Recognition from Video: Data, Models, and Signer Adaptation

no code implementations 26 Sep 2016 Taehwan Kim, Jonathan Keane, Weiran Wang, Hao Tang, Jason Riggle, Gregory Shakhnarovich, Diane Brentari, Karen Livescu

Recognizing fingerspelling is challenging for a number of reasons: It involves quick, small motions that are often highly coarticulated; it exhibits significant variation between signers; and there has been a dearth of continuous fingerspelling data collected.

Understanding Beauty via Deep Facial Features

no code implementations 30 Jan 2019 Xudong Liu, Tao Li, Hao Peng, Iris Chuoying Ouyang, Taehwan Kim, Ruizhe Wang

The concept of beauty has been debated by philosophers and psychologists for centuries, but most definitions are subjective and metaphysical, and deficient in accuracy, generality, and scalability.

Generative Adversarial Network

Technical Report for CVPR 2022 LOVEU AQTC Challenge

1 code implementation 29 Jun 2022 Hyeonyu Kim, Jongeun Kim, Jeonghun Kang, Sanguk Park, Dongchan Park, Taehwan Kim

This technical report presents the 2nd winning model for AQTC, a task newly introduced in CVPR 2022 LOng-form VidEo Understanding (LOVEU) challenges.

Video Understanding

Generating Realistic Images from In-the-wild Sounds

no code implementations ICCV 2023 Taegyeong Lee, Jeonghun Kang, Hyeonyu Kim, Taehwan Kim

Representing wild sounds as images is an important but challenging task due to the lack of paired datasets between sound and images and the significant differences in the characteristics of these two modalities.

Audio captioning, Sentence

Effective Slogan Generation with Noise Perturbation

1 code implementation 6 Oct 2023 Jongeun Kim, MinChung Kim, Taehwan Kim

Slogans play a crucial role in building a firm's brand identity.
