Search Results for author: Dong-Jin Kim

Found 17 papers, 8 papers with code

Self-Sufficient Framework for Continuous Sign Language Recognition

no code implementations21 Mar 2023 Youngjoon Jang, Youngtaek Oh, Jae Won Cho, Myungchul Kim, Dong-Jin Kim, In So Kweon, Joon Son Chung

The goal of this work is to develop self-sufficient framework for Continuous Sign Language Recognition (CSLR) that addresses key issues of sign language recognition.

Pseudo Label Sign Language Recognition

Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data

no code implementations26 Jan 2023 Dong-Jin Kim, Tae-Hyun Oh, Jinsoo Choi, In So Kweon

We present a novel data-efficient semi-supervised framework to improve the generalization of image captioning models.

Relational Captioning Sentence

Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition

1 code implementation1 Nov 2022 Youngjoon Jang, Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, Joon Son Chung, In So Kweon

Most existing Continuous Sign Language Recognition (CSLR) benchmarks have fixed backgrounds and are filmed in studios with a static monochromatic background.

Benchmarking Disentanglement +1

Generative Bias for Robust Visual Question Answering

1 code implementation CVPR 2023 Jae Won Cho, Dong-Jin Kim, Hyeonggon Ryu, In So Kweon

In this work, in order to better learn the bias a target VQA model suffers from, we propose a generative method to train the bias model directly from the target model, called GenB.

Knowledge Distillation Question Answering +1

ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection

1 code implementation9 Sep 2021 Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon

A common problem in the task of human-object interaction (HOI) detection is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution.

Human-Object Interaction Detection

LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation

no code implementations ICCV 2021 Inkyu Shin, Dong-Jin Kim, Jae Won Cho, Sanghyun Woo, KwanYong Park, In So Kweon

In order to find the uncertain points, we generate an inconsistency mask using the proposed adaptive pixel selector and we label these segment-based regions to achieve near supervised performance with only a small fraction (about 2. 2%) ground truth points, which we call "Segment based Pixel-Labeling (SPL)".

Semantic Segmentation Unsupervised Domain Adaptation

MCDAL: Maximum Classifier Discrepancy for Active Learning

1 code implementation23 Jul 2021 Jae Won Cho, Dong-Jin Kim, Yunjae Jung, In So Kweon

Recent state-of-the-art active learning methods have mostly leveraged Generative Adversarial Networks (GAN) for sample acquisition; however, GAN is usually known to suffer from instability and sensitivity to hyper-parameters.

Active Learning Classification +3

DASO: Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning

1 code implementation CVPR 2022 Youngtaek Oh, Dong-Jin Kim, In So Kweon

The capability of the traditional semi-supervised learning (SSL) methods is far from real-world application due to severely biased pseudo-labels caused by (1) class imbalance and (2) class distribution mismatch between labeled and unlabeled data.

imbalanced classification Pseudo Label +1

Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation

no code implementations13 Apr 2021 Jae Won Cho, Dong-Jin Kim, Jinsoo Choi, Yunjae Jung, In So Kweon

In this work, we address the issues of missing modalities that have arisen from the Visual Question Answer-Difference prediction task and find a novel method to solve the task at hand.

Knowledge Distillation Visual Question Answering (VQA)

Dense Relational Image Captioning via Multi-task Triple-Stream Networks

1 code implementation8 Oct 2020 Dong-Jin Kim, Tae-Hyun Oh, Jinsoo Choi, In So Kweon

To this end, we propose the multi-task triple-stream network (MTTSNet) which consists of three recurrent units responsible for each POS which is trained by jointly predicting the correct captions and POS for each word.

Graph Generation Object +4

Detecting Human-Object Interactions with Action Co-occurrence Priors

1 code implementation17 Jul 2020 Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon

A common problem in human-object interaction (HOI) detection task is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution.

Human-Object Interaction Detection

Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach

no code implementations IJCNLP 2019 Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh, In So Kweon

To this end, our proposed semi-supervised learning method assigns pseudo-labels to unpaired samples via Generative Adversarial Networks to learn the joint distribution of image and caption.

Image Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.