Search Results for author: Dong-Jin Kim

Found 17 papers, 8 papers with code

NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

no code implementations • 5 Sep 2023 • TaeHoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh, Jonghwan Mun, Solgil Oh, Kenan Emir Ak, Gwang-Gook Lee, Yan Xu, Mingwei Shen, Kyomin Hwang, Wonsik Shin, Kamin Lee, Wonhark Park, Dongkwan Lee, Nojun Kwak, Yujin Wang, Yimu Wang, Tiancheng Gu, Xingchang Lv, Mingmao Sun

In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge.

Fairness Image Captioning

Paper
Add Code

Self-Sufficient Framework for Continuous Sign Language Recognition

no code implementations • 21 Mar 2023 • Youngjoon Jang, Youngtaek Oh, Jae Won Cho, Myungchul Kim, Dong-Jin Kim, In So Kweon, Joon Son Chung

The goal of this work is to develop self-sufficient framework for Continuous Sign Language Recognition (CSLR) that addresses key issues of sign language recognition.

Pseudo Label Sign Language Recognition

Paper
Add Code

Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data

no code implementations • 26 Jan 2023 • Dong-Jin Kim, Tae-Hyun Oh, Jinsoo Choi, In So Kweon

We present a novel data-efficient semi-supervised framework to improve the generalization of image captioning models.

Relational Captioning Sentence

Paper
Add Code

Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition

1 code implementation • 1 Nov 2022 • Youngjoon Jang, Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, Joon Son Chung, In So Kweon

Most existing Continuous Sign Language Recognition (CSLR) benchmarks have fixed backgrounds and are filmed in studios with a static monochromatic background.

Benchmarking Disentanglement +1

Paper
Code

Generative Bias for Robust Visual Question Answering

1 code implementation • CVPR 2023 • Jae Won Cho, Dong-Jin Kim, Hyeonggon Ryu, In So Kweon

In this work, in order to better learn the bias a target VQA model suffers from, we propose a generative method to train the bias model directly from the target model, called GenB.

Knowledge Distillation Question Answering +1

Paper
Code

Single-Modal Entropy based Active Learning for Visual Question Answering

no code implementations • 21 Oct 2021 • Dong-Jin Kim, Jae Won Cho, Jinsoo Choi, Yunjae Jung, In So Kweon

In this work, we address Active Learning in the multi-modal setting of Visual Question Answering (VQA).

Active Learning Question Answering +1

Paper
Add Code

ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection

1 code implementation • 9 Sep 2021 • Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon

A common problem in the task of human-object interaction (HOI) detection is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution.

Ranked #41 on Human-Object Interaction Detection on HICO-DET

Human-Object Interaction Detection

Paper
Code

LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation

no code implementations • ICCV 2021 • Inkyu Shin, Dong-Jin Kim, Jae Won Cho, Sanghyun Woo, KwanYong Park, In So Kweon

In order to find the uncertain points, we generate an inconsistency mask using the proposed adaptive pixel selector and we label these segment-based regions to achieve near supervised performance with only a small fraction (about 2. 2%) ground truth points, which we call "Segment based Pixel-Labeling (SPL)".

Semantic Segmentation Unsupervised Domain Adaptation

Paper
Add Code

MCDAL: Maximum Classifier Discrepancy for Active Learning

1 code implementation • 23 Jul 2021 • Jae Won Cho, Dong-Jin Kim, Yunjae Jung, In So Kweon

Recent state-of-the-art active learning methods have mostly leveraged Generative Adversarial Networks (GAN) for sample acquisition; however, GAN is usually known to suffer from instability and sensitivity to hyper-parameters.

Active Learning Classification +3

Paper
Code

DASO: Distribution-Aware Semantics-Oriented Pseudo-label for Imbalanced Semi-Supervised Learning

1 code implementation • CVPR 2022 • Youngtaek Oh, Dong-Jin Kim, In So Kweon

The capability of the traditional semi-supervised learning (SSL) methods is far from real-world application due to severely biased pseudo-labels caused by (1) class imbalance and (2) class distribution mismatch between labeled and unlabeled data.

imbalanced classification Pseudo Label +1

Paper
Code

Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation

no code implementations • 13 Apr 2021 • Jae Won Cho, Dong-Jin Kim, Jinsoo Choi, Yunjae Jung, In So Kweon

In this work, we address the issues of missing modalities that have arisen from the Visual Question Answer-Difference prediction task and find a novel method to solve the task at hand.

Knowledge Distillation Visual Question Answering (VQA)

Paper
Add Code

OGLE-2017-BLG-1049: Another giant planet microlensing event

no code implementations • 21 Dec 2020 • Yun Hak Kim, Sun-Ju Chung, A. Udalski, Ian A. Bond, Youn Kil Jung, Andrew Gould, Michael D. Albrow, Cheongho Han, Kyu-Ha Hwang, Yoon-Hyun Ryu, In-Gu Shin, Yossi Shvartzvald, Jennifer C. Yee, Weicheng Zang, Sang-Mok Cha, Dong-Jin Kim, Hyoun-Woo Kim, Seung-Lee Kim, Chung-Uk Lee, Dong-Joo Lee, Yongseok Lee, Byeong-Gon Park, Richard W. Pogge, Radek Poleski, Przemek Mroz, Jan Skowron, Michal K. Szymanski, Igor Soszynski, Pawel Pietrukowicz, Syzmon Kozlowski, Krzysztof Ulaczyk, Krzysztof A. Rybicki, Patryk Iwanek, Fumio Abe, Richard Barry, David P. Bennett, Aparna Bhattacharya, Martin Donachie, Hirosane Fujii, Akihiko Fukui, Yoshitaka Itow, Yuki Hirao, Rintaro Kirikawa, Iona Kondo, Naoki Koshimoto, Yutaka Matsubara, Yasushi Muraki, Shota Miyazaki, Clement Ranc, Nicholas J. Rattenbury, Yuki Satoh, Hikaru Shoji, Takahiro Sumi, Daisuke Suzuki, Paul J. Tristram, Yuzuru Tanaka, Tsubasa Yamawaki, Atsunori Yonehara

From this, we find that the lens system has a star with mass $M_{\rm h}=0. 55^{+0. 36}_{-0. 29} \ M_{\odot}$ hosting a giant planet with $M_{\rm p}=5. 53^{+3. 62}_{-2. 87} \ M_{\rm Jup}$, at a distance of $D_{\rm L}=5. 67^{+1. 11}_{-1. 52}\ {\rm kpc}$.

Earth and Planetary Astrophysics Astrophysics of Galaxies

Paper
Add Code

Dense Relational Image Captioning via Multi-task Triple-Stream Networks

1 code implementation • 8 Oct 2020 • Dong-Jin Kim, Tae-Hyun Oh, Jinsoo Choi, In So Kweon

To this end, we propose the multi-task triple-stream network (MTTSNet) which consists of three recurrent units responsible for each POS which is trained by jointly predicting the correct captions and POS for each word.

Ranked #1 on Relational Captioning on relational captioning dataset

Graph Generation Object +4

Paper
Code

Detecting Human-Object Interactions with Action Co-occurrence Priors

1 code implementation • 17 Jul 2020 • Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon

A common problem in human-object interaction (HOI) detection task is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution.

Human-Object Interaction Detection

Paper
Code

Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach

no code implementations • IJCNLP 2019 • Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh, In So Kweon

To this end, our proposed semi-supervised learning method assigns pseudo-labels to unpaired samples via Generative Adversarial Networks to learn the joint distribution of image and caption.

Image Captioning