Search Results for author: Jinhyung Kim

Found 13 papers, 4 papers with code

EXAONEPath 1.0 Patch-level Foundation Model for Pathology

1 code implementation1 Aug 2024 Juseung Yun, Yi Hu, Jinhyung Kim, Jongseong Jang, Soonyoung Lee

To address this issue, we introduce EXAONEPath, a novel foundational model trained on patches that have undergone stain normalization.

Self-Supervised Learning whole slide images

Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition

1 code implementation13 Jun 2024 Youngtaek Oh, Pyunghwan Ahn, Jinhyung Kim, Gwangmo Song, Soonyoung Lee, In So Kweon, Junmo Kim

Vision and language models (VLMs) such as CLIP have showcased remarkable zero-shot recognition abilities yet face challenges in visio-linguistic compositionality, particularly in linguistic comprehension and fine-grained image-text alignment.

Retrieval Zero-Shot Learning

Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pretraining

no code implementations19 Dec 2023 Bumsoo Kim, Yeonsik Jo, Jinhyung Kim, Seung Hwan Kim

Contrastive Language-Image Pretraining has emerged as a prominent approach for training vision and text encoders with uncurated image-text pairs from the web.

Image Augmentation Metric Learning +1

Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders

no code implementations19 Dec 2023 Bumsoo Kim, Jinhyung Kim, Yeonsik Jo, Seung Hwan Kim

Based on the unified text embedding space, ECLIPSE compensates for the additional computational cost of the momentum image encoder by expediting the online image encoder.

Knowledge Distillation

Masked Autoencoder for Unsupervised Video Summarization

no code implementations2 Jun 2023 Minho Shim, Taeoh Kim, Jinhyung Kim, Dongyoon Wee

Summarizing a video requires a diverse understanding of the video, ranging from recognizing scenes to evaluating how much each frame is essential enough to be selected as a summary.

Decoder Self-Supervised Learning +1

Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pre-training

no code implementations ICCV 2023 Bumsoo Kim, Yeonsik Jo, Jinhyung Kim, Seunghwan Kim

Contrastive Language-Image Pretraining has emerged as a prominent approach for training vision and text encoders with uncurated image-text pairs from the web.

Image Augmentation Metric Learning +1

Exploring Temporally Dynamic Data Augmentation for Video Recognition

no code implementations30 Jun 2022 Taeoh Kim, Jinhyung Kim, Minho Shim, Sangdoo Yun, Myunggu Kang, Dongyoon Wee, Sangyoun Lee

The magnitude of augmentation operations on each frame is changed by an effective mechanism, Fourier Sampling that parameterizes diverse, smooth, and realistic temporal variations.

Action Segmentation Image Augmentation +3

Frequency Selective Augmentation for Video Representation Learning

no code implementations8 Apr 2022 Jinhyung Kim, Taeoh Kim, Minho Shim, Dongyoon Han, Dongyoon Wee, Junmo Kim

FreqAug stochastically removes specific frequency components from the video so that learned representation captures essential features more from the remaining information for various downstream tasks.

Action Recognition Data Augmentation +3

VideoMix: Rethinking Data Augmentation for Video Classification

3 code implementations7 Dec 2020 Sangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Jinhyung Kim

Recent data augmentation strategies have been reported to address the overfitting problems in static image classifiers.

Action Localization Action Recognition +5

Regularization on Spatio-Temporally Smoothed Feature for Action Recognition

no code implementations CVPR 2020 Jinhyung Kim, Seunghwan Cha, Dongyoon Wee, Soonmin Bae, Junmo Kim

We present that selective regularization on this locally smoothed feature makes a model handle the low-frequency and high-frequency component distinctively, resulting in performance improvement.

Action Recognition Temporal Action Localization

Predictive Coding-based Deep Dynamic Neural Network for Visuomotor Learning

no code implementations8 Jun 2017 Jungsik Hwang, Jinhyung Kim, Ahmadreza Ahmadi, Minkyu Choi, Jun Tani

This study presents a dynamic neural network model based on the predictive coding framework for perceiving and predicting the dynamic visuo-proprioceptive patterns.

Action Generation

Achieving Synergy in Cognitive Behavior of Humanoids via Deep Learning of Dynamic Visuo-Motor-Attentional Coordination

no code implementations9 Jul 2015 Jungsik Hwang, Minju Jung, Naveen Madapana, Jinhyung Kim, Minkyu Choi, Jun Tani

The current study examines how adequate coordination among different cognitive processes including visual recognition, attention switching, action preparation and generation can be developed via learning of robots by introducing a novel model, the Visuo-Motor Deep Dynamic Neural Network (VMDNN).

Cannot find the paper you are looking for? You can Submit a new open access paper.