Search Results for author: Seung Hwan Kim

Found 13 papers, 6 papers with code

L-Verse: Bidirectional Generation Between Image and Text

1 code implementation • CVPR 2022 • TaeHoon Kim, Gwangmo Song, Sihaeng Lee, Sangyun Kim, Yewon Seo, Soonyoung Lee, Seung Hwan Kim, Honglak Lee, Kyunghoon Bae

Unlike other models, BiART can distinguish between image (or text) as a conditional reference and a generation target.

Ranked #1 on Image Reconstruction on ImageNet 256x256

Image Captioning Image Reconstruction +5

108

Paper
Code

Enriched CNN-Transformer Feature Aggregation Networks for Super-Resolution

1 code implementation • 15 Mar 2022 • Jinsu Yoo, TaeHoon Kim, Sihaeng Lee, Seung Hwan Kim, Honglak Lee, Tae Hyun Kim

Recent transformer-based super-resolution (SR) methods have achieved promising results against conventional CNN-based methods.

Image Restoration Super-Resolution

Paper
Code

DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning

no code implementations • 17 Aug 2022 • Hyounguk Shon, Janghyeon Lee, Seung Hwan Kim, Junmo Kim

We show that this allows us to design a linear model where quadratic parameter regularization method is placed as the optimal continual learning policy, and at the same time enjoying the high performance of neural networks.

Class Incremental Learning Image Classification +1

Paper
Add Code

Factors that affect the technological transition of firms toward the industry 4.0 technologies

no code implementations • 6 Sep 2022 • Seung Hwan Kim, Jeong hwan Jeon, Anwar Aridi, Bogang Jun

Using the technology space of firms, we can identify firms that successfully develop a new industry 4. 0 technology and examine whether their accumulated capabilities in their previous technology domains positively affect their technological diversification and which factors play a critical role in their transition towards industry 4. 0.

Paper
Add Code

UniCLIP: Unified Framework for Contrastive Language-Image Pre-training

no code implementations • 27 Sep 2022 • Janghyeon Lee, Jongsuk Kim, Hyounguk Shon, Bumsoo Kim, Seung Hwan Kim, Honglak Lee, Junmo Kim

Pre-training vision-language models with contrastive objectives has shown promising results that are both scalable to large uncurated datasets and transferable to many downstream applications.

Paper
Add Code

Large-Scale Bidirectional Training for Zero-Shot Image Captioning

1 code implementation • 13 Nov 2022 • TaeHoon Kim, Mark Marsden, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Alessandra Sala, Seung Hwan Kim

However, we find that large-scale bidirectional training between image and text enables zero-shot image captioning.

Image Captioning Keyword Extraction

Paper
Code

Significantly Improving Zero-Shot X-ray Pathology Classification via Fine-tuning Pre-trained Image-Text Encoders

no code implementations • 14 Dec 2022 • Jongseong Jang, Daeun Kyung, Seung Hwan Kim, Honglak Lee, Kyunghoon Bae, Edward Choi

However, large-scale and high-quality data to train powerful neural networks are rare in the medical domain as the labeling must be done by qualified experts.

Classification Contrastive Learning +2

Paper
Add Code

ReConPatch : Contrastive Patch Representation Learning for Industrial Anomaly Detection

1 code implementation • 26 May 2023 • Jeeho Hyun, Sangyun Kim, Giyoung Jeon, Seung Hwan Kim, Kyunghoon Bae, Byung Jun Kang

In this paper, we introduce ReConPatch, which constructs discriminative features for anomaly detection by training a linear modulation of patch features extracted from the pre-trained model.

Ranked #1 on Anomaly Detection on MVTec AD

Anomaly Detection Contrastive Learning +2

Paper
Code

Story Visualization by Online Text Augmentation with Context Memory

1 code implementation • ICCV 2023 • Daechul Ahn, Daneul Kim, Gwangmo Song, Seung Hwan Kim, Honglak Lee, Dongyeop Kang, Jonghyun Choi

Story visualization (SV) is a challenging text-to-image generation task for the difficulty of not only rendering visual details from the text descriptions but also encoding a long-term context across multiple sentences.

Sentence Story Visualization +2

Paper
Code

NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

no code implementations • 5 Sep 2023 • TaeHoon Kim, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Mark Marsden, Alessandra Sala, Seung Hwan Kim, Bohyung Han, Kyoung Mu Lee, Honglak Lee, Kyounghoon Bae, Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu, Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim, Wooyoung Kang, Won Young Jhoo, Byungseok Roh, Jonghwan Mun, Solgil Oh, Kenan Emir Ak, Gwang-Gook Lee, Yan Xu, Mingwei Shen, Kyomin Hwang, Wonsik Shin, Kamin Lee, Wonhark Park, Dongkwan Lee, Nojun Kwak, Yujin Wang, Yimu Wang, Tiancheng Gu, Xingchang Lv, Mingmao Sun

In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project and share the results and outcomes of 2023 challenge.

Fairness Image Captioning

Paper
Add Code

Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders

no code implementations • 19 Dec 2023 • Bumsoo Kim, Jinhyung Kim, Yeonsik Jo, Seung Hwan Kim

Based on the unified text embedding space, ECLIPSE compensates for the additional computational cost of the momentum image encoder by expediting the online image encoder.

Knowledge Distillation

Paper
Add Code

Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pretraining

no code implementations • 19 Dec 2023 • Bumsoo Kim, Yeonsik Jo, Jinhyung Kim, Seung Hwan Kim

Contrastive Language-Image Pretraining has emerged as a prominent approach for training vision and text encoders with uncurated image-text pairs from the web.

Image Augmentation Metric Learning +1

Paper
Add Code

Universal Noise Annotation: Unveiling the Impact of Noisy annotation on Object Detection

1 code implementation • 21 Dec 2023 • Kwangrok Ryoo, Yeonsik Jo, Seungjun Lee, Mira Kim, Ahra Jo, Seung Hwan Kim, Seungryong Kim, Soonyoung Lee

For object detection task with noisy labels, it is important to consider not only categorization noise, as in image classification, but also localization noise, missing annotations, and bogus bounding boxes.

Image Classification Object +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.