Search Results for author: Sanghyuk Chun

Found 43 papers, 29 papers with code

Toward Interactive Regional Understanding in Vision-Large Language Models

no code implementations • 27 Mar 2024 • Jungbeom Lee, Sanghyuk Chun, Sangdoo Yun

Recent Vision-Language Pre-training (VLP) models have demonstrated significant advancements.

Language-only Efficient Training of Zero-shot Composed Image Retrieval

1 code implementation • 4 Dec 2023 • Geonmo Gu, Sanghyuk Chun, Wonjae Kim, Yoohoon Kang, Sangdoo Yun

Our LinCIR (Language-only training for CIR) can be trained with text datasets alone via a novel self-supervision scheme named self-masking projection (SMP).

Image Retrieval Retrieval +1
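The SMP objective above can be sketched in a few lines. The callables `embed`, `encode`, and `project` below are hypothetical stand-ins for pieces of a frozen text encoder (illustrative assumptions, not the released LinCIR code):

```python
import numpy as np

def smp_loss(token_ids, keyword_mask, embed, encode, project):
    """Toy self-masking projection (SMP) loss.

    Encode a caption, project its latent back into token-embedding
    space, substitute the projection for the keyword tokens, re-encode,
    and penalize the gap between the two latents.
    """
    tok = embed(token_ids)                  # (L, D) token embeddings
    z = encode(tok)                         # (D,) caption latent
    tok_masked = tok.copy()
    tok_masked[keyword_mask] = project(z)   # self-mask keyword tokens
    return float(np.sum((z - encode(tok_masked)) ** 2))

vocab = np.random.default_rng(0).random((10, 4))  # toy vocabulary table
loss = smp_loss(
    token_ids=np.array([1, 2, 3]),
    keyword_mask=np.array([False, True, False]),
    embed=lambda ids: vocab[ids],
    encode=lambda tok: tok.mean(axis=0),    # toy mean-pooling "encoder"
    project=lambda z: z,                    # toy identity projection
)
assert loss >= 0.0
```

Because the target latent comes from the caption itself, no image data is needed during training.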

Longer-range Contextualized Masked Autoencoder

no code implementations • 20 Oct 2023 • Taekyung Kim, Sanghyuk Chun, Byeongho Heo, Dongyoon Han

However, as the encoder is trained only with partial pixels, MIM pre-training can suffer from a limited capability to capture long-range dependencies.

Attribute Fine-Grained Image Classification +2

Improved Probabilistic Image-Text Representations

1 code implementation • 29 May 2023 • Sanghyuk Chun

Image-Text Matching (ITM) task, a fundamental vision-language (VL) task, suffers from the inherent ambiguity arising from multiplicity and imperfect annotations.

Data Augmentation Image-text matching +2

Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild

1 code implementation • 10 Apr 2023 • Gyeongsik Moon, Hongsuk Choi, Sanghyuk Chun, Jiyoung Lee, Sangdoo Yun

Recovering 3D human mesh in the wild is greatly challenging as in-the-wild (ITW) datasets provide only 2D pose ground truths (GTs).

3D Multi-Person Pose Estimation

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage

1 code implementation • ICCV 2023 • Song Park, Sanghyuk Chun, Byeongho Heo, Wonjae Kim, Sangdoo Yun

We need billion-scale images to achieve more generalizable and ground-breaking vision models, as well as massive dataset storage to ship the images (e.g., the LAION-4B dataset needs 240TB of storage space).

Continual Learning

Re-weighting Based Group Fairness Regularization via Classwise Robust Optimization

no code implementations • 1 Mar 2023 • Sangwon Jung, TaeEon Park, Sanghyuk Chun, Taesup Moon

Many existing group fairness-aware training methods aim to achieve group fairness by either re-weighting underrepresented groups based on certain rules or using weakly approximated surrogates for the fairness metrics as regularization terms in the objective.

Fairness

Group Generalized Mean Pooling for Vision Transformer

no code implementations • 8 Dec 2022 • Byungsoo Ko, Han-Gyu Kim, Byeongho Heo, Sangdoo Yun, Sanghyuk Chun, Geonmo Gu, Wonjae Kim

As ViT groups the channels via a multi-head attention mechanism, grouping the channels by GGeM leads to lower head-wise dependence while amplifying important channels on the activation maps.

Image Retrieval Representation Learning +1
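Group generalized-mean pooling can be sketched with numpy; the group count and the exponent `p` below are illustrative assumptions (in GGeM each group shares a learnable `p`):

```python
import numpy as np

def group_gem_pool(tokens, num_groups=4, p=3.0, eps=1e-6):
    """Generalized-mean (GeM) pooling applied per channel group.

    tokens: (N, D) array of token activations (e.g. ViT patch tokens).
    Channels are split into `num_groups` groups, mirroring attention
    heads; each group is pooled over tokens with a generalized mean.
    """
    n, d = tokens.shape
    assert d % num_groups == 0
    x = np.clip(tokens, eps, None)                 # GeM needs positive inputs
    x = x.reshape(n, num_groups, d // num_groups)
    pooled = (x ** p).mean(axis=0) ** (1.0 / p)    # pool over the N tokens
    return pooled.reshape(-1)                      # (D,) global descriptor

desc = group_gem_pool(np.random.default_rng(0).random((196, 64)))
print(desc.shape)  # (64,)
```

With `p = 1` this reduces to average pooling and as `p` grows it approaches max pooling, so each group can interpolate between the two.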

A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective

1 code implementation • 21 Aug 2022 • Chanwoo Park, Sangdoo Yun, Sanghyuk Chun

Our theoretical results show that regardless of the choice of the mixing strategy, MSDA behaves as a pixel-level regularization of the underlying training loss and a regularization of the first layer parameters.

Adversarial Robustness Data Augmentation
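The unified analysis treats every mixing strategy as one masking template; a minimal sketch (mask values are illustrative):

```python
import numpy as np

def msda_mix(x1, x2, mask):
    """Generic mixed-sample construction: mask*x1 + (1-mask)*x2.

    mixup is the special case of a constant scalar mask lam; CutMix
    uses a binary box mask. Labels are mixed with the same ratio.
    The paper's regularization result holds for any such mask.
    """
    return mask * x1 + (1.0 - mask) * x2

x1, x2 = np.ones((4, 4)), np.zeros((4, 4))

mixed_up = msda_mix(x1, x2, 0.7)            # mixup with lam = 0.7
box = np.zeros((4, 4)); box[1:3, 1:3] = 1.0  # CutMix: paste x2's box into x1
cut_mixed = msda_mix(x1, x2, 1.0 - box)

assert np.allclose(mixed_up, 0.7)
assert cut_mixed[0, 0] == 1.0 and cut_mixed[1, 1] == 0.0
```

Writing both strategies through one mask is what lets the analysis cover them uniformly.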

Domain Generalization by Mutual-Information Regularization with Pre-trained Models

1 code implementation • 21 Mar 2022 • Junbum Cha, Kyungjae Lee, Sungrae Park, Sanghyuk Chun

Domain generalization (DG) aims to learn a model that generalizes to an unseen target domain using only limited source domains.

Domain Generalization

Dataset Condensation with Contrastive Signals

2 code implementations • 7 Feb 2022 • Saehyung Lee, Sanghyuk Chun, Sangwon Jung, Sangdoo Yun, Sungroh Yoon

However, in this study, we prove that existing DC methods can perform worse than the random selection method when task-irrelevant information forms a significant part of the training dataset.

Attribute Continual Learning +2

Few-shot Font Generation with Weakly Supervised Localized Representations

2 code implementations • 22 Dec 2021 • Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim

Existing methods learn to disentangle style and content elements by developing a universal style representation for each font style.

Font Generation

Learning Fair Classifiers with Partially Annotated Group Labels

1 code implementation • CVPR 2022 • Sangwon Jung, Sanghyuk Chun, Taesup Moon

To address this problem, we propose a simple Confidence-based Group Label assignment (CGL) strategy that is readily applicable to any fairness-aware learning method.

Fairness

Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective

no code implementations • ICLR 2022 • Luca Scimeca, Seong Joon Oh, Sanghyuk Chun, Michael Poli, Sangdoo Yun

This phenomenon, also known as shortcut learning, is emerging as a key limitation of the current generation of machine learning models.

Biased Multi-Domain Adversarial Training

no code implementations • 29 Sep 2021 • Saehyung Lee, Hyungyu Lee, Sanghyuk Chun, Sungroh Yoon

Several recent studies have shown that the use of extra in-distribution data can lead to a high level of adversarial robustness.

Adversarial Robustness

StyleAugment: Learning Texture De-biased Representations by Style Augmentation without Pre-defined Textures

no code implementations • 24 Aug 2021 • Sanghyuk Chun, Song Park

Hence, StyleAugment lets the model observe abundant confounding cues for each image through an on-the-fly augmentation strategy, while the augmented images remain more realistic than artistic style-transferred images.

Data Augmentation Style Transfer

Neural Hybrid Automata: Learning Dynamics with Multiple Modes and Stochastic Transitions

no code implementations • NeurIPS 2021 • Michael Poli, Stefano Massaroli, Luca Scimeca, Seong Joon Oh, Sanghyuk Chun, Atsushi Yamashita, Hajime Asama, Jinkyoo Park, Animesh Garg

Effective control and prediction of dynamical systems often require appropriate handling of continuous-time and discrete, event-triggered processes.

Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts

4 code implementations • ICCV 2021 • Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim

MX-Font extracts multiple style features not explicitly conditioned on component labels, but automatically by multiple experts to represent different local concepts, e.g., left-side sub-glyph.

Disentanglement Font Generation +1

Rethinking Spatial Dimensions of Vision Transformers

10 code implementations • ICCV 2021 • Byeongho Heo, Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Junsuk Choe, Seong Joon Oh

We empirically show that such a spatial dimension reduction is beneficial to a transformer architecture as well, and propose a novel Pooling-based Vision Transformer (PiT) upon the original ViT model.

Dimensionality Reduction Image Classification +2
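PiT's spatial reduction between transformer stages can be sketched as follows. PiT uses a strided depthwise convolution; here a 2x2 average pool plus naive channel duplication stands in (both are illustrative assumptions):

```python
import numpy as np

def pit_style_pool(tokens, grid):
    """Halve the token grid and double the channels, PiT-style sketch.

    tokens: (H*W, D) patch tokens laid out row-major on an HxW grid.
    Returns pooled tokens of shape (H*W/4, 2*D) and the new grid size,
    mimicking the stage transition of a pooling-based ViT.
    """
    h, w = grid
    d = tokens.shape[1]
    x = tokens.reshape(h, w, d)
    x = x.reshape(h // 2, 2, w // 2, 2, d).mean(axis=(1, 3))  # 2x2 avg pool
    x = np.concatenate([x, x], axis=-1)  # naive channel doubling (assumption)
    return x.reshape(-1, 2 * d), (h // 2, w // 2)

tok = np.random.default_rng(0).random((196, 64))        # 14x14 grid
pooled, new_grid = pit_style_pool(tok, (14, 14))
print(pooled.shape, new_grid)  # (49, 128) (7, 7)
```

Each stage thus trades spatial resolution for channel width, mirroring the shrinking feature maps of a CNN backbone.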

Probabilistic Embeddings for Cross-Modal Retrieval

4 code implementations • CVPR 2021 • Sanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, Diane Larlus

Instead, we propose to use Probabilistic Cross-Modal Embedding (PCME), where samples from the different modalities are represented as probabilistic distributions in the common embedding space.

Cross-Modal Retrieval Retrieval
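A Monte-Carlo match probability between two such probabilistic embeddings can be sketched like this; the diagonal-Gaussian form and the calibration constants `a`, `b` are illustrative assumptions:

```python
import numpy as np

def match_probability(mu1, sig1, mu2, sig2, n_samples=64, a=1.0, b=0.0):
    """Monte-Carlo match probability between two diagonal Gaussians.

    Each modality sample is a distribution N(mu, diag(sig^2)); the
    match probability averages sigmoid(-(a*dist - b)) over sampled
    embedding pairs, in the spirit of probabilistic cross-modal
    embeddings.
    """
    rng = np.random.default_rng(0)
    z1 = mu1 + sig1 * rng.standard_normal((n_samples, mu1.size))
    z2 = mu2 + sig2 * rng.standard_normal((n_samples, mu2.size))
    dist = np.linalg.norm(z1 - z2, axis=1)
    return float(np.mean(1.0 / (1.0 + np.exp(a * dist - b))))

mu, sig = np.zeros(8), 0.1 * np.ones(8)
same = match_probability(mu, sig, mu, sig)
far = match_probability(mu, sig, mu + 5.0, sig)
assert same > far
```

The spread of each Gaussian lets one caption match several plausible images (and vice versa), which a single point embedding cannot express.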

Few-shot Font Generation with Localized Style Representations and Factorization

3 code implementations • 23 Sep 2020 • Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim

However, learning component-wise styles solely from reference glyphs is infeasible in the few-shot font generation scenario, when a target script has a large number of components, e.g., over 200 for Chinese.

Font Generation

Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets

2 code implementations • 8 Jul 2020 • Junsuk Choe, Seong Joon Oh, Sanghyuk Chun, Seungho Lee, Zeynep Akata, Hyunjung Shim

In this paper, we argue that WSOL task is ill-posed with only image-level labels, and propose a new evaluation protocol where full supervision is limited to only a small held-out set not overlapping with the test set.

Few-Shot Learning Model Selection +1

AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights

4 code implementations • ICLR 2021 • Byeongho Heo, Sanghyuk Chun, Seong Joon Oh, Dongyoon Han, Sangdoo Yun, Gyuwan Kim, Youngjung Uh, Jung-Woo Ha

Because of the scale invariance, this modification only alters the effective step sizes without changing the effective update directions, thus enjoying the original convergence properties of GD optimizers.

Audio Classification Image Classification +3
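The core of the modification above is to remove the radial (norm-growing) component of the momentum update for scale-invariant weights; a minimal sketch of that projection step:

```python
import numpy as np

def project_radial_out(weight, update, eps=1e-8):
    """Remove the component of `update` parallel to `weight`.

    For scale-invariant weights (e.g. those followed by BatchNorm),
    the radial component only inflates the weight norm and shrinks
    the effective step size; keeping the tangential part preserves
    the effective update direction.
    """
    w = weight.ravel()
    u = update.ravel()
    radial = (u @ w) / (w @ w + eps) * w
    return (u - radial).reshape(update.shape)

w = np.array([3.0, 4.0])
u = np.array([1.0, 1.0])
tangential = project_radial_out(w, u)
assert abs(tangential @ w) < 1e-6  # projected update is orthogonal to w
```

In practice such a projection would be applied inside the optimizer, after the momentum update is computed and only for weights detected as scale-invariant.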

Few-shot Compositional Font Generation with Dual Memory

3 code implementations • ECCV 2020 • Junbum Cha, Sanghyuk Chun, Gayoung Lee, Bado Lee, Seonghyeon Kim, Hwalsuk Lee

By utilizing the compositionality of compositional scripts, we propose a novel font generation framework, named Dual Memory-augmented Font Generation Network (DM-Font), which enables us to generate a high-quality font library with only a few samples.

Font Generation

An Empirical Evaluation on Robustness and Uncertainty of Regularization Methods

no code implementations • 9 Mar 2020 • Sanghyuk Chun, Seong Joon Oh, Sangdoo Yun, Dongyoon Han, Junsuk Choe, Youngjoon Yoo

Despite apparent human-level performances of deep neural networks (DNN), they behave fundamentally differently from humans.

Bayesian Inference

Evaluating Weakly Supervised Object Localization Methods Right

2 code implementations • CVPR 2020 • Junsuk Choe, Seong Joon Oh, Seungho Lee, Sanghyuk Chun, Zeynep Akata, Hyunjung Shim

In this paper, we argue that WSOL task is ill-posed with only image-level labels, and propose a new evaluation protocol where full supervision is limited to only a small held-out set not overlapping with the test set.

Few-Shot Learning Model Selection +2

Visualizing and Understanding Self-attention based Music Tagging

no code implementations • 11 Nov 2019 • Minz Won, Sanghyuk Chun, Xavier Serra

Recently, we proposed a self-attention based music tagging model.

Sound Audio and Speech Processing

Neural Approximation of an Auto-Regressive Process through Confidence Guided Sampling

no code implementations • 15 Oct 2019 • YoungJoon Yoo, Sanghyuk Chun, Sangdoo Yun, Jung-Woo Ha, Jaejun Yoo

We first assume that the priors of future samples can be generated in an independently and identically distributed (i.i.d.)

Learning De-biased Representations with Biased Representations

3 code implementations • ICML 2020 • Hyojin Bahng, Sanghyuk Chun, Sangdoo Yun, Jaegul Choo, Seong Joon Oh

This tactic is feasible in many scenarios where it is much easier to define a set of biased representations than to define and quantify bias.

Toward Interpretable Music Tagging with Self-Attention

2 code implementations • 12 Jun 2019 • Minz Won, Sanghyuk Chun, Xavier Serra

In addition, we demonstrate the interpretability of the proposed architecture with a heat map visualization.

Sound Audio and Speech Processing

Multi-Domain Processing via Hybrid Denoising Networks for Speech Enhancement

1 code implementation • 21 Dec 2018 • Jang-Hyun Kim, Jaejun Yoo, Sanghyuk Chun, Adrian Kim, Jung-Woo Ha

We present a hybrid framework that leverages the trade-off between temporal and frequency precision in audio representations to improve the performance of the speech enhancement task.

Audio and Speech Processing Sound

Scalable Iterative Algorithm for Robust Subspace Clustering

no code implementations • 5 Mar 2015 • Sanghyuk Chun, Yung-Kyun Noh, Jinwoo Shin

Subspace clustering (SC) is a popular method for dimensionality reduction of high-dimensional data, where it generalizes Principal Component Analysis (PCA).

Clustering Dimensionality Reduction
