no code implementations • 6 Apr 2024 • Shufan Li, Konstantinos Kallidromitis, Akash Gokul, Yusuke Kato, Kazuki Kozuka
We present Diffusion-KTO, a novel approach for aligning text-to-image diffusion models by formulating the alignment objective as the maximization of expected human utility.
no code implementations • 31 Dec 2023 • Tiange Xiang, Adam Sun, Scott Delp, Kazuki Kozuka, Li Fei-Fei, Ehsan Adeli
In this work, we present Wild2Avatar, a neural rendering approach catered for occluded in-the-wild monocular videos.
1 code implementation • NeurIPS 2023 • Xudong Wang, Shufan Li, Konstantinos Kallidromitis, Yusuke Kato, Kazuki Kozuka, Trevor Darrell
Open-vocabulary image segmentation aims to partition an image into semantic regions according to arbitrary text descriptions.
Ranked #1 on Image Segmentation on Pascal Panoptic Parts
no code implementations • 16 Feb 2023 • Hiroki Adachi, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi, Yasunori Ishii, Kazuki Kozuka
Adversarial training is a popular and straightforward technique to defend against the threat of adversarial examples.
no code implementations • 12 Sep 2022 • Shungo Fujii, Yasunori Ishii, Kazuki Kozuka, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi
Data augmentation is an essential technique for improving recognition accuracy in object recognition using deep learning.
1 code implementation • 25 Aug 2022 • Akash Gokul, Konstantinos Kallidromitis, Shufan Li, Yusuke Kato, Kazuki Kozuka, Trevor Darrell, Colorado J Reed
Recent works in self-supervised learning have demonstrated strong performance on scene-level dense prediction tasks by pretraining with object-centric or region-based correspondence objectives.
no code implementations • 11 May 2022 • Risako Tanigawa, Yasunori Ishii, Kazuki Kozuka, Takayoshi Yamashita
They are highly and widely used in tasks such as segmentation.
no code implementations • 15 Apr 2022 • Risako Tanigawa, Yasunori Ishii, Kazuki Kozuka, Takayoshi Yamashita
In inference, it is possible to obtain instance segmentation results only from sound images.
1 code implementation • 24 Oct 2021 • Konstantinos Kallidromitis, Denis Gudovskiy, Kazuki Kozuka, Iku Ohama, Luca Rigazio
In this paper, we propose a novel self-supervised learning framework that combines contrastive learning with neural processes.
3 code implementations • 27 Jul 2021 • Denis Gudovskiy, Shun Ishizaka, Kazuki Kozuka
Our approach results in a computationally and memory-efficient model: CFLOW-AD is faster and smaller by a factor of 10x than prior state-of-the-art with the same input setting.
Ranked #13 on Anomaly Detection on VisA (Detection AUROC metric)
1 code implementation • CVPR 2021 • Nishant Rai, Haofeng Chen, Jingwei Ji, Rishi Desai, Kazuki Kozuka, Shun Ishizaka, Ehsan Adeli, Juan Carlos Niebles
However, there remains a lack of studies that extend action composition and leverage multiple viewpoints and multiple modalities of data for representation learning.
Ranked #1 on Video Classification on Home Action Genome
1 code implementation • CVPR 2021 • Denis Gudovskiy, Luca Rigazio, Shun Ishizaka, Kazuki Kozuka, Sotaro Tsukizawa
To overcome these limitations, we reformulate AutoAugment as a generalized automated dataset optimization (AutoDO) task that minimizes the distribution shift between test data and distorted train dataset.