no code implementations • 8 Nov 2024 • Yusuke Oumi, Yuto Shibata, Go Irie, Akisato Kimura, Yoshimitsu Aoki, Mariko Isogawa
This paper explores the problem of 3D human pose estimation from only low-level acoustic signals.
no code implementations • 5 Sep 2024 • Junpei Honma, Akisato Kimura, Go Irie
Measuring 3D geometric structures of indoor scenes requires dedicated depth sensors, which are not always available.
1 code implementation • 29 Mar 2024 • Atsuyuki Miyai, Jingkang Yang, Jingyang Zhang, Yifei Ming, Qing Yu, Go Irie, Yixuan Li, Hai Li, Ziwei Liu, Kiyoharu Aizawa
This paper introduces a novel and significant challenge for Vision Language Models (VLMs), termed Unsolvable Problem Detection (UPD).
1 code implementation • 2 Oct 2023 • Atsuyuki Miyai, Qing Yu, Go Irie, Kiyoharu Aizawa
We consider that such data may significantly affect the performance of large pre-trained networks because the discriminability of these OOD data depends on the pre-training algorithm.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
no code implementations • 30 Jul 2023 • Qing Yu, Go Irie, Kiyoharu Aizawa
Unsupervised domain adaptation (UDA) has proven to be very effective in transferring knowledge obtained from a source domain with labeled data to a target domain with unlabeled data.
1 code implementation • NeurIPS 2023 • Atsuyuki Miyai, Qing Yu, Go Irie, Kiyoharu Aizawa
CLIP's local features have a lot of ID-irrelevant nuisances (e. g., backgrounds), and by learning to push them away from the ID class text embeddings, we can remove the nuisances in the ID class text embeddings and enhance the separation between ID and OOD.
Out-of-Distribution Detection Out of Distribution (OOD) Detection
2 code implementations • 10 Apr 2023 • Atsuyuki Miyai, Qing Yu, Go Irie, Kiyoharu Aizawa
First, images should be collected using only the name of the ID class without training on the ID data.
no code implementations • CVPR 2023 • Yuto Shibata, Yutaka Kawashima, Mariko Isogawa, Go Irie, Akisato Kimura, Yoshimitsu Aoki
Aiming to capture subtle sound changes to reveal detailed pose information, we explicitly extract phase features from the acoustic signals together with typical spectrum features and feed them into our human pose estimation network.
1 code implementation • 23 Oct 2022 • Atsuyuki Miyai, Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa
The semantics of an image can be rotation-invariant or rotation-variant, so whether the rotated image is treated as positive or negative should be determined based on the content of the image.
no code implementations • CVPR 2021 • Yu Mitsuzumi, Go Irie, Daiki Ikami, Takashi Shibata
The key to our approach is self-supervised class-destructive learning, which enables the learning of class-invariant representations and domain-adversarial classifiers without using any domain labels.
no code implementations • ECCV 2020 • Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa
Semi-supervised learning (SSL) has been proposed to leverage unlabeled data for training powerful models when only limited labeled data is available.
no code implementations • 18 Apr 2019 • Onkar Krishna, Kiyoharu Aizawa, Go Irie
Observer's of different age-group have shown different scene viewing tendencies independent to the class of the image viewed.
1 code implementation • 30 Mar 2018 • Akito Takeki, Daiki Ikami, Go Irie, Kiyoharu Aizawa
Convolutional neural network (CNN) architectures utilize downsampling layers, which restrict the subsequent layers to learn spatially invariant features while reducing computational costs.
no code implementations • ICCV 2015 • Go Irie, Hiroyuki Arai, Yukinobu Taniguchi
This paper addresses the problem of unsupervised learning of binary hash codes for efficient cross-modal retrieval.
no code implementations • CVPR 2014 • Go Irie, Zhenguo Li, Xiao-Ming Wu, Shih-Fu Chang
Previous efforts in hashing intend to preserve data variance or pairwise affinity, but neither is adequate in capturing the manifold structures hidden in most visual data.
no code implementations • CVPR 2013 • Go Irie, Dong Liu, Zhenguo Li, Shih-Fu Chang
nary learning methods rely on image descriptors alone or together with class labels.