no code implementations • 12 Apr 2024 • Masahiro Yasuda, Noboru Harada, Yasunori Ohishi, Shoichiro Saito, Akira Nakayama, Nobutaka Ono
This is because the information obtained from a single sensor is often missing or fragmented in such an environment; observations from multiple locations and modalities should be integrated to analyze events comprehensively.
no code implementations • 4 Mar 2024 • Masahiro Yasuda, Shoichiro Saito, Akira Nakayama, Noboru Harada
A system trained only with a dataset using microphone arrays in a fixed position would be unable to adapt to the fast relative motion of sound events associated with self-motion, resulting in the degradation of SELD performance.
1 code implementation • 18 Feb 2022 • Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito, Noboru Harada
We tackle a challenging task: multi-view and multi-modal event detection that detects events in a wide-range real environment by utilizing data from distributed cameras and microphones and their weak labels.
1 code implementation • 18 Feb 2022 • Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito
Our goal is to develop a sound event localization and detection (SELD) system that works robustly in unknown environments.
1 code implementation • 17 Feb 2022 • Kento Nagatomo, Masahiro Yasuda, Kohei Yatabe, Shoichiro Saito, Yasuhiro Oikawa
Sound event localization and detection (SELD) is a combined task of identifying the sound event and its direction.
7 code implementations • 4 Jun 2021 • Noboru Harada, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Masahiro Yasuda, Shoichiro Saito
This paper proposes a new large-scale dataset called "ToyADMOS2" for anomaly detection in machine operating sounds (ADMOS).
no code implementations • 1 Jul 2020 • Yuma Koizumi, Ryo Masumura, Kyosuke Nishida, Masahiro Yasuda, Shoichiro Saito
TRACKE estimates keywords, which comprise a word set corresponding to audio events/scenes in the input audio, and generates the caption while referring to the estimated keywords to reduce word-selection indeterminacy.
no code implementations • 10 Oct 2019 • Masahiro Yasuda, Yuma Koizumi, Luca Mazzon, Shoichiro Saito, Hisashi Uematsu
We propose a direction of arrival (DOA) estimation method that combines sound-intensity vector (IV)-based DOA estimation and DNN-based denoising and dereverberation.
2 code implementations • 9 Aug 2019 • Yuma Koizumi, Shoichiro Saito, Hisashi Uematsu, Noboru Harada, Keisuke Imoto
To build a large-scale dataset for ADMOS, we collected anomalous operating sounds of miniature machines (toys) by deliberately damaging them.
no code implementations • 19 Jul 2019 • Yuma Koizumi, Shoichiro Saito, Masataka Yamaguchi, Shin Murata, Noboru Harada
The AE is trained to minimize the sample mean of the anomaly score of normal sounds in a mini-batch.
1 code implementation • 22 Oct 2018 • Yuma Koizumi, Shoichiro Saito, Hisashi Uematsum Yuta Kawachi, Noboru Harada
To calculate the TPR in the objective function, we consider that the set of anomalous sounds is the complementary set of normal sounds and simulate anomalous sounds by using a rejection sampling algorithm.