Search Results for author: Shoichiro Saito

Found 11 papers, 6 papers with code

Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis

no code implementations • 12 Apr 2024 • Masahiro Yasuda, Noboru Harada, Yasunori Ohishi, Shoichiro Saito, Akira Nakayama, Nobutaka Ono

This is because the information obtained from a single sensor is often missing or fragmented in such an environment; observations from multiple locations and modalities should be integrated to analyze events comprehensively.

6DoF SELD: Sound Event Localization and Detection Using Microphones and Motion Tracking Sensors on self-motioning human

no code implementations • 4 Mar 2024 • Masahiro Yasuda, Shoichiro Saito, Akira Nakayama, Noboru Harada

A system trained only with a dataset using microphone arrays in a fixed position would be unable to adapt to the fast relative motion of sound events associated with self-motion, resulting in the degradation of SELD performance.

Sound Event Localization and Detection

Multi-view and Multi-modal Event Detection Utilizing Transformer-based Multi-sensor fusion

1 code implementation • 18 Feb 2022 • Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito, Noboru Harada

We tackle a challenging task: multi-view and multi-modal event detection that detects events in a wide-range real environment by utilizing data from distributed cameras and microphones and their weak labels.

Event Detection, Sensor Fusion

A Transformer-based Audio Captioning Model with Keyword Estimation

no code implementations • 1 Jul 2020 • Yuma Koizumi, Ryo Masumura, Kyosuke Nishida, Masahiro Yasuda, Shoichiro Saito

TRACKE estimates keywords, which comprise a word set corresponding to audio events/scenes in the input audio, and generates the caption while referring to the estimated keywords to reduce word-selection indeterminacy.

Acoustic Scene Classification, Audio captioning, +2

DOA Estimation by DNN-based Denoising and Dereverberation from Sound Intensity Vector

no code implementations • 10 Oct 2019 • Masahiro Yasuda, Yuma Koizumi, Luca Mazzon, Shoichiro Saito, Hisashi Uematsu

We propose a direction of arrival (DOA) estimation method that combines sound-intensity vector (IV)-based DOA estimation and DNN-based denoising and dereverberation.

Denoising

ToyADMOS: A Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection

2 code implementations • 9 Aug 2019 • Yuma Koizumi, Shoichiro Saito, Hisashi Uematsu, Noboru Harada, Keisuke Imoto

To build a large-scale dataset for ADMOS, we collected anomalous operating sounds of miniature machines (toys) by deliberately damaging them.

Anomaly Detection

Unsupervised Detection of Anomalous Sound based on Deep Learning and the Neyman-Pearson Lemma

1 code implementation • 22 Oct 2018 • Yuma Koizumi, Shoichiro Saito, Hisashi Uematsu, Yuta Kawachi, Noboru Harada

To calculate the TPR in the objective function, we consider that the set of anomalous sounds is the complementary set of normal sounds and simulate anomalous sounds by using a rejection sampling algorithm.

LEMMA, Unsupervised Anomaly Detection, +1
