Search Results for author: Bong-Jin Lee

Found 17 papers, 3 papers with code

Rethinking Session Variability: Leveraging Session Embeddings for Session Robustness in Speaker Verification

no code implementations • 26 Sep 2023 • Hee-Soo Heo, Kihyun Nam, Bong-Jin Lee, Youngki Kwon, Minjae Lee, You Jin Kim, Joon Son Chung

In the field of speaker verification, session or channel variability poses a significant challenge.

Speaker Verification

Paper
Add Code

Encoder-decoder multimodal speaker change detection

no code implementations • 1 Jun 2023 • Jee-weon Jung, Soonshin Seo, Hee-Soo Heo, Geonmin Kim, You Jin Kim, Young-ki Kwon, Minjae Lee, Bong-Jin Lee

The task of speaker change detection (SCD), which detects points where speakers change in an input, is essential for several applications.

Automatic Speech Recognition Change Detection +2

Paper
Add Code

Absolute decision corrupts absolutely: conservative online speaker diarisation

no code implementations • 9 Nov 2022 • Youngki Kwon, Hee-Soo Heo, Bong-Jin Lee, You Jin Kim, Jee-weon Jung

Our focus lies in developing an online speaker diarisation framework which demonstrates robust performance across diverse domains.

Paper
Add Code

High-resolution embedding extractor for speaker diarisation

no code implementations • 8 Nov 2022 • Hee-Soo Heo, Youngki Kwon, Bong-Jin Lee, You Jin Kim, Jee-weon Jung

Extracted dense frame-level embeddings can each represent a speaker.

Vocal Bursts Intensity Prediction

Paper
Add Code

In search of strong embedding extractors for speaker diarisation

no code implementations • 26 Oct 2022 • Jee-weon Jung, Hee-Soo Heo, Bong-Jin Lee, Jaesung Huh, Andrew Brown, Youngki Kwon, Shinji Watanabe, Joon Son Chung

First, the evaluation is not straightforward because the features required for better performance differ between speaker verification and diarisation.

Data Augmentation Speaker Verification

Paper
Add Code

Large-scale learning of generalised representations for speaker recognition

no code implementations • 20 Oct 2022 • Jee-weon Jung, Hee-Soo Heo, Bong-Jin Lee, Jaesong Lee, Hye-jin Shim, Youngki Kwon, Joon Son Chung, Shinji Watanabe

We also show that training with proposed large data configurations gives better performance.

Inductive Bias Speaker Recognition

Paper
Add Code

Curriculum learning for self-supervised speaker verification

no code implementations • 28 Mar 2022 • Hee-Soo Heo, Jee-weon Jung, Jingu Kang, Youngki Kwon, You Jin Kim, Bong-Jin Lee, Joon Son Chung

The goal of this paper is to train effective self-supervised speaker representations without identity labels.

Self-Supervised Learning Speaker Recognition +1

Paper
Add Code

SASV 2022: The First Spoofing-Aware Speaker Verification Challenge

no code implementations • 28 Mar 2022 • Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas Evans, Tomi Kinnunen

Pre-trained spoofing detection and speaker verification models are provided as open source and are used in two baseline SASV solutions.

Speaker Verification

Paper
Add Code

Pushing the limits of raw waveform speaker recognition

2 code implementations • 16 Mar 2022 • Jee-weon Jung, You Jin Kim, Hee-Soo Heo, Bong-Jin Lee, Youngki Kwon, Joon Son Chung

Our best model achieves an equal error rate of 0. 89%, which is competitive with the state-of-the-art models based on handcrafted features, and outperforms the best model based on raw waveform inputs by a large margin.

Self-Supervised Learning Speaker Recognition +1

971

Paper
Code

Advancing the dimensionality reduction of speaker embeddings for speaker diarisation: disentangling noise and informing speech activity

no code implementations • 7 Oct 2021 • You Jin Kim, Hee-Soo Heo, Jee-weon Jung, Youngki Kwon, Bong-Jin Lee, Joon Son Chung

The objective of this work is to train noise-robust speaker embeddings adapted for speaker diarisation.

Dimensionality Reduction

Paper
Add Code

Multi-scale speaker embedding-based graph attention networks for speaker diarisation

no code implementations • 7 Oct 2021 • Youngki Kwon, Hee-Soo Heo, Jee-weon Jung, You Jin Kim, Bong-Jin Lee, Joon Son Chung

The objective of this work is effective speaker diarisation using multi-scale speaker embeddings.

Graph Attention

Paper
Add Code

AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks

1 code implementation • 4 Oct 2021 • Jee-weon Jung, Hee-Soo Heo, Hemlata Tak, Hye-jin Shim, Joon Son Chung, Bong-Jin Lee, Ha-Jin Yu, Nicholas Evans

Artefacts that differentiate spoofed from bona-fide utterances can reside in spectral or temporal domains.

Ranked #1 on Voice Anti-spoofing on ASVspoof 2019 - LA

Graph Attention Voice Anti-spoofing

116

Paper
Code

Look Who's Talking: Active Speaker Detection in the Wild

1 code implementation • 17 Aug 2021 • You Jin Kim, Hee-Soo Heo, Soyeon Choe, Soo-Whan Chung, Yoohwan Kwon, Bong-Jin Lee, Youngki Kwon, Joon Son Chung

Face tracks are extracted from the videos and active segments are annotated based on the timestamps of VoxConverse in a semi-automatic way.

Paper
Code

Adapting Speaker Embeddings for Speaker Diarisation

no code implementations • 7 Apr 2021 • Youngki Kwon, Jee-weon Jung, Hee-Soo Heo, You Jin Kim, Bong-Jin Lee, Joon Son Chung

The goal of this paper is to adapt speaker embeddings for solving the problem of speaker diarisation.

Clustering Dimensionality Reduction +1

Paper
Add Code

Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural Network

no code implementations • 7 Apr 2021 • Jee-weon Jung, Hee-Soo Heo, Youngki Kwon, Joon Son Chung, Bong-Jin Lee

In this work, we propose an overlapped speech detection system trained as a three-class classifier.

Binary Classification speaker-diarization +1

Paper
Add Code

End-to-End Lip Synchronisation Based on Pattern Classification

no code implementations • 18 May 2020 • You Jin Kim, Hee Soo Heo, Soo-Whan Chung, Bong-Jin Lee

The goal of this work is to synchronise audio and video of a talking face using deep neural network models.

Classification General Classification

Paper
Add Code

Who said that?: Audio-visual speaker diarisation of real-world meetings

no code implementations • 24 Jun 2019 • Joon Son Chung, Bong-Jin Lee, Icksang Han

The goal of this work is to determine 'who spoke when' in real-world meetings.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.