Active Speaker Localization

1 benchmark • 2 datasets

Active Speaker Localization (ASL) is the task of estimating the spatial position of an active speaker (talker) in an environment using audio, vision, or both.

Most implemented papers