2 code implementations • 25 Nov 2020 • Jörgen Valk, Tanel Alumäe
Speech activity detection and speaker diarization are used to extract segments from the videos that contain speech.
Action Detection Activity Detection +4