no code implementations • 21 Jun 2020 • Shota Sakaguchi, Jun Kato, Masataka Goto, Seiichi Uchida
To analyze the motion of lyric words, we first apply a state-of-the-art scene text detector and recognizer to each video frame.
Clustering • Dynamic Time Warping +2
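
The per-frame step described in the abstract can be pictured with a minimal sketch. It assumes a hypothetical `detect_and_recognize` callable that returns `(word, x_center, y_center)` tuples for one frame. The function names, the tuple layout, and the grouping by lowercased word string are illustrative assumptions, not the authors' method, and the stand-in detector below only replays hand-written detections.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

# Assumed layout for one detected word in one frame: (word, x_center, y_center).
Detection = Tuple[str, float, float]
# One point of a word's motion: (frame_index, x_center, y_center).
TrackPoint = Tuple[int, float, float]


def collect_word_tracks(
    frames: Iterable[object],
    detect_and_recognize: Callable[[object], List[Detection]],
) -> Dict[str, List[TrackPoint]]:
    """Run the detector/recognizer on every frame and group results by word.

    Grouping by the recognized string is a simplification: a lyric word that
    appears several times in a video would need a real tracking step.
    """
    tracks: Dict[str, List[TrackPoint]] = defaultdict(list)
    for frame_index, frame in enumerate(frames):
        for word, x, y in detect_and_recognize(frame):
            tracks[word.lower()].append((frame_index, x, y))
    return dict(tracks)


def fake_detector(frame: object) -> List[Detection]:
    # Hypothetical stand-in: each "frame" here is already a list of detections.
    return list(frame)  # type: ignore[call-overload]


frames = [
    [("Hello", 10.0, 50.0)],
    [("Hello", 12.0, 50.0), ("world", 80.0, 50.0)],
    [("world", 82.0, 48.0)],
]
tracks = collect_word_tracks(frames, fake_detector)
```

Each entry of `tracks` is then a per-word trajectory over frame indices, which is the kind of sequence a later motion analysis could consume. A real pipeline would replace `fake_detector` with an actual scene text detector and recognizer.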