29 Sep 2021 • Raphaël Jean, Pierre-Luc St-Charles, Soren Pirk, Simon Brodeur
Our goal is to show that common Siamese networks can be trained effectively on video sequences to disentangle pose- and motion-related attributes that are useful for both video and non-video tasks, yet are typically suppressed by standard training schemes.