no code implementations • 14 Jun 2021 • YiYu, XinyingWang, WeiHu, XunLuo, ChengLi
In this technical report, we present our solution to localize a spatio-temporal person in an untrimmed video based on a sentence.
Sentence