no code implementations • ICCV 2015 • Ye Luo, Loong-Fah Cheong, An Tran
We elicit from a fundamental definition of action low-level attributes that can reveal agency and intentionality.
1 code implementation • 30 Aug 2017 • An Tran, Loong-Fah Cheong
This paper proposes a two-stream flow-guided convolutional attention networks for action recognition in videos.
Action Recognition In Videos Temporal Action Localization +1
no code implementations • 14 Oct 2020 • An Tran, Ali Zonoozi, Jagannadan Varadarajan, Hannes Kruppa
In this paper, we propose a two-stage transfer learning technique to improve robustness of semantic segmentation for satellite images that leverages noisy pseudo ground truth masks obtained automatically (without human labor) from crowd-sourced OpenStreetMap (OSM) data.
1 code implementation • 21 Oct 2020 • An Tran, Konstantinos Drossos, Tuomas Virtanen
Automated audio captioning (AAC) is a novel task, where a method takes as an input an audio sample and outputs a textual description (i. e. a caption) of its contents.
no code implementations • 7 Jul 2023 • Wenmiao Hu, Yichen Zhang, Yuxuan Liang, Yifang Yin, Andrei Georgescu, An Tran, Hannes Kruppa, See-Kiong Ng, Roger Zimmermann
Street-view imagery provides us with novel experiences to explore different places remotely.
Ranked #3 on Image-Based Localization on cvact