no code implementations • 23 Jan 2024 • Ee Yeo Keat, Zhang Hao, Alexander Matyasko, Basura Fernando
We introduce VidTFS, a Training-free, open-vocabulary video goal and action inference framework that combines a frozen vision foundation model (VFM) and a large language model (LLM) with a novel dynamic Frame Selection module.
no code implementations • 5 Nov 2019 • Hu Qiang, Gao Feifei, Zhang Hao, Jin Shi, Li Geoffrey Ye
Deep learning (DL) has emerged as an effective tool for channel estimation in wireless communication systems, especially in imperfect environments.
no code implementations • 30 May 2018 • Nguyen Phuong Anh, Lu Yi-Jie, Zhang Hao, Ngo Chong-Wah
This short paper presents the video browsing tool of the VIREO team, which was used in the Video Browser Showdown 2018.