Search Results for author: Yongtao Hu

Found 4 papers, 2 papers with code

DeepTag: A General Framework for Fiducial Marker Design and Detection

1 code implementation • 28 May 2021 • Zhuming Zhang, Yongtao Hu, Guoxing Yu, Jingwen Dai

Furthermore, a sophisticatedly designed coding system is required to overcome the shortcomings of both markers and detection algorithms.

TAG

TopoTag: A Robust and Scalable Topological Fiducial Marker System

1 code implementation • 5 Aug 2019 • Guoxing Yu, Yongtao Hu, Jingwen Dai

Here we introduce TopoTag, a robust and scalable topological fiducial marker system, which supports reliable and accurate pose estimation from a single image.

3D Pose Estimation, Robot Navigation

Look, Listen and Learn - A Multimodal LSTM for Speaker Identification

No code implementations • 13 Feb 2016 • Jimmy Ren, Yongtao Hu, Yu-Wing Tai, Chuan Wang, Li Xu, Wenxiu Sun, Qiong Yan

This task requires not only collective perception over both visual and auditory signals; robustness to severe quality degradations and unconstrained content variations is also indispensable.

Speaker Identification

Deep Multimodal Speaker Naming

No code implementations • 17 Jul 2015 • Yongtao Hu, Jimmy Ren, Jingwen Dai, Chang Yuan, Li Xu, Wenping Wang

Automatic speaker naming is the problem of localizing as well as identifying each speaking character in a TV/movie/live show video.

Face Alignment
