Search Results for author: Yongtao Hu

Found 4 papers, 2 papers with code

DeepTag: A General Framework for Fiducial Marker Design and Detection

1 code implementation • 28 May 2021 • Zhuming Zhang, Yongtao Hu, Guoxing Yu, Jingwen Dai

Furthermore, a sophisticatedly designed coding system is required to overcome the shortcomings of both markers and detection algorithms.

TAG

TopoTag: A Robust and Scalable Topological Fiducial Marker System

1 code implementation • 5 Aug 2019 • Guoxing Yu, Yongtao Hu, Jingwen Dai

Here we introduce TopoTag, a robust and scalable topological fiducial marker system, which supports reliable and accurate pose estimation from a single image.

3D Pose Estimation, Robot Navigation

Look, Listen and Learn - A Multimodal LSTM for Speaker Identification

no code implementations • 13 Feb 2016 • Jimmy Ren, Yongtao Hu, Yu-Wing Tai, Chuan Wang, Li Xu, Wenxiu Sun, Qiong Yan

This task requires not only collective perception over both visual and auditory signals but also robustness to severe quality degradations and unconstrained content variations.

Speaker Identification

Deep Multimodal Speaker Naming

no code implementations • 17 Jul 2015 • Yongtao Hu, Jimmy Ren, Jingwen Dai, Chang Yuan, Li Xu, Wenping Wang

Automatic speaker naming is the problem of localizing as well as identifying each speaking character in a TV/movie/live show video.

Face Alignment
