Search Results for author: Zhida Huang

Found 4 papers, 2 papers with code

GroundingGPT:Language Enhanced Multi-modal Grounding Model

2 code implementations11 Jan 2024 Zhaowei Li, Qi Xu, Dong Zhang, Hang Song, Yiqing Cai, Qi Qi, Ran Zhou, Junting Pan, Zefeng Li, Van Tu Vu, Zhida Huang, Tao Wang

Beyond capturing global information like other multi-modal models, our proposed model excels at tasks demanding a detailed understanding of local information within the input.

Language Modelling Large Language Model

MagFace: A Universal Representation for Face Recognition and Quality Assessment

2 code implementations CVPR 2021 Qiang Meng, Shichao Zhao, Zhida Huang, Feng Zhou

This paper proposes MagFace, a category of losses that learn a universal feature embedding whose magnitude can measure the quality of the given face.

 Ranked #1 on Face Verification on IJB-C (training dataset metric)

Clustering Face Quality Assessement +1

Visible Feature Guidance for Crowd Pedestrian Detection

no code implementations23 Aug 2020 Zhida Huang, Kaiyu Yue, Jiangfan Deng, Feng Zhou

Then we perform NMS only on visible bounding boxes to achieve the best fitting full box in inference.

Pedestrian Detection

Mask R-CNN with Pyramid Attention Network for Scene Text Detection

no code implementations22 Nov 2018 Zhida Huang, Zhuoyao Zhong, Lei Sun, Qiang Huo

In this paper, we present a new Mask R-CNN based text detection approach which can robustly detect multi-oriented and curved text from natural scene images in a unified manner.

Curved Text Detection Text Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.