1 code implementation • 5 Aug 2024 • Zhaowei Li, Wei Wang, Yiqing Cai, Xu Qi, Pengyu Wang, Dong Zhang, Hang Song, Botian Jiang, Zhida Huang, Tao Wang
In this paper, we propose UnifiedMLLM, a comprehensive model designed to represent various tasks using a unified representation.
no code implementations • 14 May 2024 • Wei Wang, Zhaowei Li, Qi Xu, Yiqing Cai, Hang Song, Qi Qi, Ran Zhou, Zhida Huang, Tao Wang, Li Xiao
For negative knowledge, we propose an innovative self-adversarial approach that generates low-quality rationales by sampling previous iterations of smaller language models, embracing the idea that one can learn from one's own weaknesses.
2 code implementations • 11 Jan 2024 • Zhaowei Li, Qi Xu, Dong Zhang, Hang Song, Yiqing Cai, Qi Qi, Ran Zhou, Junting Pan, Zefeng Li, Van Tu Vu, Zhida Huang, Tao Wang
Beyond capturing global information like other multi-modal models, our proposed model excels at tasks demanding a detailed understanding of local information within the input.
2 code implementations • CVPR 2021 • Qiang Meng, Shichao Zhao, Zhida Huang, Feng Zhou
This paper proposes MagFace, a category of losses that learn a universal feature embedding whose magnitude can measure the quality of the given face.
Ranked #1 on
Face Verification
on IJB-C
(training dataset metric)
no code implementations • 23 Aug 2020 • Zhida Huang, Kaiyu Yue, Jiangfan Deng, Feng Zhou
Then we perform NMS only on visible bounding boxes to achieve the best fitting full box in inference.
no code implementations • 22 Nov 2018 • Zhida Huang, Zhuoyao Zhong, Lei Sun, Qiang Huo
In this paper, we present a new Mask R-CNN based text detection approach which can robustly detect multi-oriented and curved text from natural scene images in a unified manner.
Ranked #6 on
Scene Text Detection
on SCUT-CTW1500