no code implementations • 8 Oct 2024 • Jiangfan Deng, Zhuang Jia, Zhaoxue Wang, Xiang Long, Daniel K. Du
To achieve accurate parsing of the eye-region, we first leverage the pretrained foundation model Segment Anything (SAM) in an automatic way to refine the eye indications.
no code implementations • 30 Aug 2024 • Zhuang Jia, Jiangfan Deng, Liying Chi, Xiang Long, Daniel K. Du
Parsing of eye components (i. e. pupil, iris and sclera) is fundamental for eye tracking and gaze estimation for AR/VR products.
no code implementations • 22 Nov 2022 • Jiangfan Deng, Dewen Fan, Xiaosong Qiu, Feng Zhou
Crowdedness caused by overlapping among similar objects is a ubiquitous challenge in the field of 2D visual object detection.
1 code implementation • ICCV 2021 • Junfeng Wan, Jiangfan Deng, Xiaosong Qiu, Feng Zhou
Detecting pedestrians and their associated faces jointly is a challenging task. On one hand, body or face could be absent because of occlusion or non-frontal human pose. On the other hand, the association becomes difficult or even miss-leading in crowded scenes due to the lack of strong correlational evidence.
no code implementations • 23 Aug 2020 • Zhida Huang, Kaiyu Yue, Jiangfan Deng, Feng Zhou
Then we perform NMS only on visible bounding boxes to achieve the best fitting full box in inference.
1 code implementation • ECCV 2020 • Kaiyu Yue, Jiangfan Deng, Feng Zhou
However, this introduces two problems: a) The adaptation module brings more parameters into training.