no code implementations • 2 Apr 2024 • Wanrong Zheng, Haidong Zhu, Zhaoheng Zheng, Ram Nevatia
We demonstrate that with refined skeletons, the performance of the gait recognition model can achieve further improvement on public gait recognition datasets compared with state-of-the-art methods without extra annotations.
1 code implementation • 7 Dec 2023 • Zhaoheng Zheng, Jingmin Wei, Xuefeng Hu, Haidong Zhu, Ram Nevatia
Thus, we propose LLaMP, Large Language Models as Prompt learners, that produces adaptive prompts for the CLIP text encoder, establishing it as the connecting bridge.
1 code implementation • 1 Dec 2023 • Tianyu Ding, Tianyi Chen, Haidong Zhu, Jiachen Jiang, Yiqi Zhong, Jinxin Zhou, Guangzhi Wang, Zhihui Zhu, Ilya Zharkov, Luming Liang
The rapid growth of Large Language Models (LLMs) has been a driving force in transforming various domains, reshaping the artificial general intelligence landscape.
1 code implementation • 27 Nov 2023 • Haidong Zhu, Tianyu Ding, Tianyi Chen, Ilya Zharkov, Ram Nevatia, Luming Liang
CaesarNeRF explicitly models pose differences of reference views to combine scene-level semantic representations, providing a calibrated holistic understanding.
no code implementations • 24 Oct 2023 • Haidong Zhu, Wanrong Zheng, Zhaoheng Zheng, Ram Nevatia
PSE encodes the body shape via binarized silhouettes, skeleton motions, and 3-D body shape, while AAE provides two levels of temporal appearance feature aggregation: attention-based feature aggregation and averaging aggregation.
1 code implementation • 26 May 2023 • Zhaoheng Zheng, Haidong Zhu, Ram Nevatia
In this paper, we study the problem of Compositional Zero-Shot Learning (CZSL), which is to recognize novel attribute-object combinations with pre-existing concepts.
1 code implementation • 16 Apr 2023 • Haidong Zhu, Wanrong Zheng, Zhaoheng Zheng, Ram Nevatia
Two common modalities used for representing the walking sequence of a person are silhouettes and joint skeletons.
Ranked #3 on Multiview Gait Recognition on CASIA-B
1 code implementation • 16 Apr 2023 • Haidong Zhu, Zhaoheng Zheng, Wanrong Zheng, Ram Nevatia
This paper addresses the problem of human rendering in video with temporal appearance constancy.
no code implementations • 18 Dec 2022 • Haidong Zhu, Zhaoheng Zheng, Ram Nevatia
Gait recognition, which identifies individuals based on their walking patterns, is an important biometric technique since it can be observed from a distance and does not require the subject's cooperation.
no code implementations • 5 Nov 2020 • Haidong Zhu, Arka Sadhu, Zhaoheng Zheng, Ram Nevatia
The annotated language queries available during training are limited, which also limits the variations of language combinations that a model can see during training.
no code implementations • 17 Apr 2020 • Chuanzi He, Haidong Zhu, Jiyang Gao, Kan Chen, Ram Nevatia
The task of referring relationships is to localize subject and object entities in an image satisfying a relationship query, which is given in the form of <subject, predicate, object>.
1 code implementation • ECCV 2020 • Yueqi Duan, Haidong Zhu, He Wang, Li Yi, Ram Nevatia, Leonidas J. Guibas
When learning to sketch, beginners start with simple and flexible shapes, and then gradually strive for more complex and accurate ones in the subsequent training sessions.
no code implementations • 27 Jul 2019 • Haidong Zhu, Jialin Shi, Ji Wu
We propose a solution in which the network automatically evaluates the relative quality of the labels in the training set and uses the good ones to tune the network parameters.