2 code implementations • 8 Jan 2025 • Zijiang Yang, Meishu Song, Xin Jing, Haojie Zhang, Kun Qian, Bin Hu, Kota Tamada, Toru Takumi, Björn W. Schuller, Yoshiharu Yamamoto
The findings suggest promising directions for vocalization analysis and highlight the potential value of audible and ultrasound vocalizations in ASD detection.
no code implementations • 24 Nov 2024 • Haojie Zhang, Zhihao Liang, Ruibo Fu, Zhengqi Wen, Xuefei Liu, Chenxing Li, JianHua Tao, Yaling Liang
Then we propose a suitable solution according to the modality differences of image, audio, and video generation.
1 code implementation • 20 Sep 2024 • Nanqing Liu, Xun Xu, Yongyi Su, Haojie Zhang, Heng-Chao Li
In brief, we use the prompts of overlapping masks as corresponding negative signals, resulting in refined masks.
no code implementations • 17 Sep 2024 • Wonduk Seo, Haojie Zhang, Yueyang Zhang, Changhao Zhang, Songyao Duan, Lixin Su, Daiting Shi, Jiashu Zhao, Dawei Yin
Query reformulation is a well-known problem in Information Retrieval (IR) aimed at enhancing single search successful completion rate by automatically modifying user's input query.
no code implementations • 26 Apr 2024 • Haojie Zhang, Yimeng Zhuang
Our approach enriches the context by utilizing label semantics as suffix prompts.
no code implementations • 25 Apr 2024 • Shen Zhang, Haojie Zhang, Jing Zhang, Xudong Zhang, Yimeng Zhuang, Jinting Wu
In human-computer interaction, it is crucial for agents to respond to human by understanding their emotions.
no code implementations • 8 Jan 2024 • Zhangjin Huang, Zhihao Liang, Haojie Zhang, Yangkai Lin, Kui Jia
Technically, we learn two parallel streams of an implicit signed distance field and an explicit surrogate surface Sur2f mesh, and unify volume rendering of the implicit signed distance function (SDF) and surface rendering of the surrogate mesh with a shared, neural shader; the unified shading promotes their convergence to the same, underlying surface.
1 code implementation • CVPR 2024 • Haojie Zhang, Yongyi Su, Xun Xu, Kui Jia
The success of large language models has inspired the computer vision community to explore image segmentation foundation model that is able to zero/few-shot generalize through prompt engineering.
1 code implementation • 3 Nov 2022 • Haojie Zhang, Ge Li, Jia Li, Zhongjin Zhang, Yuqi Zhu, Zhi Jin
Large-scale pre-trained language models have achieved impressive results on a wide range of downstream tasks recently.
no code implementations • 9 Oct 2022 • Haojie Zhang, Mingfei Liang, Ruobing Xie, Zhenlong Sun, Bo Zhang, Leyu Lin
Motivated by the above investigation, we propose two novel techniques to improve pre-trained language models: Decoupled Directional Relative Position (DDRP) encoding and MTH pre-training objective.