no code implementations • 11 Dec 2024 • Zihao Han, Baoquan Zhang, Lisai Zhang, Shanshan Feng, Kenghong Lin, Guotao Liang, Yunming Ye, Xiaochen Qi, Guangming Ye
Although these methods have shown superior performance, in this paper, we find that 1) existing methods suffer from a schedule-restoration mismatching issue, i. e., the theoretical schedule and practical restoration processes usually exist a large discrepancy, which theoretically results in the schedule not fully leveraged for restoring images; and 2) the key reason causing such issue is that the restoration process of all pixels are actually asynchronous but existing methods set a synchronous noise schedule to them, i. e., all pixels shares the same noise schedule.
1 code implementation • CVPR 2023 • Yunpeng Han, Lisai Zhang, Qingcai Chen, Zhijian Chen, Zhonghua Li, Jianxin Yang, Zhao Cao
We propose a method for fine-grained fashion vision-language pre-training based on fashion Symbols and Attributes Prompt (FashionSAP) to model fine-grained multi-modalities fashion attributes and characteristics.
no code implementations • 9 Mar 2023 • Lisai Zhang, Qingcai Chen, Zhijian Chen, Yunpeng Han, Zhonghua Li, Zhao Cao
In this paper, we propose a fine-grained VLP scheme without object annotations from the linguistic perspective.
no code implementations • 20 Oct 2021 • Lisai Zhang, Hongfa Wu, Qingcai Chen, Yimeng Deng, Zhonghua Li, Dejiang Kong, Zhao Cao, Joanna Siebert, Yunpeng Han
Cross-model retrieval has emerged as one of the most important upgrades for text-only search engines (SE).
1 code implementation • ACL 2021 • Shuoran Jiang, Qingcai Chen, Xin Liu, Baotian Hu, Lisai Zhang
In this study, we define the spectral graph convolutional network with the high-order dynamic Chebyshev approximation (HDGCN), which augments the multi-hop graph reasoning by fusing messages aggregated from direct and long-term dependencies into one convolutional layer.
1 code implementation • CVPR 2021 • Baoquan Zhang, Xutao Li, Yunming Ye, Zhichao Huang, Lisai Zhang
To avoid the prototype completion error caused by primitive knowledge noises or class differences, we further develop a Gaussian based prototype fusion strategy that combines the mean-based and completed prototypes by exploiting the unlabeled samples.
1 code implementation • 7 Apr 2020 • Lisai Zhang, Qingcai Chen, Baotian Hu, Shuoran Jiang
To fulfill such a task, we propose a novel inpainting model named Text-Guided Dual Attention Inpainting Network (TDANet).
no code implementations • 1 Dec 2019 • Lisai Zhang, Qingcai Chen, Dongfang Li, Buzhou Tang
In the framework, the visual features are obtained through a visualization and fusion mechanism.
Natural Language Inference
Natural Language Understanding
+2