no code implementations • 3 Sep 2024 • Chuanyang Ma, Jiangtao Li, Xingqun Qi, Muyi Sun, Huiling Zhou
In this paper, we introduce a systematic framework: Pest Manager for precise pest counting and identification within the invisible grain pile environment.
no code implementations • 28 May 2024 • Huiling Zhou, Xianhao Wu, Hongming Chen
Despite the superiority of convolutional neural networks (CNNs) and Transformers in single-image rain removal, current multi-scale models still face significant challenges due to their reliance on single-scale feature pyramid patterns.
no code implementations • 16 May 2024 • Huiling Zhou, Xianhao Wu, Hongming Chen, Xiang Chen, Xin He
To this end, we propose the first lightweight network on the mamba-based model called RSDhamba in the field of RSID.
1 code implementation • 19 Jul 2022 • Shuai Bai, Huiling Zhou, Zhikang Li, Chang Zhou, Hongxia Yang
Virtual try-on aims to generate a photo-realistic fitting result given an in-shop garment and a reference person image.
Ranked #3 on Virtual Try-on on VITON
no code implementations • 24 May 2022 • Zhikang Li, Huiling Zhou, Shuai Bai, Peike Li, Chang Zhou, Hongxia Yang
The fashion industry has diverse applications in multi-modal image generation and editing.
1 code implementation • 29 Mar 2022 • Xiao Pan, Peike Li, Zongxin Yang, Huiling Zhou, Chang Zhou, Hongxia Yang, Jingren Zhou, Yi Yang
By contrast, pixel-level optimization is more explicit, however, it is sensitive to the visual quality of training data and is not robust to object deformation.
no code implementations • 7 Dec 2021 • Huiling Zhou, Jie Liu, Zhikang Li, Jin Yu, Hongxia Yang
With user history represented by a domain-aware sequential model, a frequency encoder is applied to the underlying tags for user content preference learning.
no code implementations • 1 Mar 2021 • Junyang Lin, Rui Men, An Yang, Chang Zhou, Ming Ding, Yichang Zhang, Peng Wang, Ang Wang, Le Jiang, Xianyan Jia, Jie Zhang, Jianwei Zhang, Xu Zou, Zhikang Li, Xiaodong Deng, Jie Liu, Jinbao Xue, Huiling Zhou, Jianxin Ma, Jin Yu, Yong Li, Wei Lin, Jingren Zhou, Jie Tang, Hongxia Yang
In this work, we construct the largest dataset for multimodal pretraining in Chinese, which consists of over 1. 9TB images and 292GB texts that cover a wide range of domains.
no code implementations • 22 Jan 2021 • Dehong Gao, Wenjing Yang, Huiling Zhou, Yi Wei, Yi Hu, Hao Wang
The majority of current MTL studies adopt the hard parameter sharing structure, where hard layers tend to learn general representations over all tasks and specific layers are prone to learn specific representations for each task.
1 code implementation • WS 2020 • Dehong Gao, Wenjing Yang, Huiling Zhou, Yi Wei, Yi Hu, Hao Wang
In e-commerce system, category prediction is to automatically predict categories of given texts.