no code implementations • 9 Sep 2024 • Run Luo, Haonan Zhang, Longze Chen, Ting-En Lin, Xiong Liu, Yuchuan Wu, Min Yang, Minzheng Wang, Pengpeng Zeng, Lianli Gao, Heng Tao Shen, Yunshui Li, Xiaobo Xia, Fei Huang, Jingkuan Song, Yongbin Li
This framework iteratively improve data quality through a refined combination of fine-grained perception, cognitive reasoning, and interaction evolution, generating a more complex and diverse image-text instruction dataset that empowers MLLMs with enhanced capabilities.
1 code implementation • 21 May 2024 • Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Yihang Duan, Xinyu Lyu, HengTao Shen
Then, we devise a shared local interaction module that employs several learnable queries to capture latent semantic concepts for learning fine-grained alignment.
1 code implementation • 4 Apr 2024 • Haonan Zhang, Dongxia Wang, Zhu Sun, Yanhui Li, Youcheng Sun, HuiZhi Liang, Wenhai Wang
We consider the scenarios where knowledge in a KG gets completely removed, randomly distorted and decreased, and also where recommendations are for cold-start users.
1 code implementation • CVPR 2024 • Haonan Zhang, Longjun Liu, Yuqi Huang, Zhao Yang, Xinyu Lei, Bihan Wen
To address these issues we propose a simple yet effective Category-aware Knowledge Distillation and Pruning (CaKDP) framework for compressing 3D detectors.
1 code implementation • 19 Dec 2023 • Hongyi He, Longjun Liu, Haonan Zhang, Nanning Zheng
Among existing Neural Architecture Search methods, DARTS is known for its efficiency and simplicity.
no code implementations • 19 Aug 2023 • Yubo Shu, Haonan Zhang, Hansu Gu, Peng Zhang, Tun Lu, Dongsheng Li, Ning Gu
The rapid evolution of the web has led to an exponential growth in content.
no code implementations • 2 May 2023 • Haonan Zhang, Yuhan Zhang, Qing Wu, Jiangjie Wu, Zhiming Zhen, Feng Shi, Jianmin Yuan, Hongjiang Wei, Chen Liu, Yuyao Zhang
The anisotropic volume's high-resolution (HR) plane is used to build the HR-LR image pairs for model training.
no code implementations • 6 Apr 2023 • Haonan Zhang
This research aims to develop policy strategies to reduce embodied carbon emissions in retrofits.
no code implementations • 2 Apr 2023 • Haonan Zhang
This study employed EnergyPlus to examine the energy performance of 11 energy retrofit measures for a typical multi-unit residential building (MURB) in Metro Vancouver, British Columbia, Canada.
2 code implementations • NeurIPS 2022 2022 • Hao Li, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Haonan Zhang, Gongfu Li
To verify the effectiveness of our approach, extensive experiments are conducted on MS-COCO, CUB Captions, and Flickr30K, which are commonly used in cross-modal retrieval.
1 code implementation • 17 Nov 2022 • Pengpeng Zeng, Haonan Zhang, Lianli Gao, Xiangpeng Li, Jin Qian, Heng Tao Shen
Generating consecutive descriptions for videos, i. e., Video Captioning, requires taking full advantage of visual representation along with the generation process.
no code implementations • 4 Sep 2022 • Yuyang Ren, Haonan Zhang, Qi Li, Luoyi Fu, Jiaxin Ding, Xinde Cao, Xinbing Wang, Chenghu Zhou
In review-based recommendation methods, review data is considered as auxiliary information that can improve the quality of learned user/item or interaction representations for the user rating prediction task.
no code implementations • CVPR 2021 • Fengmin Shi, Jie Guo, Haonan Zhang, Shan Yang, Xiying Wang, Yanwen Guo
We demonstrate that local geometry has a greater impact on the sound than the global geometry and offers more cues in material recognition.
no code implementations • 4 Mar 2021 • Mohammadreza Nemati, Haonan Zhang, Michael Sloma, Dulat Bekbolsynov, Hong Wang, Stanislaw Stepkowski, Kevin S. Xu
Kidney transplantation can significantly enhance living standards for people suffering from end-stage renal disease.