Search Results for author: Haonan Zhang

Found 14 papers, 6 papers with code

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

no code implementations 9 Sep 2024 Run Luo, Haonan Zhang, Longze Chen, Ting-En Lin, Xiong Liu, Yuchuan Wu, Min Yang, Minzheng Wang, Pengpeng Zeng, Lianli Gao, Heng Tao Shen, Yunshui Li, Xiaobo Xia, Fei Huang, Jingkuan Song, Yongbin Li

This framework iteratively improves data quality through a refined combination of fine-grained perception, cognitive reasoning, and interaction evolution, generating a more complex and diverse image-text instruction dataset that empowers MLLMs with enhanced capabilities.

Diversity, Visual Reasoning

Text-Video Retrieval with Global-Local Semantic Consistent Learning

1 code implementation 21 May 2024 Haonan Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Yihang Duan, Xinyu Lyu, HengTao Shen

Then, we devise a shared local interaction module that employs several learnable queries to capture latent semantic concepts for learning fine-grained alignment.

Concept Alignment, Retrieval +1

Does Knowledge Graph Really Matter for Recommender Systems?

1 code implementation 4 Apr 2024 Haonan Zhang, Dongxia Wang, Zhu Sun, Yanhui Li, Youcheng Sun, HuiZhi Liang, Wenhai Wang

We consider the scenarios where knowledge in a KG gets completely removed, randomly distorted and decreased, and also where recommendations are for cold-start users.

Knowledge Graphs, Recommendation Systems

CaKDP: Category-aware Knowledge Distillation and Pruning Framework for Lightweight 3D Object Detection

1 code implementation CVPR 2024 Haonan Zhang, Longjun Liu, Yuqi Huang, Zhao Yang, Xinyu Lei, Bihan Wen

To address these issues we propose a simple yet effective Category-aware Knowledge Distillation and Pruning (CaKDP) framework for compressing 3D detectors.

3D Object Detection, Knowledge Distillation +1

Leveraging policy instruments and financial incentives to reduce embodied carbon in energy retrofits

no code implementations 6 Apr 2023 Haonan Zhang

This research aims to develop policy strategies to reduce embodied carbon emissions in retrofits.

Life cycle costing analysis of deep energy retrofits of a mid-rise building to understand the impact of energy conservation measures

no code implementations 2 Apr 2023 Haonan Zhang

This study employed EnergyPlus to examine the energy performance of 11 energy retrofit measures for a typical multi-unit residential building (MURB) in Metro Vancouver, British Columbia, Canada.

A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval

2 code implementations NeurIPS 2022 Hao Li, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Haonan Zhang, Gongfu Li

To verify the effectiveness of our approach, extensive experiments are conducted on MS-COCO, CUB Captions, and Flickr30K, which are commonly used in cross-modal retrieval.

Image-text matching, Image-to-Text Retrieval +1

Visual Commonsense-aware Representation Network for Video Captioning

1 code implementation 17 Nov 2022 Pengpeng Zeng, Haonan Zhang, Lianli Gao, Xiangpeng Li, Jin Qian, Heng Tao Shen

Generating consecutive descriptions for videos, i.e., Video Captioning, requires taking full advantage of visual representation along with the generation process.

Caption Generation, Question Answering +2

Disentangled Graph Contrastive Learning for Review-based Recommendation

no code implementations 4 Sep 2022 Yuyang Ren, Haonan Zhang, Qi Li, Luoyi Fu, Jiaxin Ding, Xinde Cao, Xinbing Wang, Chenghu Zhou

In review-based recommendation methods, review data is considered as auxiliary information that can improve the quality of learned user/item or interaction representations for the user rating prediction task.

Contrastive Learning, Recommendation Systems

GLAVNet: Global-Local Audio-Visual Cues for Fine-Grained Material Recognition

no code implementations CVPR 2021 Fengmin Shi, Jie Guo, Haonan Zhang, Shan Yang, Xiying Wang, Yanwen Guo

We demonstrate that local geometry has a greater impact on the sound than the global geometry and offers more cues in material recognition.

Material Recognition
