Search Results for author: Zihan Zhong

Found 6 papers, 2 papers with code

Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data

no code implementations19 Dec 2024 Zhiqiang Tang, Zihan Zhong, Tong He, Gerald Friedland

We curate a benchmark comprising 22 multimodal datasets from diverse real-world applications, encompassing all 4 combinations of the 3 modalities.

AutoML cross-modal alignment +1

UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks

no code implementations7 Jun 2023 Yanan sun, Zihan Zhong, Qi Fan, Chi-Keung Tang, Yu-Wing Tai

Our thorough studies validate that models pre-trained as such can learn rich representations of both modalities, improving their ability to understand how images and text relate to each other.

Semantic Segmentation

Tailoring Instructions to Student's Learning Levels Boosts Knowledge Distillation

1 code implementation16 May 2023 Yuxin Ren, Zihan Zhong, Xingjian Shi, Yi Zhu, Chun Yuan, Mu Li

It has been commonly observed that a teacher model with superior performance does not necessarily result in a stronger student, highlighting a discrepancy between current teacher training practices and effective knowledge transfer.

Knowledge Distillation text-classification +2

Towards Arbitrary Text-driven Image Manipulation via Space Alignment

no code implementations25 Jan 2023 Yunpeng Bai, Zihan Zhong, Chao Dong, Weichen Zhang, Guowei Xu, Chun Yuan

Then, the text input can be directly accessed into the StyleGAN space and be used to find the semantic shift according to the text description.

Attribute Image Manipulation

Cannot find the paper you are looking for? You can Submit a new open access paper.