Search Results for author: Zilun Zhang

Found 5 papers, 2 papers with code

RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing

1 code implementation20 Jun 2023 Zilun Zhang, Tiancheng Zhao, Yulong Guo, Jianwei Yin

Moreover, we present an image-text paired dataset in the field of remote sensing (RS), RS5M, which has 5 million RS images with English descriptions.

 Ranked #1 on Cross-Modal Retrieval on RSITMD (using extra training data)

Cross-Modal Retrieval Image Retrieval +5

Introducing Vision Transformer for Alzheimer's Disease classification task with 3D input

no code implementations3 Oct 2022 Zilun Zhang, Farzad Khalvati

Many high-performance classification models utilize complex CNN-based architectures for Alzheimer's Disease classification.

Classification

Injecting Image Details into CLIP's Feature Space

no code implementations31 Aug 2022 Zilun Zhang, Cuifeng Shen, Yuan Shen, Huixin Xiong, Xinyu Zhou

Although CLIP-like Visual Language Models provide a functional joint feature space for image and text, due to the limitation of the CILP-like model's image input size (e. g., 224), subtle details are lost in the feature representation if we input high-resolution images (e. g., 2240).

Retrieval

DPGN: Distribution Propagation Graph Network for Few-shot Learning

1 code implementation CVPR 2020 Ling Yang, Liangliang Li, Zilun Zhang, Xinyu Zhou, Erjin Zhou, Yu Liu

To combine the distribution-level relations and instance-level relations for all examples, we construct a dual complete graph network which consists of a point graph and a distribution graph with each node standing for an example.

Few-Shot Learning Relation

Cannot find the paper you are looking for? You can Submit a new open access paper.