1 code implementation • 12 May 2023 • Ruixiang Jiang, Lingbo Liu, Changwen Chen
Recent advances in visual-language models have shown remarkable zero-shot text-image matching ability that is transferable to down-stream tasks such as object detection and segmentation.
Ranked #1 on Zero-Shot Counting on FSC147
no code implementations • 8 Oct 2022 • Tao Yang, Haokui Zhang, Wenze Hu, Changwen Chen, Xiaoyu Wang
Transformer models have made tremendous progress in various fields in recent years.
no code implementations • 26 May 2022 • Peipei Zhu, Xiao Wang, Lin Zhu, Zhenglong Sun, Weishi Zheng, YaoWei Wang, Changwen Chen
Inspired by the success of Vision-Language Pre-Trained Models (VL-PTMs) in this research, we attempt to infer the cross-domain cue information about a given image from the large VL-PTMs for the UIC task.
no code implementations • 7 Mar 2022 • Peipei Zhu, Xiao Wang, Yong Luo, Zhenglong Sun, Wei-Shi Zheng, YaoWei Wang, Changwen Chen
The image-level labels are utilized to train a weakly-supervised object recognition model to extract object information (e. g., instance) in an image, and the extracted instances are adopted to infer the relationships among different objects based on an enhanced graph neural network (GNN).