1 code implementation • 15 Jun 2024 • Yike Yuan, Huanzhang Dou, Fengjun Guo, Xi Li
This paper presents a neat yet effective framework, named SemanticMIM, to integrate the advantages of masked image modeling (MIM) and contrastive learning (CL) for general visual representation.
3 code implementations • 12 Jul 2023 • Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin
In response to these challenges, we propose MMBench, a bilingual benchmark for assessing the multi-modal capabilities of VLMs.
no code implementations • 6 Jun 2023 • Yike Yuan, Xinghe Fu, Yunlong Yu, Xi Li
In this paper, we propose DenseDINO, a simple yet effective transformer framework for self-supervised learning of dense visual representations.
no code implementations • 3 Apr 2020 • Zunlei Feng, Xinchao Wang, Yongming He, Yike Yuan, Xin Gao, Mingli Song
In this paper, we study a new representation-learning task, which we term disassembling object representations.