Search Results for author: Yiran Song

Found 4 papers, 3 papers with code

SimAda: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes

1 code implementation31 Jan 2024 Yiran Song, Qianyu Zhou, Xuequan Lu, Zhiwen Shao, Lizhuang Ma

In this paper, we aim to investigate the impact of the general vision modules on finetuning SAM and enable them to generalize across all downstream tasks.

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

1 code implementation4 Jan 2024 Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma

To this end, we propose Scalable Bias-Mode Attention Mask (BA-SAM) to enhance SAM's adaptability to varying image resolutions while eliminating the need for structure modifications.

Rethinking Implicit Neural Representations for Vision Learners

no code implementations22 Nov 2022 Yiran Song, Qianyu Zhou, Lizhuang Ma

Existing INRs methods suffer from two problems: 1) narrow theoretical definitions of INRs are inapplicable to high-level tasks; 2) lack of representation capabilities to deep networks.

Image Classification Image Generation +6

See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval

1 code implementation18 Aug 2022 Xiujun Shu, Wei Wen, Haoqian Wu, Keyu Chen, Yiran Song, Ruizhi Qiao, Bo Ren, Xiao Wang

To explore the fine-grained alignment, we further propose two implicit semantic alignment paradigms: multi-level alignment (MLA) and bidirectional mask modeling (BMM).

Person Retrieval Retrieval +3

Cannot find the paper you are looking for? You can Submit a new open access paper.