Search Results for author: Yiran Song

Found 4 papers, 3 papers with code

SimAda: A Simple Unified Framework for Adapting Segment Anything Model in Underperformed Scenes

1 code implementation • 31 Jan 2024 • Yiran Song, Qianyu Zhou, Xuequan Lu, Zhiwen Shao, Lizhuang Ma

In this paper, we aim to investigate the impact of the general vision modules on finetuning SAM and enable them to generalize across all downstream tasks.

Paper
Code

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

1 code implementation • 4 Jan 2024 • Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma

To this end, we propose Scalable Bias-Mode Attention Mask (BA-SAM) to enhance SAM's adaptability to varying image resolutions while eliminating the need for structure modifications.

Paper
Code

Rethinking Implicit Neural Representations for Vision Learners

no code implementations • 22 Nov 2022 • Yiran Song, Qianyu Zhou, Lizhuang Ma

Existing INRs methods suffer from two problems: 1) narrow theoretical definitions of INRs are inapplicable to high-level tasks; 2) lack of representation capabilities to deep networks.

Image Classification Image Generation +6

Paper
Add Code

See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval

1 code implementation • 18 Aug 2022 • Xiujun Shu, Wei Wen, Haoqian Wu, Keyu Chen, Yiran Song, Ruizhi Qiao, Bo Ren, Xiao Wang

To explore the fine-grained alignment, we further propose two implicit semantic alignment paradigms: multi-level alignment (MLA) and bidirectional mask modeling (BMM).

Person Retrieval Retrieval +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.