no code implementations • 3 Apr 2025 • Laibin Chang, Yunke Wang, Jiaxing Huang, Longxiang Deng, Bo Du, Chang Xu
Marine Saliency Segmentation (MSS) plays a pivotal role in various vision-based marine exploration tasks.
no code implementations • 4 Feb 2025 • Siyu Xu, Yunke Wang, Chenghao Xia, Dihao Zhu, Tao Huang, Chang Xu
A natural idea is to reuse the computational results of unchanged visual tokens from the last step.
no code implementations • 2 Feb 2025 • Yunke Wang, Yanxi Li, Chang Xu
It argues that while Scaling Up of models faces inherent bottlenecks, the future trajectory of AI scaling lies in Scaling Down and Scaling Out.
no code implementations • 8 Jan 2025 • Laibin Chang, Yunke Wang, Bo Du, Chang Xu
For the sacrificed image details caused by underwater scattering, we further present the Cross-Spectral Detail Refinement (CSDR) to enhance the high-frequency details, which are integrated with the low-frequency signal as input conditions for guiding the diffusion.
no code implementations • 26 Aug 2024 • Daixun Li, Weiying Xie, Mingxiang Cao, Yunke Wang, Jiaqing Zhang, Yunsong Li, Leyuan Fang, Chang Xu
In this paper, we introduce SAM into multimodal image segmentation for the first time, proposing a novel framework that combines Latent Space Token Generation (LSTG) and Fusion Mask Prompting (FMP) modules to enhance SAM's multimodal fusion and segmentation capabilities.
no code implementations • 18 Mar 2024 • Siyu Xu, Yunke Wang, Daochang Liu, Chang Xu
Based on the observation that the accuracy of GPT-4V's image recognition varies significantly with the order of images within the collage prompt, our method further learns to optimize the arrangement of images for maximum recognition accuracy.
no code implementations • 21 Jan 2024 • Yunke Wang, Linwei Tao, Bo Du, Yutian Lin, Chang Xu
Adversarial Imitation Learning (AIL) allows the agent to reproduce expert behavior with low-dimensional states and actions.
no code implementations • 19 Jan 2024 • Rui Xu, Yunke Wang, Bo Du
To address these two issues, we propose a novel Masked Autoencoder-enhanced Diffusion Model (MAEDiff) for unsupervised anomaly detection in brain images.
1 code implementation • 11 Oct 2023 • Yunke Wang, Minjing Dong, Yukun Zhao, Bo Du, Chang Xu
In the first step, we apply a forward diffusion process to smooth potential noises in imperfect demonstrations by introducing additional noise.
1 code implementation • 13 Feb 2023 • Yunke Wang, Bo Du, Chang Xu
The trajectories of an initial agent policy could be closer to those non-optimal expert demonstrations, but within the framework of adversarial imitation learning, agent policy will be optimized to cheat the discriminator and produce trajectories that are similar to those optimal expert demonstrations.
no code implementations • 3 Mar 2022 • Yunke Wang, Bo Du, Wenyuan Wang, Chang Xu
To satisfy the sequential input of Transformer, the tail of ViT first splits each image into a sequence of visual tokens with a fixed length.