no code implementations • 5 Sep 2024 • Deyin Liu, Lin Yuanbo Wu, Xianghua Xie
First, although existing methods attempt to restrict editing to a local area with a pre-defined mask, the background outside that area is poorly preserved because each frame is generated in its spatial entirety.
no code implementations • 3 Jul 2024 • Hanxi Li, Jingqi Wu, Lin Yuanbo Wu, Hao Chen, Deyin Liu, Chunhua Shen
By fine-tuning the ADClick-Seg model using the weak labels inferred by ADClick, we establish the state-of-the-art performances in supervised AD tasks (AP $= 86.4\%$ on MVTec AD and AP $= 78.4\%$, PRO $= 98.6\%$ on KSDD2).
Ranked #3 on Supervised Anomaly Detection on MVTec AD (using extra training data)
no code implementations • 2 Mar 2024 • Xinyi Yu, Ling Yan, PengTao Jiang, Hao Chen, Bo Li, Lin Yuanbo Wu, Linlin Ou
This innovative approach empowers the network to simultaneously predict masks and depth, enhancing its ability to capture nuanced depth-related information during the instance segmentation process.
no code implementations • 18 Oct 2023 • Weian Mao, Muzhi Zhu, Zheng Sun, Shuaike Shen, Lin Yuanbo Wu, Hao Chen, Chunhua Shen
Most prior encoders rely on atom-wise features, such as angles and distances between atoms, which are not available in this context.
1 code implementation • ICCV 2023 • Kaining Ying, Qing Zhong, Weian Mao, Zhenhua Wang, Hao Chen, Lin Yuanbo Wu, Yifan Liu, Chengxiang Fan, Yunzhi Zhuge, Chunhua Shen
The discrimination of instance embeddings plays a vital role in associating instances across time for online video instance segmentation (VIS).
Ranked #3 on Video Instance Segmentation on Youtube-VIS 2022 Validation (using extra training data)
no code implementations • 6 Jun 2023 • Hanxi Li, Jingqi Wu, Lin Yuanbo Wu, Hao Chen, Deyin Liu, Mingwen Wang, Peng Wang
In this work, we propose a novel framework called "Weakly-supervised RESidual Transformer" (WeakREST), which aims to achieve high AD accuracy while minimizing the need for extensive annotations.
Ranked #1 on Anomaly Detection on BTAD (using extra training data)
1 code implementation • 26 Nov 2022 • Zhong Ji, Junhua Hu, Deyin Liu, Lin Yuanbo Wu, Ye Zhao
To implement this task, one needs to extract multi-scale features from both image and text domains, and then perform the cross-modal alignment.
1 code implementation • 18 Aug 2022 • Deyin Liu, Lin Yuanbo Wu, Bo Li, ZongYuan Ge
Our architecture is orthogonal to StackGAN++ and focuses on person image generation; together, they enrich the spectrum of GANs for the image generation task.