no code implementations • 18 Dec 2024 • Zhihang Yuan, Yuzhang Shang, Hanling Zhang, Tongcheng Fang, Rui Xie, Bingxin Xu, Yan Yan, Shengen Yan, Guohao Dai, Yu Wang
Our approach not only enhances computational efficiency but also aligns naturally with image generation principles by operating in continuous token space and following a hierarchical generation process from coarse to fine details.
no code implementations • 12 Jun 2024 • Zhihang Yuan, Hanling Zhang, Pu Lu, Xuefei Ning, Linfeng Zhang, Tianchen Zhao, Shengen Yan, Guohao Dai, Yu Wang
Diffusion Transformers (DiT) excel at image and video generation but face computational challenges due to the quadratic complexity of self-attention operators.
1 code implementation • 2 Dec 2021 • Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin
Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations.
Ranked #2 on Unsupervised Semantic Segmentation on COCO-Stuff-171 (using extra training data)
1 code implementation • CVPR 2021 • Zhaoyuan Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanling Zhang, Shenghua Gao
This paper proposes a framework for the interactive video object segmentation (VOS) in the wild where users can choose some frames for annotations iteratively.
Deep Reinforcement Learning Interactive Video Object Segmentation
no code implementations • 30 Nov 2017 • Chenxing Xia, Hanling Zhang, Keqin Li
Different from prior methods, we calculate the saliency value of each node based on the relationship between the corresponding node and the virtual node.
no code implementations • 14 Jun 2017 • Chenxing Xia, Hanling Zhang, Xiuju Gao
This paper proposes an unsupervised bottom-up saliency detection approach by aggregating complementary background template with refinement.
no code implementations • 5 Mar 2016 • Hanling Zhang, Chenxing Xia
In this paper, we propose an improved mechanism for saliency detection.
no code implementations • 14 Sep 2014 • Pichao Wang, Wanqing Li, Philip Ogunbona, Zhimin Gao, Hanling Zhang
These parts are referred to as Frequent Local Parts or FLPs.