1 code implementation • 22 Jan 2024 • Zihang Lai
Open-vocabulary semantic segmentation models aim to accurately assign a semantic label to each pixel in an image from a set of arbitrary open-vocabulary texts.
Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation +1
1 code implementation • 18 Sep 2022 • Xuran Pan, Zihang Lai, Shiji Song, Gao Huang
In this paper, we present a novel learning framework, ActiveNeRF, aiming to model a 3D scene with a constrained input budget.
1 code implementation • 17 Sep 2022 • Yizeng Han, Yifan Pu, Zihang Lai, Chaofei Wang, Shiji Song, Junfen Cao, Wenhui Huang, Chao Deng, Gao Huang
Intuitively, easy samples, which generally exit early in the network during inference, should contribute more to training early classifiers.
1 code implementation • 9 Jun 2022 • Zhirong Wu, Zihang Lai, Xiao Sun, Stephen Lin
The paper presents a scalable approach for learning spatially distributed visual representations over individual tokens and a holistic instance representation simultaneously.
1 code implementation • 14 Feb 2022 • Chunjiang Ge, Rui Huang, Mixue Xie, Zihang Lai, Shiji Song, Shuang Li, Gao Huang
Unsupervised domain adaption (UDA) aims to adapt models learned from a well-annotated source domain to a target domain, where only unlabeled samples are given.
2 code implementations • CVPR 2022 • Yulin Wang, Yang Yue, Yuanze Lin, Haojun Jiang, Zihang Lai, Victor Kulikov, Nikita Orlov, Humphrey Shi, Gao Huang
Recent works have shown that the computational efficiency of video recognition can be significantly improved by reducing the spatial redundancy.
no code implementations • ICCV 2021 • Zihang Lai, Sifei Liu, Alexei A. Efros, Xiaolong Wang
Relying on temporal continuity in videos, our work assumes that the 3D scene structure in nearby video frames remains static.
no code implementations • ICCV 2021 • Zihang Lai, Senthil Purushwalkam, Abhinav Gupta
For example, what are the correspondences between a bottle and shoe for the task of pounding or the task of pouring.
2 code implementations • CVPR 2020 • Zihang Lai, Erika Lu, Weidi Xie
Recent interest in self-supervised dense tracking has yielded rapid progress, but performance still remains far from supervised methods.
Ranked #4 on Unsupervised Video Object Segmentation on DAVIS 2017 (val) (using extra training data)
Semantic Segmentation Semi-Supervised Video Object Segmentation +2
1 code implementation • 2 May 2019 • Zihang Lai, Weidi Xie
Fourth, in order to shed light on the potential of self-supervised learning on the task of video correspondence flow, we probe the upper bound by training on additional data, \ie more diverse videos, further demonstrating significant improvements on video segmentation.
Self-Supervised Learning Semi-Supervised Video Object Segmentation +4
3 code implementations • 26 Oct 2018 • Yan Wang, Zihang Lai, Gao Huang, Brian H. Wang, Laurens van der Maaten, Mark Campbell, Kilian Q. Weinberger
Many applications of stereo depth estimation in robotics require the generation of accurate disparity maps in real time under significant computational constraints.
Ranked #1 on Stereo Depth Estimation on KITTI2012
no code implementations • 7 Sep 2018 • Zhihua Wang, Stefano Rosa, Yishu Miao, Zihang Lai, Linhai Xie, Andrew Markham, Niki Trigoni
In this framework, real images are first converted to a synthetic domain representation that reduces complexity arising from lighting and texture.
1 code implementation • 7 May 2018 • Junwen Bai, Zihang Lai, Runzhe Yang, Yexiang Xue, John Gregoire, Carla Gomes
We propose imitation refinement, a novel approach to refine imperfect input patterns, guided by a pre-trained classifier incorporating prior knowledge from simulated theoretical data, such that the refined patterns imitate the ideal data.