1 code implementation • 25 Nov 2024 • Jin Yao, Hao Gu, Xuweiyi Chen, Jiayun Wang, Zezhou Cheng
In this work, we pioneer the study of open-vocabulary monocular 3D object detection, a novel task that aims to detect and localize objects in 3D space from a single RGB image without limiting detection to a predefined set of categories.
no code implementations • 25 Nov 2024 • Xuweiyi Chen, Zezhou Cheng
Self-supervised learning has emerged as a promising approach for acquiring transferable 3D representations from unlabeled 3D point clouds.
no code implementations • 25 Nov 2024 • Xuweiyi Chen, Markus Marks, Zezhou Cheng
In this study, we introduce a suite of benchmark protocols to systematically assess mid-level vision capabilities and present a comprehensive, controlled evaluation of 22 prominent SSL models across 8 mid-level vision tasks.
1 code implementation • 23 Feb 2024 • Jin Yao, Eli Chien, Minxin Du, Xinyao Niu, Tianhao Wang, Zezhou Cheng, Xiang Yue
This study investigates the concept of the `right to be forgotten' within the context of large language models (LLMs).
no code implementations • ICCV 2023 • Zezhou Cheng, Carlos Esteves, Varun Jampani, Abhishek Kar, Subhransu Maji, Ameesh Makadia
Consequently, there is growing interest in extending NeRF models to jointly optimize camera poses and scene representation, which offers an alternative to off-the-shelf SfM pipelines which have well-understood failure modes.
no code implementations • 13 Dec 2022 • Zezhou Cheng, Matheus Gadelha, Subhransu Maji
We propose a technique for learning single-view 3D object pose estimation models by utilizing a new source of data -- in-the-wild videos where objects turn.
no code implementations • 24 Jul 2022 • Zezhou Cheng, Menglei Chai, Jian Ren, Hsin-Ying Lee, Kyle Olszewski, Zeng Huang, Subhransu Maji, Sergey Tulyakov
In this paper, we propose a generic multi-modal generative model that couples the 2D modalities and implicit 3D representations through shared latent spaces.
no code implementations • 11 Apr 2022 • Oindrila Saha, Zezhou Cheng, Subhransu Maji
A significant bottleneck in training deep networks for part segmentation is the cost of obtaining detailed annotations.
no code implementations • CVPR 2022 • Oindrila Saha, Zezhou Cheng, Subhransu Maji
Motivated by this we present an alternative approach based on contrastive learning and compare their performance on standard few-shot part segmentation benchmarks.
1 code implementation • CVPR 2021 • Jong-Chyi Su, Zezhou Cheng, Subhransu Maji
We evaluate the effectiveness of semi-supervised learning (SSL) on a realistic benchmark where data exhibits considerable class imbalance and contains images from novel classes.
1 code implementation • ICCV 2021 • Zezhou Cheng, Jong-Chyi Su, Subhransu Maji
Given a collection of images, humans are able to discover landmarks by modeling the shared geometric structure across instances.
no code implementations • 24 Apr 2020 • Zezhou Cheng, Saadia Gabriel, Pankaj Bhambhani, Daniel Sheldon, Subhransu Maji, Andrew Laughlin, David Winkler
The US weather radar archive holds detailed information about biological phenomena in the atmosphere over the last 20 years.
1 code implementation • CVPR 2019 • Zezhou Cheng, Matheus Gadelha, Subhransu Maji, Daniel Sheldon
The deep image prior was recently introduced as a prior for natural images.
1 code implementation • ICCV 2015 • Zezhou Cheng, Qingxiong Yang, Bin Sheng
This paper investigates into the colorization problem which converts a grayscale image to a colorful version.