no code implementations • 16 Apr 2024 • SeungWook Kim, Kejie Li, Xueqing Deng, Yichun Shi, Minsu Cho, Peng Wang
Leveraging multi-view diffusion models as priors for 3D optimization have alleviated the problem of 3D consistency, e. g., the Janus face problem or the content drift problem, in zero-shot text-to-3D models.
no code implementations • 12 Apr 2024 • Xueqing Deng, Qihang Yu, Peng Wang, Xiaohui Shen, Liang-Chieh Chen
By enhancing the annotation quality and expanding the dataset to encompass 383K images with more than 5. 18M panoptic masks, we introduce COCONut, the COCO Next Universal segmenTation dataset.
1 code implementation • 30 Nov 2023 • Ju He, Qihang Yu, Inkyu Shin, Xueqing Deng, Xiaohui Shen, Alan Yuille, Liang-Chieh Chen
To alleviate the issue, we propose to adapt the trajectory attention for both the dense pixel features and object queries, aiming to improve the short-term and long-term tracking results, respectively.
Ranked #1 on Video Panoptic Segmentation on VIPSeg
no code implementations • 3 Oct 2023 • Xueqing Deng, Qi Fan, Xiaojie Jin, Linjie Yang, Peng Wang
Specifically, SFA consists of external adapters and internal adapters which are sequentially operated over a transformer model.
1 code implementation • NeurIPS 2023 • Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, Liang-Chieh Chen
The proposed FC-CLIP, benefits from the following observations: the frozen CLIP backbone maintains the ability of open-vocabulary classification and can also serve as a strong mask generator, and the convolutional CLIP generalizes well to a larger input resolution than the one used during contrastive image-text pretraining.
Ranked #1 on Open Vocabulary Semantic Segmentation on Cityscapes
Open Vocabulary Panoptic Segmentation Open Vocabulary Semantic Segmentation +1
1 code implementation • 19 Jan 2023 • Xiaojie Jin, BoWen Zhang, Weibo Gong, Kai Xu, Xueqing Deng, Peng Wang, Zhao Zhang, Xiaohui Shen, Jiashi Feng
The first is a Temporal Adaptation Module that is incorporated in the video branch to introduce global and local temporal contexts.
no code implementations • 20 Oct 2022 • Dalton Lunga, Yingjie Hu, Shawn Newsam, Song Gao, Bruno Martins, Lexie Yang, Xueqing Deng
Geospatial Artificial Intelligence (GeoAI) is an interdisciplinary field enjoying tremendous adoption.
1 code implementation • CVPR 2022 • Xueqing Deng, Peng Wang, Xiaochen Lian, Shawn Newsam
Notably, NightLab contains models at two levels of granularity, i. e. image and regional, and each level is composed of light adaptation and segmentation modules.
no code implementations • 12 Apr 2022 • Xueqing Deng, Dawei Sun, Shawn Newsam, Peng Wang
Specifically, given a pair of student and teacher networks, DistPro first sets up a rich set of KD connection from the transmitting layers of the teacher to the receiving layers of the student, and in the meanwhile, various transforms are also proposed for comparing feature maps along its pathway for the distillation.
no code implementations • 24 Jun 2021 • Xueqing Deng, Yi Zhu, Yuxin Tian, Shawn Newsam
Neural network-based semantic segmentation has achieved remarkable results when large amounts of annotated data are available, that is, in the supervised case.
1 code implementation • 8 Dec 2020 • Xueqing Deng, Yi Zhu, Yuxin Tian, Shawn Newsam
Land-cover classification using remote sensing imagery is an important Earth observation task.
no code implementations • 23 Dec 2019 • Xueqing Deng, Yi Zhu, Yuxin Tian, Shawn Newsam
Inspired by this, we investigate methods to inform or guide deep learning models for geospatial image analysis to increase their performance when a limited amount of training data is available or when they are applied to scenarios other than which they were trained on.
no code implementations • 19 Feb 2019 • Xueqing Deng, Yi Zhu, Shawn Newsam
This paper develops a deep-learning framework to synthesize a ground-level view of a location given an overhead image.
no code implementations • 23 Jul 2018 • Weixun Zhou, Xueqing Deng, Zhenfeng Shao
In our approach, we first train a FCN model using a pixel-wise labeled dataset, and the trained FCN is then used to predict the segmentation maps of each image in the considered archive.
no code implementations • 13 Jun 2018 • Xueqing Deng, Yi Zhu, Shawn Newsam
More significantly, we show the generated images are representative of the locations and that the representations learned by the cGANs are informative.
no code implementations • 21 Feb 2018 • Xueqing Deng, Yi Zhu, Shawn Newsam
We also show that the spatial morphing kernel improves the results.
no code implementations • 7 Feb 2018 • Yi Zhu, Xueqing Deng, Shawn Newsam
We perform fine-grained land use mapping at the city scale using ground-level images.