Search Results for author: Weijia Li

Found 16 papers, 11 papers with code

3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions

2 code implementations • 7 Apr 2024 • Weijia Li, Haote Yang, Zhenghao Hu, Juepeng Zheng, Gui-Song Xia, Conghui He

3D building reconstruction from monocular remote sensing images is an important and challenging research problem that has received increasing attention in recent years, owing to its low cost of data acquisition and availability for large-scale applications.

3D Reconstruction

131

Paper
Code

SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation

1 code implementation • 3 Apr 2024 • Junyan Ye, Qiyan Luo, Jinhua Yu, Huaping Zhong, Zhimeng Zheng, Conghui He, Weijia Li

This paper aims at achieving fine-grained building attribute segmentation in a cross-view scenario, i. e., using satellite and street-view image pairs.

Attribute Semantic Segmentation

Paper
Code

H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model

1 code implementation • 29 Mar 2024 • Chao Pang, Jiang Wu, Jiayu Li, Yi Liu, Jiaxing Sun, Weijia Li, Xingxing Weng, Shuai Wang, Litong Feng, Gui-Song Xia, Conghui He

The generic large Vision-Language Models (VLMs) is rapidly developing, but still perform poorly in Remote Sensing (RS) domain, which is due to the unique and specialized nature of RS imagery and the comparatively limited spatial perception of current VLMs.

Hallucination Language Modelling +2

Paper
Code

Building Bridges across Spatial and Temporal Resolutions: Reference-Based Super-Resolution via Change Priors and Conditional Diffusion Model

1 code implementation • 26 Mar 2024 • Runmin Dong, Shuai Yuan, Bin Luo, Mengxuan Chen, Jinxiao Zhang, Lixian Zhang, Weijia Li, Juepeng Zheng, Haohuan Fu

Specifically, we inject the priors into the denoising model to improve the utilization of reference information in unchanged areas and regulate the reconstruction of semantically relevant content in changed areas.

Denoising Reference-based Super-Resolution

Paper
Code

Parrot Captions Teach CLIP to Spot Text

1 code implementation • 21 Dec 2023 • Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou

Despite CLIP being the foundation model in numerous vision-language applications, the CLIP suffers from a severe text spotting bias.

Representation Learning text similarity +1

Paper
Code

A review of individual tree crown detection and delineation from optical remote sensing images

no code implementations • 20 Oct 2023 • Juepeng Zheng, Shuai Yuan, Weijia Li, Haohuan Fu, Le Yu

); (2) traditional machine learning methods (such as random forest, decision tree, etc.

Image Segmentation object-detection +2

Paper
Add Code

VIGC: Visual Instruction Generation and Correction

2 code implementations • 24 Aug 2023 • Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He

A practical solution to this problem would be to utilize the available multimodal large language models (MLLMs) to generate instruction data for vision-language tasks.

Hallucination Image Captioning +1

Paper
Code

An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions

2 code implementations • CVPR 2023 • Weijia Li, Saihui Hou, Chunjie Zhang, Chunshui Cao, Xu Liu, Yongzhen Huang, Yao Zhao

For the cloth-changing problem, video-based ReID is rarely studied due to the lack of a suitable cloth-changing benchmark, and gait recognition is often researched under controlled conditions.

16k Gait Recognition +1

Paper
Code

Large-Scale Land Cover Mapping with Fine-Grained Classes via Class-Aware Semi-Supervised Semantic Segmentation

no code implementations • ICCV 2023 • Runmin Dong, Lichao Mou, Mengxuan Chen, Weijia Li, Xin-Yi Tong, Shuai Yuan, Lixian Zhang, Juepeng Zheng, Xiaoxiang Zhu, Haohuan Fu

Moreover, we propose the Class Center Contrast method to jointly utilize the labeled and unlabeled data.

Semi-Supervised Semantic Segmentation

Paper
Add Code

SEPT: Towards Scalable and Efficient Visual Pre-Training

no code implementations • 11 Dec 2022 • Yiqi Lin, Huabin Zheng, Huaping Zhong, Jinjing Zhu, Weijia Li, Conghui He, Lin Wang

To address these issues, we build a task-specific self-supervised pre-training framework from a data selection perspective based on a simple hypothesis that pre-training on the unlabeled samples with similar distribution to the target task can bring substantial performance gains.

Retrieval

Paper
Add Code

OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images

no code implementations • CVPR 2023 • Weijia Li, Yawen Lai, Linning Xu, Yuanbo Xiangli, Jinhua Yu, Conghui He, Gui-Song Xia, Dahua Lin

More precisely, the OmniCity contains multi-view satellite images as well as street-level panorama and mono-view images, constituting over 100K pixel-wise annotated images that are well-aligned and collected from 25K geo-locations in New York City.

Instance Segmentation Segmentation +1

Paper
Add Code

Exploring the Interactive Guidance for Unified and Effective Image Matting

1 code implementation • 17 May 2022 • Dinghao Yang, Bin Wang, Weijia Li, Yiqi Lin, Conghui He

Although avoiding the extensive labors of trimap annotation, existing methods still suffer from two limitations: (1) For the single image with multiple objects, it is essential to provide extra interaction information to help determining the matting target; (2) For transparent objects, the accurate regression of alpha matte from RGB image is much more difficult compared with the opaque ones.

Foreground Segmentation Image Matting +1

Paper
Code

Learning to Extract Building Footprints from Off-Nadir Aerial Images

1 code implementation • 28 Apr 2022 • Jinwang Wang, Lingxuan Meng, Weijia Li, Wen Yang, Lei Yu, Gui-Song Xia

In this paper, we propose an offset vector learning scheme, which turns the building footprint extraction problem in off-nadir images into an instance-level joint prediction problem of the building roof and its corresponding "roof to footprint" offset vector.

Paper
Code

Influence Selection for Active Learning

1 code implementation • ICCV 2021 • Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, Conghui He

To obtain the Influence of the unlabeled sample in the active learning scenario, we design the Untrained Unlabeled sample Influence Calculation(UUIC) to estimate the unlabeled sample's expected gradient with which we calculate its Influence.

Active Learning

Paper
Code

3D Building Reconstruction From Monocular Remote Sensing Images

no code implementations • ICCV 2021 • Weijia Li, Lingxuan Meng, Jinwang Wang, Conghui He, Gui-Song Xia, Dahua Lin

3D building reconstruction from monocular remote sensing imagery is an important research problem and an economic solution to large-scale city modeling, compared with reconstruction from LiDAR data and multi-view imagery.

3D Reconstruction Model Optimization

Paper
Add Code

Cross-regional oil palm tree counting and detection via multi-level attention domain adaptation network

1 code implementation • 26 Aug 2020 • Juepeng Zheng, Haohuan Fu, Weijia Li, Wenzhao Wu, Yi Zhao, Runmin Dong, Le Yu

In this paper, we propose a novel domain adaptive oil palm tree detection method, i. e., a Multi-level Attention Domain Adaptation Network (MADAN) to reap cross-regional oil palm tree counting and detection.

Domain Adaptation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.