Search Results for author: Guoqing Wang

Found 18 papers, 4 papers with code

OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving

no code implementations 23 Apr 2024 Guoqing Wang, Zhongdao Wang, Pin Tang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, Chao Ma

Existing solutions for 3D semantic occupancy prediction typically treat the task as a one-shot, voxel-wise 3D segmentation problem.

Structure-Aware Human Body Reshaping with Adaptive Affinity-Graph Network

no code implementations 22 Apr 2024 Qiwen Deng, Yangcen Liu, Wen Li, Guoqing Wang

In particular, an SRM filter is utilized to extract high-frequency details, which are combined with spatial features as input to the BSD.
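The SRM-residual idea in the snippet above can be sketched as follows. This is only an illustration, not the paper's implementation: it assumes the common 5x5 "KV" high-pass kernel from the SRM filter family as the high-frequency extractor, and simply stacks the residual with the raw image as a two-channel input (the BSD branch itself is not reproduced).

```python
import numpy as np

# One classic SRM high-pass kernel (the 5x5 "KV" filter); the paper
# may use a different kernel set. Entries sum to zero, so flat regions
# produce a zero residual and only high-frequency detail survives.
KV_KERNEL = np.array([
    [-1,  2,  -2,  2, -1],
    [ 2, -6,   8, -6,  2],
    [-2,  8, -12,  8, -2],
    [ 2, -6,   8, -6,  2],
    [-1,  2,  -2,  2, -1],
], dtype=np.float64) / 12.0

def srm_highpass(img: np.ndarray) -> np.ndarray:
    """Convolve a single-channel image with the SRM kernel (reflect padding,
    'same' output size). The kernel is symmetric, so correlation == convolution."""
    pad = KV_KERNEL.shape[0] // 2
    padded = np.pad(img, pad, mode="reflect")
    out = np.zeros(img.shape, dtype=np.float64)
    for i in range(img.shape[0]):
        for j in range(img.shape[1]):
            out[i, j] = np.sum(padded[i:i + 5, j:j + 5] * KV_KERNEL)
    return out

# Combine the high-frequency residual with the raw spatial input, mirroring
# the "combined with spatial features" step described in the abstract.
img = np.random.rand(32, 32)
residual = srm_highpass(img)
combined = np.stack([img, residual], axis=0)  # 2-channel input
```

A real pipeline would learn further features from both channels; here the stacking merely shows how the residual and spatial streams are concatenated.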

SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction

no code implementations 15 Apr 2024 Pin Tang, Zhongdao Wang, Guoqing Wang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, Chao Ma

Vision-based perception for autonomous driving requires explicit modeling of a 3D space, into which 2D latent representations are mapped and where subsequent 3D operators are applied.

Autonomous Driving

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

2 code implementations 10 Apr 2024 Hao Lu, Jiaqi Tang, Xinli Xu, Xu Cao, Yunpeng Zhang, Guoqing Wang, Dalong Du, Hao Chen, Yingcong Chen

Finally, for MC3D-Det joint training, an elaborate dataset-merging strategy is designed to resolve inconsistencies in camera counts and camera parameters across datasets.

3D Object Detection, Autonomous Driving +1

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning

no code implementations 15 Mar 2024 Meixuan Li, Tianyu Li, Guoqing Wang, Peng Wang, Yang Yang, Heng Tao Shen

Aligning these distributions between corresponding regions from different tasks imparts higher flexibility and capacity to capture intra-region structures, accommodating a broader range of tasks.

Depth Estimation, Semantic Segmentation +1

Open-Vocabulary Calibration for Vision-Language Models

no code implementations 7 Feb 2024 Shuoyuan Wang, Jindong Wang, Guoqing Wang, Bob Zhang, Kaiyang Zhou, Hongxin Wei

Vision-language models (VLMs) have emerged as formidable tools, showing their strong capability in handling various open-vocabulary tasks in image recognition, text-driven visual content generation, and visual chatbots, to name a few.

Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation

no code implementations 24 Oct 2023 Yinjie Lei, Zixuan Wang, Feng Chen, Guoqing Wang, Peng Wang, Yang Yang

Multi-modal 3D scene understanding has gained considerable attention due to its wide applications in many areas, such as autonomous driving and human-computer interaction.

Autonomous Driving, Scene Understanding

Faster Video Moment Retrieval with Point-Level Supervision

no code implementations 23 May 2023 Xun Jiang, Zailei Zhou, Xing Xu, Yang Yang, Guoqing Wang, Heng Tao Shen

Existing VMR methods suffer from two defects: (1) massive, expensive temporal annotations are required to obtain satisfactory performance; (2) complicated cross-modal interaction modules are deployed, leading to high computational cost and low efficiency in the retrieval process.

Moment Retrieval, Natural Language Queries +1

Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement

1 code implementation CVPR 2023 Yuhui Wu, Chen Pan, Guoqing Wang, Yang Yang, Jiwei Wei, Chongyi Li, Heng Tao Shen

To address this issue, we propose a novel semantic-aware knowledge-guided framework (SKF) that can assist a low-light enhancement model in learning rich and diverse priors encapsulated in a semantic segmentation model.

Low-Light Image Enhancement, Semantic Segmentation
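The knowledge-guidance idea the SKF entry describes — a frozen semantic segmentation model supplying priors to the enhancement network — can be illustrated with a generic semantic-consistency loss: penalize divergence between the per-pixel class distributions the segmentation model predicts on the enhanced output and on the normal-light reference. This is a hedged sketch of the general idea, not SKF's actual loss or architecture; the logit maps below stand in for a hypothetical frozen segmentation network's outputs.

```python
import numpy as np

def softmax(logits, axis=-1):
    """Numerically stable softmax over the class axis."""
    e = np.exp(logits - logits.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def semantic_consistency_loss(logits_enhanced, logits_normal):
    """Mean per-pixel KL(P_normal || P_enhanced) between the class
    distributions predicted for the normal-light reference and for
    the enhanced output. Zero when the predictions agree exactly."""
    p = softmax(logits_normal)
    q = softmax(logits_enhanced)
    eps = 1e-8  # guard against log(0)
    kl = np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=-1)
    return float(kl.mean())

# Toy example: (H, W, num_classes) logit maps from a hypothetical
# frozen segmentation network.
rng = np.random.default_rng(0)
ref_logits = rng.normal(size=(16, 16, 5))
identical = semantic_consistency_loss(ref_logits, ref_logits)
perturbed = semantic_consistency_loss(
    ref_logits + rng.normal(size=(16, 16, 5)), ref_logits)
```

In a training loop this term would be added to the usual reconstruction loss, nudging the enhancer toward outputs the segmentation model interprets the same way as the well-lit reference.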

ScanERU: Interactive 3D Visual Grounding based on Embodied Reference Understanding

1 code implementation 23 Mar 2023 Ziyang Lu, Yunqiang Pei, Guoqing Wang, Yang Yang, Zheng Wang, Heng Tao Shen

Despite their effectiveness, existing methods suffer from low recognition accuracy when multiple adjacent objects have similar appearances. To address this issue, this work introduces human-robot interaction as a cue to facilitate the development of 3D visual grounding.

Visual Grounding

Thunder: Thumbnail based Fast Lightweight Image Denoising Network

no code implementations 24 May 2022 Yifeng Zhou, Xing Xu, Shuaicheng Liu, Guoqing Wang, Huimin Lu, Heng Tao Shen

To achieve promising results on removing noise from real-world images, most existing denoising networks are built with complex structures, making them impractical for deployment.

Image Denoising, SSIM

Learning content and context with language bias for Visual Question Answering

1 code implementation 21 Dec 2020 Chao Yang, Su Feng, Dongsheng Li, HuaWei Shen, Guoqing Wang, Bin Jiang

Many works concentrate on reducing language bias, which makes models answer questions while ignoring visual content and language context.

Question Answering, Visual Question Answering

ERL-Net: Entangled Representation Learning for Single Image De-Raining

no code implementations ICCV 2019 Guoqing Wang, Changming Sun, Arcot Sowmya

In this paper, we hypothesize that there exists an inherent mapping from the low-quality embedding to a latent optimal one, with which the generator (decoder) can produce much better results.

Image Restoration, Image-to-Image Translation +2
