Search Results for author: Jinglu Wang

Found 27 papers, 11 papers with code

Higher-Order CRF Structural Segmentation of 3D Reconstructed Surfaces

no code implementations ICCV 2015 Jingbo Liu, Jinglu Wang, Tian Fang, Chiew-Lan Tai, Long Quan

In this paper, we propose a structural segmentation algorithm to partition multi-view stereo reconstructed surfaces of large-scale urban environments into structural segments.

Segmentation

Parallel Structure from Motion from Local Increment to Global Averaging

no code implementations 28 Feb 2017 Siyu Zhu, Tianwei Shen, Lei Zhou, Runze Zhang, Jinglu Wang, Tian Fang, Long Quan

In this paper, we tackle the accurate and consistent Structure from Motion (SfM) problem, in particular camera registration, far exceeding the memory of a single computer in parallel.

Clustering

Progressive Large Scale-Invariant Image Matching in Scale Space

no code implementations ICCV 2017 Lei Zhou, Siyu Zhu, Tianwei Shen, Jinglu Wang, Tian Fang, Long Quan

In this paper, we propose a scale-invariant image matching approach to tackling the very large scale variation of views.

Image Retrieval Retrieval

MVPNet: Multi-View Point Regression Networks for 3D Object Reconstruction from A Single Image

no code implementations 23 Nov 2018 Jinglu Wang, Bo Sun, Yan Lu

In this paper, we address the problem of reconstructing an object's surface from a single image using generative networks.

3D Object Reconstruction From A Single Image regression

MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization

1 code implementation 26 Nov 2018 Zengyi Qin, Jinglu Wang, Yan Lu

We propose MonoGRNet for the amodal 3D object detection from a monocular RGB image via geometric reasoning in both the observed 2D projection and the unobserved depth dimension.

Depth Estimation Monocular 3D Object Detection +3

Triangulation Learning Network: from Monocular to Stereo 3D Object Detection

1 code implementation CVPR 2019 Zengyi Qin, Jinglu Wang, Yan Lu

In this paper, we study the problem of 3D object detection from stereo images, in which the key challenge is how to effectively utilize stereo information.

3D Object Detection From Stereo Images Object +1

Weakly-supervised Temporal Action Localization by Uncertainty Modeling

2 code implementations 12 Jun 2020 Pilhyeon Lee, Jinglu Wang, Yan Lu, Hyeran Byun

Experimental results show that our uncertainty modeling is effective at alleviating the interference of background frames and brings a large performance gain without bells and whistles.

Action Classification Multiple Instance Learning +4

Weakly Supervised 3D Object Detection from Point Clouds

1 code implementation 28 Jul 2020 Zengyi Qin, Jinglu Wang, Yan Lu

A crucial task in scene understanding is 3D object detection, which aims to detect and localize the 3D bounding boxes of objects belonging to specific classes.

3D Object Detection Knowledge Distillation +4

MonoGRNet: A General Framework for Monocular 3D Object Detection

no code implementations 18 Apr 2021 Zengyi Qin, Jinglu Wang, Yan Lu

Detecting and localizing objects in the real 3D space, which plays a crucial role in scene understanding, is particularly challenging given only a monocular image due to the geometric information loss during imagery projection.

Depth Estimation Monocular 3D Object Detection +4

Video Instance Segmentation by Instance Flow Assembly

no code implementations 20 Oct 2021 Xiang Li, Jinglu Wang, Xiao Li, Yan Lu

Instance segmentation is a challenging task aiming at classifying and segmenting all object instances of specific classes.

Instance Segmentation Object +3

Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation

no code implementations 3 Dec 2021 Xiang Li, Jinglu Wang, Xiao Li, Yan Lu

Based on this representation, we introduce a cropping-free temporal fusion approach to model the temporal consistency between video frames.

Image Segmentation Instance Segmentation +2

Reliable Propagation-Correction Modulation for Video Object Segmentation

1 code implementation 6 Dec 2021 Xiaohao Xu, Jinglu Wang, Xiao Li, Yan Lu

We introduce two modulators, propagation and correction modulators, to separately perform channel-wise re-calibration on the target frame embeddings according to local temporal correlations and reliable references respectively.

Object Semantic Segmentation +2

Towards Robust Video Object Segmentation with Adaptive Object Calibration

1 code implementation 2 Jul 2022 Xiaohao Xu, Jinglu Wang, Xiang Ming, Yan Lu

We consolidate this conditional mask calibration process in a progressive manner, where the object representations and proto-masks evolve to be discriminative iteratively.

Object Segmentation +5

Online Video Instance Segmentation via Robust Context Fusion

no code implementations 12 Jul 2022 Xiang Li, Jinglu Wang, Xiaohao Xu, Bhiksha Raj, Yan Lu

We propose a robust context fusion network to tackle VIS in an online fashion, which predicts instance segmentation frame-by-frame with a few preceding frames.

Instance Segmentation Segmentation +2

Neural Capture of Animatable 3D Human from Monocular Video

no code implementations 18 Aug 2022 Gusi Te, Xiu Li, Xiao Li, Jinglu Wang, Wei Hu, Yan Lu

We present a novel paradigm of building an animatable 3D human representation from a monocular video input, such that it can be rendered in any unseen poses and views.

Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction

no code implementations CVPR 2023 Mingfang Zhang, Jinglu Wang, Xiao Li, Yifei HUANG, Yoichi Sato, Yan Lu

The Multiplane Image (MPI), containing a set of fronto-parallel RGBA layers, is an effective and efficient representation for view synthesis from sparse inputs.

3D Reconstruction

Two-shot Video Object Segmentation

1 code implementation CVPR 2023 Kun Yan, Xiao Li, Fangyun Wei, Jinglu Wang, Chenbin Zhang, Ping Wang, Yan Lu

The underlying idea is to generate pseudo labels for unlabeled frames during training and to optimize the model on the combination of labeled and pseudo-labeled data.

Object Pseudo Label +5

Rethinking Voice-Face Correlation: A Geometry View

no code implementations 26 Jul 2023 Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj

Previous works on voice-face matching and voice-guided face synthesis demonstrate strong correlations between voice and face, but mainly rely on coarse semantic cues such as gender, age, and emotion.

3D Face Reconstruction Face Generation

Efficient View Synthesis with Neural Radiance Distribution Field

no code implementations ICCV 2023 Yushuang Wu, Xiao Li, Jinglu Wang, Xiaoguang Han, Shuguang Cui, Yan Lu

Specifically, we use a small network similar to NeRF while preserving the rendering speed with a single network forwarding per pixel as in NeLF.

Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition

3 code implementations 29 Sep 2023 Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj

We propose a semantic decomposition method based on product quantization, where the multi-source semantics can be decomposed and represented by several disentangled and noise-suppressed single-source semantics.

Quantization

$\text{R}^2$-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

2 code implementations 7 Mar 2024 Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazaki, Hao Chen, Xiaonan Huang, Bhiksha Raj

Referring perception, which aims at grounding visual objects with multimodal referring guidance, is essential for bridging the gap between humans, who provide instructions, and the environment where intelligent systems perceive.

Benchmarking
