Search Results for author: Yanmin Wu

Found 18 papers, 9 papers with code

SecureGS: Boosting the Security and Fidelity of 3D Gaussian Splatting Steganography

no code implementations8 Mar 2025 Xuanyu Zhang, Jiarui Meng, Zhipei Xu, Shuzhou Yang, Yanmin Wu, Ronggang Wang, Jian Zhang

3D Gaussian Splatting (3DGS) has emerged as a premier method for 3D representation due to its real-time rendering and high-quality outputs, underscoring the critical need to protect the privacy of 3D assets.

3DGS NeRF +1

RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians

1 code implementation3 Dec 2024 Qiankun Gao, Yanmin Wu, Chengxiang Wen, Jiarui Meng, Luyang Tang, Jie Chen, Ronggang Wang, Jian Zhang

Finally, we jointly learn the scene's temporal motion and refine the canonical Gaussians learned from the first two stages.

3DGS

InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception

no code implementations28 Nov 2024 Haijie Li, Yanmin Wu, Jiarui Meng, Qiankun Gao, Zhiyao Zhang, Ronggang Wang, Jian Zhang

3D scene understanding has become an essential area of research with applications in autonomous driving, robotics, and augmented reality.

3DGS Autonomous Driving +4

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

no code implementations19 Sep 2024 Dongzhi Jiang, Renrui Zhang, Ziyu Guo, Yanmin Wu, Jiayi Lei, Pengshuo Qiu, Pan Lu, Zehui Chen, Chaoyou Fu, Guanglu Song, Peng Gao, Yu Liu, Chunyuan Li, Hongsheng Li

We further present error analysis to unveil current LMMs still struggle to fully grasp the multimodal search tasks, and conduct ablation study to indicate the potential of scaling test-time computation for AI search engine.

Benchmarking

OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding

no code implementations4 Jun 2024 Yanmin Wu, Jiarui Meng, Haijie Li, Chenming Wu, Yahao Shi, Xinhua Cheng, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Jian Zhang

To ensure robust feature presentation and 3D point-level understanding, we first employ SAM masks without cross-frame associations to train instance features with 3D consistency.

3DGS Object

Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation

1 code implementation31 May 2024 Shuzhou Yang, Yu Wang, Haijie Li, Jiarui Meng, Yanmin Wu, Xiandong Meng, Jian Zhang

We note that there is a disparity between the generation priors of these two diffusion models, leading to their different appearance outputs.

3D Generation Image to 3D

S3-SLAM: Sparse Tri-plane Encoding for Neural Implicit SLAM

no code implementations28 Apr 2024 Zhiyao Zhang, Yunzhou Zhang, Yanmin Wu, Bin Zhao, Xingshuo Wang, Rui Tian

With the emergence of Neural Radiance Fields (NeRF), neural implicit representations have gained widespread applications across various domains, including simultaneous localization and mapping.

NeRF Simultaneous Localization and Mapping

Language-Assisted 3D Scene Understanding

no code implementations18 Dec 2023 Yanmin Wu, Qiankun Gao, Renrui Zhang, Jian Zhang

The scale and quality of point cloud datasets constrain the advancement of point cloud learning.

3D Object Detection 3D Semantic Segmentation +6

GIR: 3D Gaussian Inverse Rendering for Relightable Scene Factorization

1 code implementation8 Dec 2023 Yahao Shi, Yanmin Wu, Chenming Wu, Xing Liu, Chen Zhao, Haocheng Feng, Jian Zhang, Bin Zhou, Errui Ding, Jingdong Wang

Our method achieves state-of-the-art performance in both relighting and novel view synthesis tasks among the recently proposed inverse rendering methods while achieving real-time rendering.

Disentanglement Inverse Rendering +1

UniQuadric: A SLAM Backend for Unknown Rigid Object 3D Tracking and Light-Weight Modeling

no code implementations29 Sep 2023 Linghao Yang, Yanmin Wu, Yu Deng, Rui Tian, Xinggang Hu, Tiefeng Ma

Subsequently, in the part of object state estimation, we propose a tightly coupled optimization model for object pose and scale estimation, incorporating hybrids constraints into a novel dual sliding window optimization framework for joint estimation.

Motion Estimation Object +2

An Object SLAM Framework for Association, Mapping, and High-Level Tasks

no code implementations12 May 2023 Yanmin Wu, Yunzhou Zhang, Delong Zhu, Zhiqiang Deng, Wenkai Sun, Xin Chen, Jian Zhang

Taking into consideration the semantic invariance of objects, we convert the object map to a topological map to provide semantic descriptors to enable multi-map matching.

Decision Making Object +2

BSH-Det3D: Improving 3D Object Detection with BEV Shape Heatmap

1 code implementation3 Mar 2023 You Shen, Yunzhou Zhang, Yanmin Wu, Zhenyu Wang, Linghao Yang, Sonya Coleman, Dermot Kerr

Specifically, we design the Pillar-based Shape Completion (PSC) module to predict the probability of occupancy whether a pillar contains object shapes.

3D Object Detection Autonomous Driving +2

Panoptic Compositional Feature Field for Editable Scene Rendering With Network-Inferred Labels via Metric Learning

no code implementations CVPR 2023 Xinhua Cheng, Yanmin Wu, Mengxi Jia, Qian Wang, Jian Zhang

In this work, we attempt to learn an object-compositional neural implicit representation for editable scene rendering by leveraging labels inferred from the off-the-shelf 2D panoptic segmentation networks instead of the ground truth annotations.

2D Panoptic Segmentation Metric Learning +2

EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding

3 code implementations CVPR 2023 Yanmin Wu, Xinhua Cheng, Renrui Zhang, Zesen Cheng, Jian Zhang

3D visual grounding aims to find the object within point clouds mentioned by free-form natural language descriptions with rich semantic cues.

3D visual grounding Object +1

Object SLAM-Based Active Mapping and Robotic Grasping

1 code implementation3 Dec 2020 Yanmin Wu, Yunzhou Zhang, Delong Zhu, Xin Chen, Sonya Coleman, Wenkai Sun, Xinggang Hu, Zhiqiang Deng

The framework is built on an object SLAM system integrated with a simultaneous multi-object pose estimation process that is optimized for robotic grasping.

Object Object SLAM +2

EAO-SLAM: Monocular Semi-Dense Object SLAM Based on Ensemble Data Association

2 code implementations27 Apr 2020 Yanmin Wu, Yunzhou Zhang, Delong Zhu, Yonghui Feng, Sonya Coleman, Dermot Kerr

Object-level data association and pose estimation play a fundamental role in semantic SLAM, which remain unsolved due to the lack of robust and accurate algorithms.

Object Object SLAM +2

Cannot find the paper you are looking for? You can Submit a new open access paper.