Search Results for author: Songyou Peng

Found 31 papers, 19 papers with code

OpenDAS: Domain Adaptation for Open-Vocabulary Segmentation

no code implementations • 30 May 2024 • Gonca Yilmaz, Songyou Peng, Francis Engelmann, Marc Pollefeys, Hermann Blum

We, therefore, introduce a new task domain adaptation for open-vocabulary segmentation, enhancing VLMs with domain-specific priors while preserving their open-vocabulary nature.

Paper
Add Code

3D Neural Edge Reconstruction

no code implementations • 29 May 2024 • Lei LI, Songyou Peng, Zehao Yu, Shaohui Liu, Rémi Pautrat, Xiaochuan Yin, Marc Pollefeys

Real-world objects and environments are predominantly composed of edge features, including straight lines and curves.

Paper
Add Code

NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild

1 code implementation • 29 May 2024 • Weining Ren, Zihan Zhu, Boyang Sun, Jiaqi Chen, Marc Pollefeys, Songyou Peng

Neural Radiance Fields (NeRFs) have shown remarkable success in synthesizing photorealistic views from multi-view images of static scenes, but face challenges in dynamic, real-world environments with distractors like moving objects, shadows, and lighting changes.

Paper
Code

When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models

no code implementations • 16 May 2024 • Xianzheng Ma, Yash Bhalgat, Brandon Smart, Shuai Chen, Xinghui Li, Jian Ding, Jindong Gu, Dave Zhenyu Chen, Songyou Peng, Jia-Wang Bian, Philip H Torr, Marc Pollefeys, Matthias Nießner, Ian D Reid, Angel X. Chang, Iro Laina, Victor Adrian Prisacariu

Hence, with this paper, we aim to chart a course for future research that explores and expands the capabilities of 3D-LLMs in understanding and interacting with the complex 3D world.

In-Context Learning Question Answering +2

Paper
Add Code

NeRF in Robotics: A Survey

no code implementations • 2 May 2024 • Guangming Wang, Lei Pan, Songyou Peng, Shaohui Liu, Chenfeng Xu, Yanzi Miao, Wei Zhan, Masayoshi Tomizuka, Marc Pollefeys, Hesheng Wang

Meticulous 3D environment representations have been a longstanding goal in computer vision and robotics fields.

Paper
Add Code

Renovating Names in Open-Vocabulary Segmentation Benchmarks

no code implementations • 14 Mar 2024 • Haiwen Huang, Songyou Peng, Dan Zhang, Andreas Geiger

Names are essential to both human cognition and vision-language models.

Segmentation

Paper
Add Code

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding

no code implementations • 23 Feb 2024 • Francis Engelmann, Ayca Takmaz, Jonas Schult, Elisabetta Fedele, Johanna Wald, Songyou Peng, Xi Wang, Or Litany, Siyu Tang, Federico Tombari, Marc Pollefeys, Leonidas Guibas, Hongbo Tian, Chunjie Wang, Xiaosheng Yan, Bingwen Wang, Xuanyang Zhang, Xiao Liu, Phuc Nguyen, Khoi Nguyen, Anh Tran, Cuong Pham, Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby

This report provides an overview of the challenge hosted at the OpenSUN3D Workshop on Open-Vocabulary 3D Scene Understanding held in conjunction with ICCV 2023.

Scene Understanding

Paper
Add Code

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

no code implementations • 28 Dec 2023 • Rui Huang, Songyou Peng, Ayca Takmaz, Federico Tombari, Marc Pollefeys, Shiji Song, Gao Huang, Francis Engelmann

Therefore, we explore the use of image segmentation foundation models to automatically generate training labels for 3D segmentation.

Image Segmentation Scene Segmentation +1

Paper
Add Code

Ternary-type Opacity and Hybrid Odometry for RGB-only NeRF-SLAM

no code implementations • 20 Dec 2023 • Junru Lin, Asen Nachkov, Songyou Peng, Luc van Gool, Danda Pani Paudel

To foster this line of research, we also propose a simple yet novel visual odometry scheme that uses a hybrid combination of volumetric and warping-based image renderings.

Visual Odometry

Paper
Add Code

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization

1 code implementation • 10 Nov 2023 • Weiyang Liu, Zeju Qiu, Yao Feng, Yuliang Xiu, Yuxuan Xue, Longhui Yu, Haiwen Feng, Zhen Liu, Juyeon Heo, Songyou Peng, Yandong Wen, Michael J. Black, Adrian Weller, Bernhard Schölkopf

We apply this parameterization to OFT, creating a novel parameter-efficient finetuning method, called Orthogonal Butterfly (BOFT).

2,011

Paper
Code

NICER-SLAM: Neural Implicit Scene Encoding for RGB SLAM

no code implementations • 7 Feb 2023 • Zihan Zhu, Songyou Peng, Viktor Larsson, Zhaopeng Cui, Martin R. Oswald, Andreas Geiger, Marc Pollefeys

Neural implicit representations have recently become popular in simultaneous localization and mapping (SLAM), especially in dense visual SLAM.

3D Scene Reconstruction Novel View Synthesis +2

Paper
Add Code

OpenScene: 3D Scene Understanding with Open Vocabularies

1 code implementation • CVPR 2023 • Songyou Peng, Kyle Genova, Chiyu "Max" Jiang, Andrea Tagliasacchi, Marc Pollefeys, Thomas Funkhouser

Traditional 3D scene understanding approaches rely on labeled 3D datasets to train a model for a single task with supervision.

Ranked #5 on 3D Open-Vocabulary Instance Segmentation on Replica

3D Open-Vocabulary Instance Segmentation 3D Semantic Segmentation +1

566

Paper
Code

FastHuman: Reconstructing High-Quality Clothed Human in Minutes

no code implementations • 26 Nov 2022 • Lixiang Lin, Songyou Peng, Qijun Gan, Jianke Zhu

We propose an approach for optimizing high-quality clothed human body shapes in minutes, using multi-view posed images.

Neural Rendering Vocal Bursts Intensity Prediction

Paper
Add Code

DiffDreamer: Towards Consistent Unsupervised Single-view Scene Extrapolation with Conditional Diffusion Models

no code implementations • ICCV 2023 • Shengqu Cai, Eric Ryan Chan, Songyou Peng, Mohamad Shahbazi, Anton Obukhov, Luc van Gool, Gordon Wetzstein

Scene extrapolation -- the idea of generating novel views by flying into a given image -- is a promising, yet challenging task.

Ranked #1 on Perpetual View Generation on LHQ

Denoising Perpetual View Generation

Paper
Add Code

3D Textured Shape Recovery with Learned Geometric Priors

1 code implementation • 7 Sep 2022 • Lei LI, Zhizheng Liu, Weining Ren, Liudi Yang, Fangjinhua Wang, Marc Pollefeys, Songyou Peng

3D textured shape recovery from partial scans is crucial for many real-world applications.

Pose Prediction

Paper
Code

MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction

1 code implementation • 1 Jun 2022 • Zehao Yu, Songyou Peng, Michael Niemeyer, Torsten Sattler, Andreas Geiger

Motivated by recent advances in the area of monocular geometry prediction, we systematically explore the utility these cues provide for improving neural implicit surface reconstruction.

3D Reconstruction Multi-View 3D Reconstruction +1

543

Paper
Code

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

1 code implementation • CVPR 2022 • Zihan Zhu, Songyou Peng, Viktor Larsson, Weiwei Xu, Hujun Bao, Zhaopeng Cui, Martin R. Oswald, Marc Pollefeys

Neural implicit representations have recently shown encouraging results in various domains, including promising progress in simultaneous localization and mapping (SLAM).

Simultaneous Localization and Mapping

1,371

Paper
Code

Shape As Points: A Differentiable Poisson Solver

1 code implementation • NeurIPS 2021 • Songyou Peng, Chiyu "Max" Jiang, Yiyi Liao, Michael Niemeyer, Marc Pollefeys, Andreas Geiger

However, the implicit nature of neural implicit representations results in slow inference time and requires careful initialization.

3D Reconstruction Surface Reconstruction

545

Paper
Code

UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction

2 code implementations • ICCV 2021 • Michael Oechsle, Songyou Peng, Andreas Geiger

At the same time, neural radiance fields have revolutionized novel view synthesis.

3D Object Reconstruction Novel View Synthesis +1

848

Paper
Code

KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

4 code implementations • ICCV 2021 • Christian Reiser, Songyou Peng, Yiyi Liao, Andreas Geiger

NeRF synthesizes novel views of a scene with unprecedented quality by fitting a neural radiance field to RGB images.

602

Paper
Code

Dynamic Plane Convolutional Occupancy Networks

1 code implementation • 11 Nov 2020 • Stefan Lionar, Daniil Emtsev, Dusan Svilarkovic, Songyou Peng

To further exploit translational equivariance, convolutional neural networks are applied to process the plane features.

Ranked #4 on 3D Reconstruction on ShapeNet

3D Reconstruction Surface Reconstruction

Paper
Code

Convolutional Occupancy Networks

6 code implementations • ECCV 2020 • Songyou Peng, Michael Niemeyer, Lars Mescheder, Marc Pollefeys, Andreas Geiger

Recently, implicit neural representations have gained popularity for learning-based 3D reconstruction.

3D Reconstruction

1,231

Paper
Code

DIST: Rendering Deep Implicit Signed Distance Function with Differentiable Sphere Tracing

1 code implementation • CVPR 2020 • Shaohui Liu, yinda zhang, Songyou Peng, Boxin Shi, Marc Pollefeys, Zhaopeng Cui

We propose a differentiable sphere tracing algorithm to bridge the gap between inverse graphics methods and the recently proposed deep learning based implicit signed distance function.

215

Paper
Code

A Deep Framework for Bone Age Assessment based on Finger Joint Localization

no code implementations • 7 May 2019 • Xiaoman Zhang, Ziyuan Zhao, Cen Chen, Songyou Peng, Min Wu, Zhongyao Cheng, Singee Teo, Le Zhang, Zeng Zeng

In this study, we applied powerful deep neural network and explored a process in the forecast of skeletal bone age with the specifically combine joints images to increase the performance accuracy compared with the whole hand images.

Paper
Add Code

Semi-Supervised Self-Taught Deep Learning for Finger Bones Segmentation

1 code implementation • 12 Mar 2019 • Ziyuan Zhao, Xiaoman Zhang, Cen Chen, Wei Li, Songyou Peng, Jie Wang, Xulei Yang, Le Zhang, Zeng Zeng

Segmentation stands at the forefront of many high-level vision tasks.

Paper
Code

A Hybrid SLAM and Object Recognition System for Pepper Robot

1 code implementation • 2 Mar 2019 • Paola Ardón, Kaisar Kushibar, Songyou Peng

Providing robust solutions for the tasks such as indoor environment mapping, self-localisation and object recognition are essential to make the robots to be more autonomous, hence, more human-like.

Robotics

Paper
Code

PersEmoN: A Deep Network for Joint Analysis of Apparent Personality, Emotion and Their Relationship

1 code implementation • 21 Nov 2018 • Le Zhang, Songyou Peng, Stefan Winkler

Apparent personality and emotion analysis are both central to affective computing.

Emotion Recognition Multi-Task Learning

Paper
Code

Calibration Wizard: A Guidance System for Camera Calibration Based on Modelling Geometric and Corner Uncertainty

1 code implementation • ICCV 2019 • Songyou Peng, Peter Sturm

It is well known that the accuracy of a calibration depends strongly on the choice of camera poses from which images of a calibration object are acquired.

Camera Calibration Position

100

Paper
Code

Photometric Depth Super-Resolution

1 code implementation • 26 Sep 2018 • Bjoern Haefner, Songyou Peng, Alok Verma, Yvain Quéau, Daniel Cremers

This study explores the use of photometric techniques (shape-from-shading and uncalibrated photometric stereo) for upsampling the low-resolution depth map from an RGB-D sensor to the higher resolution of the companion RGB image.

Super-Resolution

Paper
Code

A Deep Network for Arousal-Valence Emotion Prediction with Acoustic-Visual Cues

1 code implementation • 2 May 2018 • Songyou Peng, Le Zhang, Yutong Ban, Meng Fang, Stefan Winkler

In this paper, we comprehensively describe the methodology of our submissions to the One-Minute Gradual-Emotion Behavior Challenge 2018.

Paper
Code

Depth Super-Resolution Meets Uncalibrated Photometric Stereo

1 code implementation • 1 Aug 2017 • Songyou Peng, Bjoern Haefner, Yvain Quéau, Daniel Cremers

A novel depth super-resolution approach for RGB-D sensors is presented.

Super-Resolution

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.