Search Results for author: Benjamin Planche

Found 31 papers, 1 papers with code

Anatomy-Aware Conditional Image-Text Retrieval

no code implementations10 Mar 2025 Meng Zheng, Jiajin Zhang, Benjamin Planche, Zhongpai Gao, Terrence Chen, Ziyan Wu

In this paper, we propose an Anatomical Location-Conditioned Image-Text Retrieval (ALC-ITR) framework, which, given a query image and the associated suspicious anatomical region(s), aims to retrieve similar patient cases exhibiting the same disease or symptoms in the same anatomical region.

Anatomy Contrastive Learning +2

Neural Finite-State Machines for Surgical Phase Recognition

no code implementations27 Nov 2024 Hao Ding, Zhongpai Gao, Benjamin Planche, Tianyu Luan, Abhishek Sharma, Meng Zheng, Ange Lou, Terrence Chen, Mathias Unberath, Ziyan Wu

Surgical phase recognition (SPR) is crucial for applications in workflow optimization, performance evaluation, and real-time intervention guidance.

Surgical phase recognition

Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding

no code implementations25 Nov 2024 Andong Deng, Zhongpai Gao, Anwesa Choudhuri, Benjamin Planche, Meng Zheng, Bin Wang, Terrence Chen, Chen Chen, Ziyan Wu

Temporal awareness is essential for video large language models (LLMs) to understand and reason about events within long videos, enabling applications like dense video captioning and temporal video grounding in a unified system.

Dense Video Captioning Transfer Learning +1

DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction

no code implementations18 Oct 2024 Ange Lou, Benjamin Planche, Zhongpai Gao, Yamin Li, Tianyu Luan, Hao Ding, Meng Zheng, Terrence Chen, Ziyan Wu, Jack Noble

Numerous recent approaches to modeling and re-rendering dynamic scenes leverage plane-based explicit representations, addressing slow training times associated with models like neural radiance fields (NeRF) and Gaussian splatting (GS).

NeRF Novel View Synthesis

Order-aware Interactive Segmentation

no code implementations16 Oct 2024 Bin Wang, Anwesa Choudhuri, Meng Zheng, Zhongpai Gao, Benjamin Planche, Andong Deng, Qin Liu, Terrence Chen, Ulas Bagci, Ziyan Wu

However, current methods often fail to accurately separate target objects from the background, due to a limited understanding of order, the relative depth between objects in a scene.

Interactive Segmentation Segmentation

3D Vision-Language Gaussian Splatting

no code implementations10 Oct 2024 Qucheng Peng, Benjamin Planche, Zhongpai Gao, Meng Zheng, Anwesa Choudhuri, Terrence Chen, Chen Chen, Ziyan Wu

Recent advancements in 3D reconstruction methods and vision-language models have propelled the development of multi-modal 3D scene understanding, which has vital applications in robotics, autonomous driving, and virtual/augmented reality.

3D Reconstruction Autonomous Driving +5

6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering

no code implementations7 Oct 2024 Zhongpai Gao, Benjamin Planche, Meng Zheng, Anwesa Choudhuri, Terrence Chen, Ziyan Wu

Novel view synthesis has advanced significantly with the development of neural radiance fields (NeRF) and 3D Gaussian splatting (3DGS).

3DGS NeRF +1

Few-Shot 3D Volumetric Segmentation with Multi-Surrogate Fusion

no code implementations26 Aug 2024 Meng Zheng, Benjamin Planche, Zhongpai Gao, Terrence Chen, Richard J. Radke, Ziyan Wu

Conventional 3D medical image segmentation methods typically require learning heavy 3D networks (e. g., 3D-UNet), as well as large amounts of in-domain data with accurate pixel/voxel-level labels to avoid overfitting.

Anatomy Image Segmentation +3

Automated Patient Positioning with Learned 3D Hand Gestures

no code implementations20 Jul 2024 Zhongpai Gao, Abhishek Sharma, Meng Zheng, Benjamin Planche, Terrence Chen, Ziyan Wu

In this work, we propose an automated patient positioning system that utilizes a camera to detect specific hand gestures from technicians, allowing users to indicate the target patient region to the system and initiate automated positioning.

Gesture Recognition Hand Detection

Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images

no code implementations12 Jul 2024 Tianyu Luan, Zhongpai Gao, Luyuan Xie, Abhishek Sharma, Hao Ding, Benjamin Planche, Meng Zheng, Ange Lou, Terrence Chen, Junsong Yuan, Ziyan Wu

Traditional top-down methods, relying on whole-body parametric models like SMPL, falter when only a small part of the human is visible, as they require visibility of most of the human body for accurate mesh reconstruction.

DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering

no code implementations4 Jun 2024 Zhongpai Gao, Benjamin Planche, Meng Zheng, Xiao Chen, Terrence Chen, Ziyan Wu

Digitally reconstructed radiographs (DRRs) are simulated 2D X-ray images generated from 3D CT volumes, widely used in preoperative settings but limited in intraoperative applications due to computational bottlenecks, especially for accurate but heavy physics-based Monte Carlo methods.

3DGS

Automating Catheterization Labs with Real-Time Perception

no code implementations9 Mar 2024 Fan Yang, Benjamin Planche, Meng Zheng, Cheng Chen, Terrence Chen, Ziyan Wu

For decades, three-dimensional C-arm Cone-Beam Computed Tomography (CBCT) imaging system has been a critical component for complex vascular and nonvascular interventional procedures.

Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion

no code implementations5 Mar 2024 Meng Zheng, Benjamin Planche, Xuan Gong, Fan Yang, Terrence Chen, Ziyan Wu

3D patient body modeling is critical to the success of automated patient positioning for smart medical scanning and operating rooms.

Keypoint Detection

DaReNeRF: Direction-aware Representation for Dynamic Scenes

no code implementations CVPR 2024 Ange Lou, Benjamin Planche, Zhongpai Gao, Yamin Li, Tianyu Luan, Hao Ding, Terrence Chen, Jack Noble, Ziyan Wu

However, the straightforward decomposition of 4D dynamic scenes into multiple 2D plane-based representations proves insufficient for re-rendering high-fidelity scenes with complex motions.

NeRF Novel View Synthesis

PBADet: A One-Stage Anchor-Free Approach for Part-Body Association

no code implementations12 Feb 2024 Zhongpai Gao, Huayi Zhou, Abhishek Sharma, Meng Zheng, Benjamin Planche, Terrence Chen, Ziyan Wu

The detection of human parts (e. g., hands, face) and their correct association with individuals is an essential task, e. g., for ubiquitous human-machine interfaces and action recognition.

Action Recognition

Implicit Modeling of Non-rigid Objects with Cross-Category Signals

no code implementations15 Dec 2023 Yuchun Liu, Benjamin Planche, Meng Zheng, Zhongpai Gao, Pierre Sibut-Bourde, Fan Yang, Terrence Chen, Ziyan Wu

To effectively capture the interrelation between these entities and ensure precise, collision-free representations, our approach facilitates signaling between category-specific fields to adequately rectify shapes.

Object

Disguise without Disruption: Utility-Preserving Face De-Identification

no code implementations23 Mar 2023 Zikui Cai, Zhongpai Gao, Benjamin Planche, Meng Zheng, Terrence Chen, M. Salman Asif, Ziyan Wu

We extensively evaluate our method using multiple datasets, demonstrating a higher de-identification rate and superior consistency compared to prior approaches in various downstream tasks.

De-identification Ensemble Learning

Exploring Cycle Consistency Learning in Interactive Volume Segmentation

1 code implementation11 Mar 2023 Qin Liu, Meng Zheng, Benjamin Planche, Zhongpai Gao, Terrence Chen, Marc Niethammer, Ziyan Wu

Given a medical volume, a user first segments a slice (or several slices) via the interaction module and then propagates the segmentation(s) to the remaining slices.

Segmentation

Progressive Multi-view Human Mesh Recovery with Self-Supervision

no code implementations10 Dec 2022 Xuan Gong, Liangchen Song, Meng Zheng, Benjamin Planche, Terrence Chen, Junsong Yuan, David Doermann, Ziyan Wu

To date, little attention has been given to multi-view 3D human mesh estimation, despite real-life applicability (e. g., motion capture, sport analysis) and robustness to single-view ambiguities.

Benchmarking Diversity +1

Self-supervised Human Mesh Recovery with Cross-Representation Alignment

no code implementations10 Sep 2022 Xuan Gong, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, David Doermann, Ziyan Wu

However, on synthetic dense correspondence maps (i. e., IUV) few have been explored since the domain gap between synthetic training data and real testing data is hard to address for 2D dense representation.

Diversity Human Mesh Recovery

PseudoClick: Interactive Image Segmentation with Click Imitation

no code implementations12 Jul 2022 Qin Liu, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, Marc Niethammer, Ziyan Wu

The goal of click-based interactive image segmentation is to obtain precise object segmentation masks with limited user interaction, i. e., by a minimal number of user clicks.

Image Segmentation Segmentation +1

SMPL-A: Modeling Person-Specific Deformable Anatomy

no code implementations CVPR 2022 Hengtao Guo, Benjamin Planche, Meng Zheng, Srikrishna Karanam, Terrence Chen, Ziyan Wu

In order to obtain accurate target location information, clinicians have to either conduct frequent intraoperative scans, resulting in higher exposition of patients to radiations, or adopt proxy procedures (e. g., creating and using custom molds to keep patients in the exact same pose during both preoperative organ scanning and subsequent treatment.

Anatomy Diagnostic +1

Physics-based Differentiable Depth Sensor Simulation

no code implementations ICCV 2021 Benjamin Planche, Rajat Vikram Singh

Gradient-based algorithms are crucial to modern computer-vision and graphics applications, enabling learning-based optimization and inverse problems.

Domain Adaptation Pose Estimation +2

AI on the Bog: Monitoring and Evaluating Cranberry Crop Risk

no code implementations8 Nov 2020 Peri Akiva, Benjamin Planche, Aditi Roy, Kristin Dana, Peter Oudemans, Michael Mars

Toward this goal, we propose two main deep learning-based modules for: 1) cranberry fruit segmentation to delineate the exact fruit regions in the cranberry field image that are exposed to sun, 2) prediction of cloud coverage conditions and sun irradiance to estimate the inner temperature of exposed cranberries.

3D Object Instance Recognition and Pose Estimation Using Triplet Loss with Dynamic Margin

no code implementations9 Apr 2019 Sergey Zakharov, Wadim Kehl, Benjamin Planche, Andreas Hutter, Slobodan Ilic

In this paper, we address the problem of 3D object instance recognition and pose estimation of localized objects in cluttered environments using convolutional neural networks.

Pose Estimation Triplet

Incremental Scene Synthesis

no code implementations NeurIPS 2019 Benjamin Planche, Xuejian Rong, Ziyan Wu, Srikrishna Karanam, Harald Kosch, YingLi Tian, Jan Ernst, Andreas Hutter

We present a method to incrementally generate complete 2D or 3D scenes with the following properties: (a) it is globally consistent at each step according to a learned scene prior, (b) real observations of a scene can be incorporated while observing global consistency, (c) unobserved regions can be hallucinated locally in consistence with previous observations, hallucinations and global priors, and (d) hallucinations are statistical in nature, i. e., different scenes can be generated from the same observations.

Autonomous Navigation Hallucination

Seeing Beyond Appearance - Mapping Real Images into Geometrical Domains for Unsupervised CAD-based Recognition

no code implementations9 Oct 2018 Benjamin Planche, Sergey Zakharov, Ziyan Wu, Andreas Hutter, Harald Kosch, Slobodan Ilic

Applying our approach to object recognition from texture-less CAD data, we present a custom generative network which fully utilizes the purely geometrical information to learn robust features and achieve a more refined mapping for unseen color images.

Denoising Domain Adaptation +1

Keep it Unreal: Bridging the Realism Gap for 2.5D Recognition with Geometry Priors Only

no code implementations24 Apr 2018 Sergey Zakharov, Benjamin Planche, Ziyan Wu, Andreas Hutter, Harald Kosch, Slobodan Ilic

With the increasing availability of large databases of 3D CAD models, depth-based recognition methods can be trained on an uncountable number of synthetically rendered images.

Generative Adversarial Network

Cannot find the paper you are looking for? You can Submit a new open access paper.