no code implementations • 27 Mar 2025 • Samra Irshad, Seungkyu Lee, Nassir Navab, Hong Joo Lee, Seong Tae Kim
The translation network encodes the characteristics of damaged signs into a latent `damage style code'.
no code implementations • 22 Mar 2025 • Shengyun Si, Xinpeng Wang, Guangyao Zhai, Nassir Navab, Barbara Plank
Recent advancements in large language models (LLMs) have demonstrated that fine-tuning and human alignment can render LLMs harmless.
no code implementations • 20 Mar 2025 • Beilei Cui, Long Bai, Mobarakol Islam, An Wang, Zhiqi Ma, Yiming Huang, Feng Li, Zhen Chen, Zhongliang Jiang, Nassir Navab, Hongliang Ren
Additionally, we propose a 3D scene reconstruction pipeline that optimizes depth maps' scales, shifts, and a few parameters based on our integrated network.
1 code implementation • 17 Mar 2025 • Tony Danjun Wang, Lennart Bastian, Tobias Czempiel, Christian Heiliger, Nassir Navab
When used to augment markerless personnel tracking, our approach improves accuracy by over 50%.
1 code implementation • 10 Mar 2025 • Luis D. Reyes Vargas, Martin J. Menten, Johannes C. Paetzold, Nassir Navab, Mohammad Farid Azampour
Skeletonization extracts thin representations from images that compactly encode their geometry and topology.
no code implementations • 4 Mar 2025 • Paul Stangel, David Bani-Harouni, Chantal Pellegrini, Ege Özsoy, Kamilia Zaripova, Matthias Keicher, Nassir Navab
A safe and trustworthy use of Large Language Models (LLMs) requires an accurate expression of confidence in their answers.
1 code implementation • 4 Mar 2025 • Ege Özsoy, Chantal Pellegrini, Tobias Czempiel, Felix Tristram, Kun Yuan, David Bani-Harouni, Ulrich Eck, Benjamin Busam, Matthias Keicher, Nassir Navab
Operating rooms (ORs) are complex, high-stakes environments requiring precise understanding of interactions among medical staff, tools, and equipment for enhancing surgical assistance, situational awareness, and patient safety.
Ranked #1 on
Video Panoptic Segmentation
on 4D-OR
(using extra training data)
no code implementations • 17 Feb 2025 • Klara Reichard, Giulia Rizzoli, Stefano Gasperini, Lukas Hoyer, Pietro Zanuttigh, Nassir Navab, Federico Tombari
Open-vocabulary semantic segmentation enables models to identify novel object categories beyond their training data.
Open Vocabulary Semantic Segmentation
Open-Vocabulary Semantic Segmentation
+1
no code implementations • 13 Feb 2025 • Lukas Buess, Matthias Keicher, Nassir Navab, Andreas Maier, Soroosh Tayebi Arasteh
The field has advanced rapidly, evolving from text-only large language models for tasks such as clinical documentation and decision support to multimodal AI systems capable of integrating diverse data modalities, including imaging, text, and structured data, within a single model.
no code implementations • 6 Feb 2025 • Weihang Li, Hongli Xu, Junwen Huang, HyunJun Jung, Peter KT Yu, Nassir Navab, Benjamin Busam
In this paper, we present GCE-Pose, a method that enhances pose estimation for novel instances by integrating category-level global context prior.
no code implementations • 4 Feb 2025 • Yaling Shen, Zhixiong Zhuang, Kun Yuan, Maria-Irina Nicolae, Nassir Navab, Nicolas Padoy, Mario Fritz
Experiments on the IU X-RAY and MIMIC-CXR radiology datasets demonstrate that Adversarial Domain Alignment enables attackers to steal the medical MLLM without any access to medical data.
1 code implementation • 20 Jan 2025 • Guankun Wang, Long Bai, Junyi Wang, Kun Yuan, Zhen Li, Tianxu Jiang, Xiting He, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu, Jiazheng Wang, Fan Zhang, Nicolas Padoy, Nassir Navab, Hongliang Ren
Recently, Multimodal Large Language Models (MLLMs) have demonstrated their immense potential in computer-aided diagnosis and decision-making.
1 code implementation • 16 Jan 2025 • Tingxuan Chen, Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas Padoy
Conclusion: We propose a text-driven adaptation approach that mitigates the modality gap and handles multiple downstream tasks in surgical workflow analysis, with minimal reliance on large annotated datasets.
no code implementations • 10 Jan 2025 • Felix Duelmer, Mohammad Farid Azampour, Nassir Navab
We propose a novel ultrasound simulation pipeline that utilizes a ray tracing algorithm to generate echo data, tracing each ray from the transducer through the scene and back to the sensor.
no code implementations • 30 Dec 2024 • Yousef Yeganeh, Ioannis Charisiadis, Marta Hasny, Martin Hartenberger, Björn Ommer, Nassir Navab, Azade Farshad, Ehsan Adeli
Scaling by training on large datasets has been shown to enhance the quality and fidelity of image generation and manipulation with diffusion models; however, such large datasets are not always accessible in medical imaging due to cost and privacy issues, which contradicts one of the main applications of such models to produce synthetic samples where real data is scarce.
no code implementations • 29 Dec 2024 • Yousef Yeganeh, Rui Xiao, Goktug Guvercin, Nassir Navab, Azade Farshad
While conventional computer vision emphasizes pixel-level and feature-based objectives, medical image analysis of intricate biological structures necessitates explicit representation of their complex topological properties.
no code implementations • 13 Dec 2024 • Siyun Liang, Sen Wang, Kunyi Li, Michael Niemeyer, Stefano Gasperini, Nassir Navab, Federico Tombari
3D Gaussian Splatting has recently gained traction for its efficient training and real-time rendering.
no code implementations • 1 Dec 2024 • Burak Bekci, Nassir Navab, Federico Tombari, Mahdi Saleh
Shape completion, a crucial task in 3D computer vision, involves predicting and filling the missing regions of scanned or partially observed objects.
no code implementations • 25 Nov 2024 • Kunyi Li, Michael Niemeyer, Zeyu Chen, Nassir Navab, Federico Tombari
By establishing a differentiable connection between the explicit Gaussians and the implicit SDF, our approach enables high-quality surface reconstruction and rendering.
no code implementations • 23 Nov 2024 • Ming Hu, Kun Yuan, Yaling Shen, Feilong Tang, Xiaohao Xu, Lin Zhou, Wei Li, Ying Chen, Zhongxing Xu, Zelin Peng, Siyuan Yan, Vinkle Srivastav, Diping Song, Tianbin Li, Danli Shi, Jin Ye, Nicolas Padoy, Nassir Navab, Junjun He, ZongYuan Ge
Surgical practice involves complex visual interpretation, procedural skills, and advanced medical knowledge, making surgical vision-language pretraining (VLP) particularly challenging due to this complexity and the limited availability of annotated data.
no code implementations • 6 Nov 2024 • Yuan Bi, Lucie Huang, Ricarda Clarenbach, Reza Ghotbi, Angelos Karlas, Nassir Navab, Zhongliang Jiang
To address these issues, we propose a novel unsupervised anomaly detection framework based on a diffusion model that incorporates a synthetic anomaly (Synomaly) noise function and a multi-stage diffusion process.
1 code implementation • 30 Oct 2024 • HyunJun Jung, Weihang Li, Shun-Cheng Wu, William Bittner, Nikolas Brasch, Jifei Song, Eduardo Pérez-Pellitero, Zhensong Zhang, Arthur Moreau, Nassir Navab, Benjamin Busam
However, using these datasets to evaluate dense geometry tasks, such as depth rendering, can be problematic as the meshes of the dataset are often incomplete and may produce wrong ground truth to evaluate the details.
no code implementations • 28 Oct 2024 • Zhihao Zhao, Junjie Yang, Shahrooz Faghihroohi, Yinzheng Zhao, Daniel Zapp, Kai Huang, Nassir Navab, M. Ali Nasseri
Subsequently, a time-aligned mask is employed to select a specific year for image generation.
1 code implementation • 28 Oct 2024 • Zhihao Zhao, Shahrooz Faghihroohi, Yinzheng Zhao, Junjie Yang, Shipeng Zhong, Kai Huang, Nassir Navab, Boyang Li, M. Ali Nasseri
Methods: To address these issues, we propose a novel network (KaLDeX) for vascular segmentation leveraging a Kalman filter based linear deformable cross attention (LDCA) module, integrated within a UNet++ framework.
no code implementations • 23 Oct 2024 • Yousef Yeganeh, Rachmadio Lazuardi, Amir Shamseddin, Emine Dari, Yash Thirani, Nassir Navab, Azade Farshad
The results of our experiments demonstrate high-fidelity video generation for laparoscopy procedures, which enables various applications in SDS.
no code implementations • 10 Oct 2024 • Markus Herb, Nassir Navab, Federico Tombari
Autonomous vehicles demand detailed maps to maneuver reliably through traffic, which need to be kept up-to-date to ensure a safe operation.
2 code implementations • 30 Sep 2024 • Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas Padoy
Surgical video-language pretraining (VLP) faces unique challenges due to the knowledge domain gap and the scarcity of multi-modal data.
1 code implementation • 20 Sep 2024 • Sven Lüpke, Yousef Yeganeh, Ehsan Adeli, Nassir Navab, Azade Farshad
Our approach utilizes latent diffusion models and a two-step generative process: first, unobserved physical tissue property maps are synthesized using a latent diffusion model, and then these maps are combined with a physical signal model to generate the final MRI scan.
no code implementations • 19 Sep 2024 • Zhihao Zhao, Yinzheng Zhao, Junjie Yang, Kai Huang, Nassir Navab, M. Ali Nasseri
To better optimize the coordinate positions of deformable convolution, we employ the Kalman filter to enhance the perception of vascular structures in linear deformable convolution.
no code implementations • 18 Sep 2024 • Maximilian Fehrentz, Mohammad Farid Azampour, Reuben Dorent, Hassan Rasheed, Colin Galvin, Alexandra Golby, William M. Wells, Sarah Frisken, Nassir Navab, Nazim Haouchine
We present in this paper a novel approach for 3D/2D intraoperative registration during neurosurgery via cross-modal inverse neural rendering.
no code implementations • 12 Sep 2024 • Çağhan Köksal, Ghazal Ghazaei, Nassir Navab
Considering the profusion of surgical videos obtained through standardized surgical workflows, we propose an annotation-efficient framework for the semantic segmentation of surgical scenes.
no code implementations • 10 Sep 2024 • David Bani-Harouni, Nassir Navab, Matthias Keicher
Large Language Models (LLMs) have the potential to alleviate some pressure from these clinicians by providing insights that can help them in their decision-making.
no code implementations • 13 Aug 2024 • Gizem Mert, Ario Sadafi, Raheleh Salehi, Nassir Navab, Carsten Marr
Biomedical imaging and RNA sequencing with single-cell resolution improves our understanding of white blood cell diseases like leukemia.
1 code implementation • 7 Aug 2024 • Felix Duelmer, Walter Simson, Mohammad Farid Azampour, Magdalena Wysocki, Angelos Karlas, Nassir Navab
Conventionally, deconvolution techniques attempt to rectify the imaging system's dependent PSF, working directly on the radio-frequency (RF) data.
1 code implementation • 4 Aug 2024 • Oleksandra Tmenova, Yordanka Velikova, Mahdi Saleh, Nassir Navab
Ultrasound imaging is challenging to interpret due to non-uniform intensities, low contrast, and inherent artifacts, necessitating extensive training for non-specialists.
1 code implementation • 2 Aug 2024 • Matan Atad, David Schinz, Hendrik Moeller, Robert Graf, Benedikt Wiestler, Daniel Rueckert, Nassir Navab, Jan S. Kirschke, Matthias Keicher
Counterfactual explanations (CEs) aim to enhance the interpretability of machine learning models by illustrating how alterations in input features would affect the resulting predictions.
no code implementations • 29 Jul 2024 • Çağhan Köksal, Ghazal Ghazaei, Felix Holm, Azade Farshad, Nassir Navab
Graph-based holistic scene representations facilitate surgical workflow understanding and have recently demonstrated significant success.
1 code implementation • 11 Jul 2024 • Xin You, Yixin Lou, Minghui Zhang, Jie Yang, Nassir Navab, Yun Gu
Specifically, a contour generation network is proposed based on Structural Low-Rank Descriptors for shape consistency, termed SLoRD.
1 code implementation • 7 Jul 2024 • Marina Domínguez, Yordanka Velikova, Nassir Navab, Mohammad Farid Azampour
However, when it comes to ultrasound (US) imaging, the authenticity of generated data often diminishes due to the oversight of ultrasound physics.
1 code implementation • 14 Jun 2024 • Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari
Our objective is to leverage a differentiable radiance field \eg NeRF to reconstruct detailed 3D surfaces in addition to producing the standard novel view renderings.
1 code implementation • 6 Jun 2024 • Zhongliang Jiang, Yunfeng Kang, Yuan Bi, Xuesong Li, Chenyang Li, Nassir Navab
Then, a dense skeleton graph-based non-rigid registration is presented to map the intercostal scanning path from a generic template to individual patients.
no code implementations • 2 Jun 2024 • Jun Li, Tongkun Su, Baoliang Zhao, Faqin Lv, Qiong Wang, Nassir Navab, Ying Hu, Zhongliang Jiang
In this work, we propose a novel framework for automatic ultrasound report generation, leveraging a combination of unsupervised and supervised learning methods to aid the report generation process.
2 code implementations • 16 May 2024 • Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas Padoy
By disentangling embedding spaces of different hierarchical levels, the learned multi-modal representations encode short-term and long-term surgical concepts in the same model.
1 code implementation • 10 May 2024 • Hartmut Häntze, Lina Xu, Christian J. Mertens, Felix J. Dorfner, Leonhard Donle, Felix Busch, Avan Kader, Sebastian Ziegelmayer, Nadine Bayerl, Nassir Navab, Daniel Rueckert, Julia Schnabel, Hugo JWL Aerts, Daniel Truhn, Fabian Bamberg, Jakob Weiß, Christopher L. Schlett, Steffen Ringhof, Thoralf Niendorf, Tobias Pischon, Hans-Ulrich Kauczor, Tobias Nonnenmacher, Thomas Kröncke, Henry Völzke, Jeanette Schulz-Menger, Klaus Maier-Hein, Mathias Prokop, Bram van Ginneken, Alessa Hering, Marcus R. Makowski, Lisa C. Adams, Keno K. Bressem
A human-in-the-loop annotation workflow was employed, leveraging cross-modality transfer learning from an existing CT segmentation model to segment 40 anatomical structures.
1 code implementation • 2 May 2024 • Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam
The scheme ensures that the denoising processes are influenced by a holistic understanding of the scene graph, facilitating the generation of globally coherent scenes.
no code implementations • 30 Apr 2024 • Kristina Mach, Hessam Roodaki, Michael Sommersperger, Nassir Navab
This paper presents an innovative approach to intraoperative Optical Coherence Tomography (iOCT) image segmentation in ophthalmic surgery, leveraging statistical analysis of speckle patterns to incorporate statistical pathology-specific prior knowledge.
no code implementations • 15 Apr 2024 • Yuan Bi, Cheng Qian, Zhicheng Zhang, Nassir Navab, Zhongliang Jiang
Ultrasound (US) has been widely used in daily clinical practice for screening internal organs and guiding interventions.
no code implementations • 12 Apr 2024 • Baochang Zhang, Mai Bui, Cheng Wang, Felix Bourier, Heribert Schunkert, Nassir Navab
For this purpose, real-time and accurate guidewire segmentation and tracking can enhance the visualization of guidewires and provide visual feedback for physicians during the intervention as well as for robot-assisted interventions.
1 code implementation • 11 Apr 2024 • Miruna-Alexandra Gafencu, Yordanka Velikova, Mahdi Saleh, Tamas Ungi, Nassir Navab, Thomas Wendler, Mohammad Farid Azampour
Purpose: Ultrasound (US) imaging, while advantageous for its radiation-free nature, is challenging to interpret due to only partially visible organs and a lack of complete 3D information.
1 code implementation • 10 Apr 2024 • Ege Özsoy, Chantal Pellegrini, Matthias Keicher, Nassir Navab
This demonstrates ORacle's potential to significantly enhance the scalability and affordability of OR domain modeling and opens a pathway for future advancements in surgical data science.
Ranked #1 on
Scene Graph Generation
on 4D-OR
no code implementations • 8 Apr 2024 • Michael Deutges, Ario Sadafi, Nassir Navab, Carsten Marr
We test our approach on three datasets of white blood cell images and show that we achieve competitive performance compared to conventional methods.
no code implementations • 21 Mar 2024 • Alex Ranne, Liming Kuang, Yordanka Velikova, Nassir Navab, Ferdinando Rodriguez y Baena
In minimally invasive endovascular procedures, contrast-enhanced angiography remains the most robust imaging technique.
1 code implementation • 21 Mar 2024 • Dianye Huang, Chenyang Li, Angelos Karlas, Xiangyu Chu, K. W. Samuel Au, Nassir Navab, Zhongliang Jiang
The results obtained on porcine samples demonstrate that VibNet effectively detects needles even when their visibility is severely reduced, with a tip error of $1. 61\pm1. 56~mm$ compared to $8. 15\pm9. 98~mm$ for UNet and $6. 63\pm7. 58~mm$ for WNet, and a needle direction error of $1. 64\pm1. 86^{\circ}$ compared to $9. 29\pm15. 30^{\circ}$ for UNet and $8. 54\pm17. 92^{\circ}$ for WNet.
no code implementations • 18 Mar 2024 • Florian Philipp Stilz, Mert Asim Karaoglu, Felix Tristram, Nassir Navab, Benjamin Busam, Alexander Ladikos
However, the setup has been restricted to a static endoscope, limited deformation, or required an external tracking device to retrieve camera pose information of the endoscopic camera.
no code implementations • CVPR 2024 • Junwen Huang, Hao Yu, Kuan-Ting Yu, Nassir Navab, Slobodan Ilic, Benjamin Busam
MatchU is a generic approach that fuses 2D texture and 3D geometric cues for 6D pose prediction of unseen objects.
no code implementations • 5 Feb 2024 • Mahdi Saleh, Michael Sommersperger, Nassir Navab, Federico Tombari
We also incorporate cross-attention mechanisms to capture the interplay between the objects.
1 code implementation • 4 Jan 2024 • Dianye Huang, Chenguang Yang, Mingchuan Zhou, Angelos Karlas, Nassir Navab, Zhongliang Jiang
To ensure the biometric measurements obtained in different examinations are comparable, the 6D scanning path is determined in a coarse-to-fine manner using both an external RGBD camera and US images.
1 code implementation • 1 Jan 2024 • Razieh Rezaei, Alireza Dizaji, Ashkan Khakzar, Anees Kazi, Nassir Navab, Daniel Rueckert
In this work, we assess attribution methods from a perspective not previously explored in the graph domain: retraining.
no code implementations • CVPR 2024 • HyunJun Jung, Shun-Cheng Wu, Patrick Ruhkamp, Guangyao Zhai, Hannah Schieber, Giulia Rizzoli, Pengyuan Wang, Hongcheng Zhao, Lorenzo Garattoni, Sven Meier, Daniel Roth, Nassir Navab, Benjamin Busam
Estimating 6D object poses is a major challenge in 3D computer vision.
no code implementations • 22 Dec 2023 • HyunJun Jung, Nikolas Brasch, Jifei Song, Eduardo Perez-Pellitero, Yiren Zhou, Zhihao LI, Nassir Navab, Benjamin Busam
ParDy-Human introduces parameter-driven dynamics into 3D Gaussian Splatting where 3D Gaussians are deformed by a human pose model to animate the avatar.
2 code implementations • 15 Dec 2023 • Kun Yuan, Manasi Kattel, Joel L. Lavanchy, Nassir Navab, Vinkle Srivastav, Nicolas Padoy
We highlight that the primary limitation in the current surgical VQA systems is the lack of scene knowledge to answer complex queries.
no code implementations • CVPR 2024 • Lennart Bastian, Yizheng Xie, Nassir Navab, Zorah Lähner
Non-isometric shape correspondence remains a fundamental challenge in computer vision.
no code implementations • 4 Dec 2023 • Felix Tristram, Stefano Gasperini, Nassir Navab, Federico Tombari
This introduces additional multi-view constraints and allows the second model to converge to a better solution.
no code implementations • 2 Dec 2023 • Patrick Ruhkamp, Daoyi Gao, Nassir Navab, Benjamin Busam
The novel training paradigm comprises 1) a physical model to extract geometric information of polarized light, 2) a teacher-student knowledge distillation scheme and 3) a self-supervised loss formulation through differentiable rendering and an invertible physical constraint.
1 code implementation • 30 Nov 2023 • Chantal Pellegrini, Ege Özsoy, Benjamin Busam, Nassir Navab, Matthias Keicher
Conversational AI tools that can generate and discuss clinically correct radiology reports for a given medical image have the potential to transform radiology.
no code implementations • 30 Nov 2023 • Kunyi Li, Michael Niemeyer, Nassir Navab, Federico Tombari
In this work, we introduce DNS SLAM, a novel neural RGB-D semantic SLAM approach featuring a hybrid representation.
no code implementations • 20 Nov 2023 • Mayar Lotfy Mostafa, Anna Alperovich, Tommaso Giannantonio, Bjorn Barz, Xiaohan Zhang, Felix Holm, Nassir Navab, Felix Boehm, Carolin Schwamborn, Thomas K. Hoffmann, Patrick J. Schuler
Despite the limited dataset, the GNN-based model significantly outperforms context-agnostic approaches, accurately distinguishing between healthy and tumor tissues, even in images from previously unseen patients.
1 code implementation • CVPR 2024 • Yamei Chen, Yan Di, Guangyao Zhai, Fabian Manhardt, Chenyangguang Zhang, Ruida Zhang, Federico Tombari, Nassir Navab, Benjamin Busam
Leveraging the advantage of DINOv2 in providing SE(3)-consistent semantic features, we hierarchically extract two types of SE(3)-invariant geometric features to further encapsulate local-to-global object-specific information.
no code implementations • 15 Nov 2023 • Junjie Yang, Zhihao Zhao, Siyuan Shen, Daniel Zapp, Mathias Maier, Kai Huang, Nassir Navab, M. Ali Nasseri
Robotic ophthalmic surgery is an emerging technology to facilitate high-precision interventions such as retina penetration in subretinal injection and removal of floating tissues in retinal detachment depending on the input imaging modalities such as microscopy and intraoperative OCT (iOCT).
no code implementations • 9 Nov 2023 • Sen Wang, Qing Cheng, Stefano Gasperini, Wei zhang, Shun-Cheng Wu, Niclas Zeller, Daniel Cremers, Nassir Navab
The generation of high-fidelity view synthesis is essential for robotic navigation and interaction but remains challenging, particularly in indoor environments and real-time scenarios.
no code implementations • 3 Nov 2023 • Pavel Jahoda, Azade Farshad, Yousef Yeganeh, Ehsan Adeli, Nassir Navab
We take advantage of the outer part of the masked area as they have a direct correlation with the context of the scene.
no code implementations • 25 Sep 2023 • Felix Holm, Ghazal Ghazaei, Tobias Czempiel, Ege Özsoy, Stefan Saur, Nassir Navab
Surgical videos captured from microscopic or endoscopic imaging devices are rich but complex sources of information, depicting different tools and anatomical structures utilized during an extended amount of time.
no code implementations • 25 Sep 2023 • Alex Ranne, Yordanka Velikova, Nassir Navab, Ferdinando Rodriguez y Baena
To date, endovascular surgeries are performed using the golden standard of Fluoroscopy, which uses ionising radiation to visualise catheters and vasculature.
no code implementations • 21 Sep 2023 • Guangyao Zhai, Xiaoni Cai, Dianye Huang, Yan Di, Fabian Manhardt, Federico Tombari, Nassir Navab, Benjamin Busam
In this paper, we present SG-Bot, a novel rearrangement framework that utilizes a coarse-to-fine scheme with a scene graph as the scene representation.
no code implementations • 18 Sep 2023 • Mert Asim Karaoglu, Viktoria Markova, Nassir Navab, Benjamin Busam, Alexander Ladikos
While most classical methods achieve rotation-equivariant detection and invariant description by design, many learning-based approaches learn to be robust only up to a certain degree.
1 code implementation • 16 Sep 2023 • Nicolas Schischka, Hannah Schieber, Mert Asim Karaoglu, Melih Görgülü, Florian Grötzner, Alexander Ladikos, Daniel Roth, Nassir Navab, Benjamin Busam
To address this challenge, we propose Dynamic Motion-Aware Fast and Robust Camera Localization for Dynamic Neural Radiance Fields (DynaMoN).
no code implementations • ICCV 2023 • Zhiying Leng, Shun-Cheng Wu, Mahdi Saleh, Antonio Montanaro, Hao Yu, Yin Wang, Nassir Navab, Xiaohui Liang, Federico Tombari
In this work, we propose the first precise hand-object reconstruction method in hyperbolic space, namely Dynamic Hyperbolic Attention Network (DHANet), which leverages intrinsic properties of hyperbolic space to learn representative features.
no code implementations • 5 Sep 2023 • Yu Liu, Gesine Muller, Nassir Navab, Carsten Marr, Jan Huisken, Tingying Peng
Light-sheet fluorescence microscopy (LSFM), a planar illumination technique that enables high-resolution imaging of samples, experiences defocused image quality caused by light scattering when photons propagate through thick tissues.
1 code implementation • 1 Sep 2023 • Lennart Bastian, Vincent Bürgin, Ha Young Kim, Alexander Baumann, Benjamin Busam, Mahdi Saleh, Nassir Navab
We demonstrate that our multi-modal registration framework can localize images on the 3D surface topology of a patient-specific organ and the mean shape of an SSM.
no code implementations • 29 Aug 2023 • Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari
We conduct extensive experiments across a variety of scenarios on data from KITTI, Waymo, and CrashD for 3D object detection, and on data from SemanticKITTI, Waymo, and nuScenes for 3D semantic segmentation.
no code implementations • 24 Aug 2023 • Ario Sadafi, Raheleh Salehi, Armin Gruber, Sayedali Shetab Boushehri, Pascal Giehr, Nassir Navab, Carsten Marr
Here, we propose a rehearsal-based continual learning approach for class incremental and domain incremental scenarios in white blood cell classification.
no code implementations • 24 Aug 2023 • Ario Sadafi, Matthias Hehr, Nassir Navab, Carsten Marr
To that end, we train multiple MIL models using different levels of sex imbalance in the training set and excluding certain age groups.
no code implementations • 21 Aug 2023 • HyunJun Jung, Patrick Ruhkamp, Nassir Navab, Benjamin Busam
This paper addresses the limitations of current datasets for 3D vision tasks in terms of accuracy, size, realism, and suitable imaging modalities for photometrically challenging objects.
no code implementations • 21 Aug 2023 • Patrick Ruhkamp, Daoyi Gao, HyunJun Jung, Nassir Navab, Benjamin Busam
6D pose estimation pipelines that rely on RGB-only or RGB-D data show limitations for photometrically challenging objects with e. g. textureless surfaces, reflections or transparency.
no code implementations • 18 Aug 2023 • Vanessa Gonzalez Duque, Leonhard Zirus, Yordanka Velikova, Nassir Navab, Diana Mateus
Therefore, we propose to give the confidence maps as additional information to the networks.
no code implementations • ICCV 2023 • Stefano Gasperini, Nils Morbitzer, HyunJun Jung, Nassir Navab, Federico Tombari
While state-of-the-art monocular depth estimation approaches achieve impressive results in ideal settings, they are highly unreliable under challenging illumination and weather conditions, such as at nighttime or in the presence of rain.
no code implementations • 14 Aug 2023 • Indu Joshi, Priyank Upadhya, Gaurav Kumar Nayak, Peter Schüffler, Nassir Navab
Leveraging this, we introduce DISBELIEVE, a local model poisoning attack that creates malicious parameters or gradients such that their distance to benign clients' parameters or gradients is low respectively but at the same time their adverse effect on the global model's performance is high.
no code implementations • 7 Aug 2023 • Ardit Ramadani, Peter Ewert, Heribert Schunkert, Nassir Navab
Accurate catheter tracking is crucial during minimally invasive endovascular procedures (MIEP), and electromagnetic (EM) tracking is a widely used technology that serves this purpose.
1 code implementation • 7 Aug 2023 • Zhongliang Jiang, Yue Zhou, Dongliang Cao, Nassir Navab
The recovery of morphologically accurate anatomical images from deformed ones is challenging in ultrasound (US) image acquisition, but crucial to accurate and consistent diagnosis, particularly in the emerging field of computer-assisted diagnosis.
1 code implementation • 29 Jul 2023 • Yordanka Velikova, Mohammad Farid Azampour, Walter Simson, Vanessa Gonzalez Duque, Nassir Navab
Anatomical segmentation of organs in ultrasound images is essential to many clinical applications, particularly for diagnosis and monitoring.
2 code implementations • 27 Jul 2023 • Kun Yuan, Vinkle Srivastav, Tong Yu, Joel L. Lavanchy, Jacques Marescaux, Pietro Mascagni, Nassir Navab, Nicolas Padoy
We then present a novel method, SurgVLP - Surgical Vision Language Pre-training, for multi-modal representation learning.
1 code implementation • 26 Jul 2023 • Lennart Bastian, Tony Danjun Wang, Tobias Czempiel, Benjamin Busam, Nassir Navab
Methods: RGB and depth images from multiple cameras are fused into a 3D point cloud representation of the scene.
1 code implementation • 19 Jul 2023 • Matteo Ronchetti, Wolfgang Wein, Nassir Navab, Oliver Zettinig, Raphael Prevost
Our method is several orders of magnitude faster than local patch-based metrics and can be directly applied in clinical settings by replacing the similarity measure with the proposed one.
1 code implementation • 11 Jul 2023 • Chantal Pellegrini, Matthias Keicher, Ege Özsoy, Nassir Navab
However, there is limited research on automating structured reporting, and no public benchmark is available for evaluating and comparing different methods.
Ranked #1 on
Structured Report Generation
on Rad-ReStruct
1 code implementation • 7 Jul 2023 • Dianye Huang, Yuan Bi, Nassir Navab, Zhongliang Jiang
To validate the proposed robotic US system for imaging arteries, experiments are carried out on volunteers' carotid and radial arteries.
1 code implementation • 7 Jul 2023 • Zhongliang Jiang, Chenyang Li, Xuesong Li, Nassir Navab
To address this challenge, a graph-based non-rigid registration is proposed to enable transferring planned paths from the atlas to the current setup by explicitly considering subcutaneous bone surface features instead of the skin surface.
1 code implementation • 7 Jul 2023 • Zhongliang Jiang, Yuan Bi, Mingchuan Zhou, Ying Hu, Michael Burke, Nassir Navab
The results demonstrated that the proposed advanced framework can robustly work on a variety of seen and unseen phantoms as well as in-vivo human carotid data.
1 code implementation • NeurIPS 2023 • Guangyao Zhai, Evin Pınar Örnek, Shun-Cheng Wu, Yan Di, Federico Tombari, Nassir Navab, Benjamin Busam
The generated scenes can be manipulated by editing the input scene graph and sampling the noise in the diffusion model.
no code implementations • 21 May 2023 • Mehdi Astaraki, Francesca De Benetti, Yousef Yeganeh, Iuliana Toma-Dasu, Örjan Smedby, Chunliang Wang, Nassir Navab, Thomas Wendler
This work intends to, first, propose a robust inpainting model to learn the details of healthy anatomies and reconstruct high-resolution images by preserving anatomical constraints.
no code implementations • 17 May 2023 • Francesca De Benetti, Walter Simson, Magdalini Paschali, Hasan Sari, Axel Romiger, Kuangyu Shi, Nassir Navab, Thomas Wendler
Dynamic positron emission tomography imaging (dPET) provides temporally resolved images of a tracer enabling a quantitative measure of physiological processes.
1 code implementation • 15 May 2023 • Zhongliang Jiang, Felix Duelmer, Nassir Navab
The experimental results demonstrate that the proposed approach with the re-identification process can significantly improve the accuracy and robustness of the segmentation results (dice score: from 0:54 to 0:86; intersection over union: from 0:47 to 0:78).
no code implementations • 14 May 2023 • Zhongliang Jiang, Xuesong Li, Chenyu Zhang, Yuan Bi, Walter Stechele, Nassir Navab
Autonomous ultrasound (US) scanning has attracted increased attention, and it has been seen as a potential solution to overcome the limitations of conventional US examinations, such as inter-operator variations.
no code implementations • 5 May 2023 • Jonas Hein, Nicola Cavalcanti, Daniel Suter, Lukas Zingg, Fabio Carrillo, Lilian Calvet, Mazda Farshad, Marc Pollefeys, Nassir Navab, Philipp Fürnstahl
Third, we evaluate three state-of-the-art single-view and multi-view methods for the task of 6DoF pose estimation of surgical instruments and analyze the influence of camera configurations, training data, and occlusions on the pose accuracy and generalization ability.
no code implementations • CVPR 2023 • Shun-Cheng Wu, Keisuke Tateno, Nassir Navab, Federico Tombari
Our method consists of a novel incremental entity estimation pipeline and a scene graph prediction network.
no code implementations • 28 Apr 2023 • Yousef Yeganeh, Azade Farshad, Goktug Guevercin, Amr Abu-zer, Rui Xiao, Yongjian Tang, Ehsan Adeli, Nassir Navab
Although the preservation of shape continuity and physiological anatomy is a natural assumption in the segmentation of medical images, it is often neglected by deep learning methods that mostly aim for the statistical modeling of input data as pixels rather than interconnected structures.
no code implementations • 28 Apr 2023 • Yousef Yeganeh, Azade Farshad, Peter Weinberger, Seyed-Ahmad Ahmadi, Ehsan Adeli, Nassir Navab
Although purely transformer-based architectures showed promising performance in many computer vision tasks, many hybrid models consisting of CNN and transformer blocks are introduced to fit more specialized tasks.
no code implementations • 28 Apr 2023 • Azade Farshad, Yousef Yeganeh, Yu Chi, Chengzhi Shen, Björn Ommer, Nassir Navab
To address this limitation, we propose a novel guidance approach for the sampling process in the diffusion model that leverages bounding box and segmentation map information at inference time without additional training data.
1 code implementation • 15 Apr 2023 • Lennart Bastian, Alexander Baumann, Emily Hoppe, Vincent Bürgin, Ha Young Kim, Mahdi Saleh, Benjamin Busam, Nassir Navab
Statistical shape models (SSMs) are an established way to represent the anatomy of a population with various clinically relevant applications.
no code implementations • 30 Mar 2023 • Dominik Batić, Felix Holm, Ege Özsoy, Tobias Czempiel, Nassir Navab
In this work, we investigate the need for endoscopy domain-specific pretraining based on downstream objectives.
1 code implementation • CVPR 2023 • HyunJun Jung, Patrick Ruhkamp, Guangyao Zhai, Nikolas Brasch, Yitong Li, Yannick Verdie, Jifei Song, Yiren Zhou, Anil Armagan, Slobodan Ilic, Ales Leonardis, Nassir Navab, Benjamin Busam
Learning-based methods to solve dense 3D vision problems typically train on 3D sensor data.
1 code implementation • 24 Mar 2023 • Yiheng Xiong, Jingsong Liu, Kamilia Zaripova, Sahand Sharifzadeh, Matthias Keicher, Nassir Navab
The extraction of structured clinical information from free-text radiology reports in the form of radiology graphs has been demonstrated to be a valuable approach for evaluating the clinical correctness of report-generation methods.
1 code implementation • 23 Mar 2023 • Chantal Pellegrini, Matthias Keicher, Ege Özsoy, Petra Jiraskova, Rickmer Braren, Nassir Navab
Automated diagnosis prediction from medical images is a valuable resource to support clinical decision-making.
1 code implementation • 23 Mar 2023 • Ege Özsoy, Tobias Czempiel, Felix Holm, Chantal Pellegrini, Nassir Navab
The holistic representation of surgical scenes as semantic scene graphs (SGG), where entities are represented as nodes and relations between them as edges, is a promising direction for fine-grained semantic OR understanding.
Ranked #4 on
Scene Graph Generation
on 4D-OR
2 code implementations • 22 Mar 2023 • Yuan Bi, Zhongliang Jiang, Ricarda Clarenbach, Reza Ghotbi, Angelos Karlas, Nassir Navab
We validate the generalizability of the proposed domain-independent segmentation approach on several datasets with varying parameters and machines.
no code implementations • 21 Mar 2023 • Matthias Keicher, Matan Atad, David Schinz, Alexandra S. Gersing, Sarah C. Foreman, Sophia S. Goller, Juergen Weissinger, Jon Rischewski, Anna-Sophia Dietrich, Benedikt Wiestler, Jan S. Kirschke, Nassir Navab
We then regress the severity of the fracture as a function of the distance to this hyperplane, calibrating the results to the Genant scale.
1 code implementation • 20 Mar 2023 • Ege Özsoy, Felix Holm, Mahdi Saleh, Tobias Czempiel, Chantal Pellegrini, Nassir Navab, Benjamin Busam
Scene Graph Generation (SGG) is a visual understanding task, aiming to describe a scene as a graph of entities and their relationships with each other.
Ranked #3 on
Scene Graph Generation
on 4D-OR
no code implementations • 15 Mar 2023 • Artem Savkin, Rachid Ellouze, Nassir Navab, Federico Tombari
Image synthesis driven by computer graphics achieved recently a remarkable realism, yet synthetic image data generated this way reveals a significant domain gap with respect to real-world data.
1 code implementation • 15 Mar 2023 • Ario Sadafi, Oleksandra Adonkina, Ashkan Khakzar, Peter Lienemann, Rudolf Matthias Hehr, Daniel Rueckert, Nassir Navab, Carsten Marr
Explainability is a key requirement for computer-aided diagnosis systems in clinical decision-making.
no code implementations • 2 Mar 2023 • Ario Sadafi, Nassir Navab, Carsten Marr
Querying the expert to annotate regions of interest in a WSI guides the formation of high-attention regions for MIL.
no code implementations • 2 Mar 2023 • Daniel Sens, Ario Sadafi, Francesco Paolo Casale, Nassir Navab, Carsten Marr
Recent MIL approaches produce highly informative bag level representations by utilizing the transformer architecture's ability to model the dependencies between instances.
no code implementations • CVPR 2023 • Dekai Zhu, Guangyao Zhai, Yan Di, Fabian Manhardt, Hendrik Berkemeyer, Tuan Tran, Nassir Navab, Federico Tombari, Benjamin Busam
Reliable multi-agent trajectory prediction is crucial for the safe planning and control of autonomous systems.
2 code implementations • 13 Feb 2023 • Chinedu Innocent Nwoye, Tong Yu, Saurav Sharma, Aditya Murali, Deepak Alapatt, Armine Vardazaryan, Kun Yuan, Jonas Hajek, Wolfgang Reiter, Amine Yamlahi, Finn-Henri Smidt, Xiaoyang Zou, Guoyan Zheng, Bruno Oliveira, Helena R. Torres, Satoshi Kondo, Satoshi Kasai, Felix Holm, Ege Özsoy, Shuangchun Gui, Han Li, Sista Raviteja, Rachana Sathish, Pranav Poudel, Binod Bhattarai, Ziheng Wang, Guo Rui, Melanie Schellenberg, João L. Vilaça, Tobias Czempiel, Zhenkun Wang, Debdoot Sheet, Shrawan Kumar Thapa, Max Berniker, Patrick Godau, Pedro Morais, Sudarshan Regmi, Thuy Nuong Tran, Jaime Fonseca, Jan-Hinrich Nölke, Estevão Lima, Eduard Vazquez, Lena Maier-Hein, Nassir Navab, Pietro Mascagni, Barbara Seeliger, Cristians Gonzalez, Didier Mutter, Nicolas Padoy
This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection.
Ranked #1 on
Action Triplet Detection
on CholecT50 (Challenge)
no code implementations • 6 Feb 2023 • Walter A. Simson, Magdalini Paschali, Vasiliki Sideri-Lampretsa, Nassir Navab, Jeremy J. Dahl
However, the various types of breast tissue, such as glandular, fat, and lesions, differ in sound speed.
1 code implementation • 2 Feb 2023 • Masahiro Oda, Kazuhiro Furukawa, Nassir Navab, Kensaku MORI
Kinematic data of a colonoscope and the colon, including positions and directions of their centerlines, are obtained using electromagnetic and depth sensors.
no code implementations • 31 Jan 2023 • Artem Savkin, Yida Wang, Sebastian Wirkert, Nassir Navab, Federico Tombar
This in turn enables our method to employ a one-stage upsampling paradigm without the need for coarse and fine reconstruction.
1 code implementation • 25 Jan 2023 • Magdalena Wysocki, Mohammad Farid Azampour, Christine Eilers, Benjamin Busam, Mehrdad Salehi, Nassir Navab
In our work, we discuss direction-dependent changes in the scene and show that a physics-inspired rendering improves the fidelity of US image synthesis.
no code implementations • 17 Jan 2023 • Shervin Dehghani, Michael Sommersperger, Peiyao Zhang, Alejandro Martin-Gomez, Benjamin Busam, Peter Gehlbach, Nassir Navab, M. Ali Nasseri, Iulian Iordachita
In this work, we propose a framework for autonomous robotic navigation for subretinal injection, based on intelligent real-time processing of iOCT volumes.
no code implementations • CVPR 2023 • Hanzhi Chen, Fabian Manhardt, Nassir Navab, Benjamin Busam
In this paper, we introduce neural texture learning for 6D object pose estimation from synthetic data and a few unlabelled real images.
1 code implementation • 22 Dec 2022 • Evin Pınar Örnek, Aravindhan K Krishnan, Shreekant Gayaka, Cheng-Hao Kuo, Arnie Sen, Nassir Navab, Federico Tombari
We introduce a zero-shot split for Tabletop Objects Dataset (TOD-Z) to enable this study and present a method that uses annotated objects to learn the ``objectness'' of pixels and generalize to unseen object categories in cluttered indoor environments.
1 code implementation • 20 Dec 2022 • HyunJun Jung, Guangyao Zhai, Shun-Cheng Wu, Patrick Ruhkamp, Hannah Schieber, Giulia Rizzoli, Pengyuan Wang, Hongcheng Zhao, Lorenzo Garattoni, Sven Meier, Daniel Roth, Nassir Navab, Benjamin Busam
Estimating 6D object poses is a major challenge in 3D computer vision.
no code implementations • 10 Nov 2022 • Azade Farshad, Yousef Yeganeh, Helisa Dhamo, Federico Tombari, Nassir Navab
Graph representation of objects and their relations in a scene, known as a scene graph, provides a precise and discernible interface to manipulate a scene by modifying the nodes or the edges in the graph.
no code implementations • 5 Nov 2022 • Mane Margaryan, Matthias Seibold, Indu Joshi, Mazda Farshad, Philipp Fürnstahl, Nassir Navab
In contrast to previously proposed fully convolutional models, the proposed model implements residual Squeeze and Excitation modules in the generator architecture.
no code implementations • 12 Oct 2022 • Agnieszka Tomczak, Aarushi Gupta, Slobodan Ilic, Nassir Navab, Shadi Albarqouni
The purpose of this work is to investigate the hypothesis that we can predict image quality based on its latent representation in the GANs bottleneck.
no code implementations • 26 Sep 2022 • Guangyao Zhai, Dianye Huang, Shun-Cheng Wu, HyunJun Jung, Yan Di, Fabian Manhardt, Federico Tombari, Nassir Navab, Benjamin Busam
6-DoF robotic grasping is a long-lasting but unsolved problem.
1 code implementation • ICCV 2023 • Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari
By doing so, for the first time in panoptic segmentation with unknown objects, our U3HS is trained without unknown categories, reducing assumptions and leaving the settings as unconstrained as in real-life scenarios.
Ranked #3 on
Object Detection
on OoDIS
1 code implementation • 10 Aug 2022 • Zhongliang Jiang, Yuan Gao, Le Xie, Nassir Navab
Robotic ultrasound (US) imaging aims at overcoming some of the limitations of free-hand US examinations, e. g. difficulty in guaranteeing intra- and inter-operator repeatability.
no code implementations • 31 Jul 2022 • Guangyao Zhai, Yu Zheng, Ziwei Xu, Xin Kong, Yong liu, Benjamin Busam, Yi Ren, Nassir Navab, Zhengyou Zhang
In this paper, we introduce DA$^2$, the first large-scale dual-arm dexterity-aware dataset for the generation of optimal bimanual grasping pairs for arbitrary large objects.
1 code implementation • 31 Jul 2022 • Rüdiger Göbl, Christoph Hennersperger, Nassir Navab
To enable this, we make use of realistic ultrasound simulation techniques that allow for instantiation of several independent speckle realizations that represent the exact same tissue, thus allowing for the application of image reconstruction techniques that work with pairs of differently corrupted data.
no code implementations • 31 Jul 2022 • Mahdi Saleh, Yige Wang, Nassir Navab, Benjamin Busam, Federico Tombari
The proposed hierarchical model achieves state-of-the-art shape classification in mean accuracy and yields results on par with the previous segmentation methods while requiring significantly fewer computations.
no code implementations • 28 Jul 2022 • Dominik Jüstel, Hedwig Irl, Florian Hinterwimmer, Christoph Dehner, Walter Simson, Nassir Navab, Gerhard Schneider, Vasilis Ntziachristos
Various morphological and functional parameters of peripheral nerves and their vascular supply are indicative of pathological changes due to injury or disease.
no code implementations • 25 Jul 2022 • Felix Buchert, Nassir Navab, Seong Tae Kim
By considering the consistency information with the diversity in the consistency-based embedding scheme, the proposed method could select more informative samples for labeling in the semi-supervised learning setting.
2 code implementations • 21 Jul 2022 • Chantal Pellegrini, Nassir Navab, Anees Kazi
We find that our proposed pre-training methods help in modeling the data at a patient and population level and improve performance in different fine-tuning tasks on all datasets.
1 code implementation • 18 Jul 2022 • Yordanka Velikova, Walter Simson, Mehrdad Salehi, Mohammad Farid Azampour, Philipp Paprottka, Nassir Navab
Abdominal aortic aneurysm (AAA) is a vascular disease in which a section of the aorta enlarges, weakening its walls and potentially rupturing the vessel.
1 code implementation • 15 Jul 2022 • Matan Atad, Vitalii Dmytrenko, Yitong Li, Xinyue Zhang, Matthias Keicher, Jan Kirschke, Bene Wiestler, Ashkan Khakzar, Nassir Navab
Deep learning models used in medical image analysis are prone to raising reliability concerns due to their black-box nature.
no code implementations • 12 Jul 2022 • Yousef Yeganeh, Azade Farshad, Nassir Navab
Inpainting has recently been proposed as a successful deep learning technique for unsupervised medical image model discovery.
no code implementations • 7 Jul 2022 • Yousef Yeganeh, Azade Farshad, Johann Boschmann, Richard Gaus, Maximilian Frantzen, Nassir Navab
Although most medical centers conduct similar medical imaging tasks, their differences, such as specializations, number of patients, and devices, lead to distinctive data distributions.
1 code implementation • 1 Jul 2022 • Raheleh Salehi, Ario Sadafi, Armin Gruber, Peter Lienemann, Nassir Navab, Shadi Albarqouni, Carsten Marr
Here, we propose a cross-domain adapted autoencoder to extract features in an unsupervised manner on three different datasets of single white blood cells scanned from peripheral blood smears.
no code implementations • 27 Jun 2022 • Yu Liu, Kurt Weiss, Nassir Navab, Carsten Marr, Jan Huisken, Tingying Peng
Light-sheet fluorescence microscopy (LSFM) is a cutting-edge volumetric imaging technique that allows for three-dimensional imaging of mesoscopic samples with decoupled illumination and detection paths.
no code implementations • 16 Jun 2022 • Marcel Kollovieh, Matthias Keicher, Stephan Wunderlich, Hendrik Burwinkel, Thomas Wendler, Nassir Navab
To this end, we propose a multi-task method based on U-Net that takes T1-weighted MR images as an input to generate synthetic FDG-PET images and classifies the dementia progression of the patient into cognitive normal (CN), cognitive impairment (MCI), and AD.
1 code implementation • 13 Jun 2022 • Matteo Ronchetti, Julia Rackerseder, Maria Tirindelli, Mehrdad Salehi, Nassir Navab, Wolfgang Wein, Oliver Zettinig
We propose a novel method to automatically calibrate tracked ultrasound probes.
no code implementations • 13 Jun 2022 • Tariq Bdair, Hossam Abdelhamid, Nassir Navab, Shadi Albarqouni
We validate TriMix on eight benchmark datasets consisting of natural and medical images with an improvement of 2. 71% and 0. 41% better than the second-best models for both data types.
no code implementations • 9 Jun 2022 • Shervin Dehghani, Benjamin Busam, Nassir Navab, Ali Nasseri
Despite its broad availability, volumetric information acquisition from Bright-Field Microscopy (BFM) is inherently difficult due to the projective nature of the acquisition process.
no code implementations • CVPR 2022 • Pengyuan Wang, HyunJun Jung, Yitong Li, Siyuan Shen, Rahul Parthasarathy Srikanth, Lorenzo Garattoni, Sven Meier, Nassir Navab, Benjamin Busam
Object pose estimation is crucial for robotic applications and augmented reality.
no code implementations • 16 May 2022 • Bailiang Jian, Mohammad Farid Azampour, Francesca De Benetti, Johannes Oberreuter, Christina Bukas, Alexandra S. Gersing, Sarah C. Foreman, Anna-Sophia Dietrich, Jon Rischewski, Jan S. Kirschke, Nassir Navab, Thomas Wendler
We specifically design these losses to depend only on the CT label maps since automatic vertebra segmentation in CT gives more accurate results contrary to MRI.
1 code implementation • 10 May 2022 • Yuan Bi, Zhongliang Jiang, Yuan Gao, Thomas Wendler, Angelos Karlas, Nassir Navab
The results demonstrate that proposed approach can effectively and accurately navigate the probe towards the longitudinal view of vessels.
1 code implementation • 9 May 2022 • Mohammad Eslami, Solale Tabarestani, Ehsan Adeli, Glyn Elwyn, Tobias Elze, Mengyu Wang, Nazlee Zebardast, Nassir Navab, Malek Adjouadi
With the advent of sophisticated machine learning (ML) techniques and the promising results they yield, especially in medical applications, where they have been investigated for different tasks to enhance the decision-making process.
no code implementations • 8 May 2022 • Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari
We propose a novel convolutional operator for the task of point cloud completion.
1 code implementation • 15 Apr 2022 • Azade Farshad, Yousef Yeganeh, Peter Gehlbach, Nassir Navab
Automated segmentation of retinal optical coherence tomography (OCT) images has become an important recent direction in machine learning for medical applications.
Ranked #1 on
Retinal OCT Layer Segmentation
on Duke SD-OCT
(using extra training data)
no code implementations • 4 Apr 2022 • Ashkan Khakzar, Yawei Li, Yang Zhang, Mirac Sanisoglu, Seong Tae Kim, Mina Rezaei, Bernd Bischl, Nassir Navab
One challenging property lurking in medical datasets is the imbalanced data distribution, where the frequency of the samples between the different classes is not balanced.
no code implementations • 1 Apr 2022 • Kamilia Mullakaeva, Luca Cosmo, Anees Kazi, Seyed-Ahmad Ahmadi, Nassir Navab, Michael M. Bronstein
In this work, we propose Graph-in-Graph (GiG), a neural network architecture for protein classification and brain imaging applications that exploits the graph representation of the input data samples and their latent relation.
no code implementations • CVPR 2022 • Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari
To this aim, we introduce a second model that assembles our layers within a transformer architecture.
1 code implementation • 30 Mar 2022 • Paul Engstler, Matthias Keicher, David Schinz, Kristina Mach, Alexandra S. Gersing, Sarah C. Foreman, Sophia S. Goller, Juergen Weissinger, Jon Rischewski, Anna-Sophia Dietrich, Benedikt Wiestler, Jan S. Kirschke, Ashkan Khakzar, Nassir Navab
Do black-box neural network models learn clinically relevant features for fracture diagnosis?
no code implementations • 29 Mar 2022 • Matthias Keicher, Kamilia Zaripova, Tobias Czempiel, Kristina Mach, Ashkan Khakzar, Nassir Navab
The automation of chest X-ray reporting has garnered significant interest due to the time-consuming nature of the task.
1 code implementation • 25 Mar 2022 • Mojtaba Bahrami, Mahsa Ghorbani, Nassir Navab
We show that training the agent against the prediction model can significantly improve the semantic features extracted for downstream classification tasks.
2 code implementations • 23 Mar 2022 • Chantal Pellegrini, Anees Kazi, Nassir Navab
We test our method on two medical datasets of patient records, TADPOLE and MIMIC-III, including imaging and non-imaging features and different prediction tasks.
Ranked #1 on
Length-of-Stay prediction
on MIMIC-III
1 code implementation • 22 Mar 2022 • Ege Özsoy, Evin Pınar Örnek, Ulrich Eck, Tobias Czempiel, Federico Tombari, Nassir Navab
Towards this goal, for the first time, we propose using semantic scene graphs (SSG) to describe and summarize the surgical scene.
Ranked #5 on
Scene Graph Generation
on 4D-OR
no code implementations • 22 Mar 2022 • Matthias Seibold, Armando Hoch, Mazda Farshad, Nassir Navab, Philipp Fürnstahl
In this work, we propose a novel data augmentation method for clinical audio datasets based on a conditional Wasserstein Generative Adversarial Network with Gradient Penalty (cWGAN-GP), operating on log-mel spectrograms.
no code implementations • 21 Mar 2022 • Tobias Czempiel, Coco Rogers, Matthias Keicher, Magdalini Paschali, Rickmer Braren, Egon Burian, Marcus Makowski, Nassir Navab, Thomas Wendler, Seong Tae Kim
For this purpose, longitudinal self-supervision schemes are explored on clinical longitudinal COVID-19 CT scans.
1 code implementation • CVPR 2022 • Yongzhi Su, Mahdi Saleh, Torben Fetzer, Jason Rambach, Nassir Navab, Benjamin Busam, Didier Stricker, Federico Tombari
Dense methods also improved pose estimation in the presence of occlusion.
no code implementations • 17 Mar 2022 • Tobias Czempiel, Aidean Sharghi, Magdalini Paschali, Nassir Navab, Omid Mohareri
Algorithmic surgical workflow recognition is an ongoing research field and can be divided into laparoscopic (Internal) and operating room (External) analysis.
no code implementations • 16 Mar 2022 • Lennart Bastian, Tobias Czempiel, Christian Heiliger, Konrad Karcz, Ulrich Eck, Benjamin Busam, Nassir Navab
Existing datasets from OR room cameras are thus far limited in size or modalities acquired, leaving it unclear which sensor modalities are best suited for tasks such as recognizing surgical action from videos.
no code implementations • 15 Mar 2022 • Evin Pınar Örnek, Shristi Mudgal, Johanna Wald, Yida Wang, Nassir Navab, Federico Tombari
There have been numerous recently proposed methods for monocular depth prediction (MDP) coupled with the equally rapid evolution of benchmarking tools.
3 code implementations • CVPR 2022 • Yan Di, Ruida Zhang, Zhiqiang Lou, Fabian Manhardt, Xiangyang Ji, Nassir Navab, Federico Tombari
While 6D object pose estimation has recently made a huge leap forward, most methods can still only handle a single or a handful of different objects, which limits their applications.
Ranked #1 on
6D Pose Estimation
on LineMOD
(Mean ADD-S metric)
no code implementations • CVPR 2022 • Ashkan Khakzar, Pedram Khorsandi, Rozhin Nobahari, Nassir Navab
It is a mystery which input features contribute to a neural network's output.
no code implementations • CVPR 2022 • Mahdi Saleh, Shun-Cheng Wu, Luca Cosmo, Nassir Navab, Benjamin Busam, Federico Tombari
Shape matching has been a long-studied problem for the computer graphics and vision community.
no code implementations • 14 Jan 2022 • John Ridley, Huseyin Coskun, David Joseph Tan, Nassir Navab, Federico Tombari
The video action segmentation task is regularly explored under weaker forms of supervision, such as transcript supervision, where a list of actions is easier to obtain than dense frame-wise labels.
1 code implementation • CVPR 2022 • Daniel Grzech, Mohammad Farid Azampour, Ben Glocker, Julia Schnabel, Nassir Navab, Bernhard Kainz, Loïc le Folgoc
We propose a novel variational Bayesian formulation for diffeomorphic non-rigid registration of medical images, which learns in an unsupervised way a data-specific similarity metric.
no code implementations • CVPR 2022 • Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Mohammad-Ali Nikouei Mahani, Nassir Navab, Benjamin Busam, Federico Tombari
Despite training only on a standard dataset, such as KITTI, augmenting with our vector fields significantly improves the generalization to differently shaped objects and scenes.
no code implementations • 7 Dec 2021 • HyunJun Jung, Nikolas Brasch, Ales Leonardis, Nassir Navab, Benjamin Busam
Indirect Time-of-Flight (I-ToF) imaging is a widespread way of depth estimation for mobile devices due to its small size and affordable price.
no code implementations • 6 Dec 2021 • Pengyuan Wang, Fabian Manhardt, Luca Minciullo, Lorenzo Garattoni, Sven Meie, Nassir Navab, Benjamin Busam
We first present a small sequence of RGB-D images displaying a human-object interaction.
1 code implementation • 2 Dec 2021 • Enis Simsar, Evin Pınar Örnek, Fabian Manhardt, Helisa Dhamo, Nassir Navab, Federico Tombari
With the advent of deep learning, estimating depth from a single RGB image has recently received a lot of attention, being capable of empowering many different applications ranging from path planning for robotics to computational cinematography.
no code implementations • 30 Nov 2021 • Shervin Dehghani, Michael Sommersperger, Junjie Yang, Benjamin Busam, Kai Huang, Peter Gehlbach, Iulian Iordachita, Nassir Navab, M. Ali Nasseri
For this purpose, we present a platform for autonomous trocar docking that combines computer vision and a robotic setup.
1 code implementation • 22 Oct 2021 • Azade Farshad, Sabrina Musatian, Helisa Dhamo, Nassir Navab
We propose MIGS (Meta Image Generation from Scene Graphs), a meta-learning based approach for few-shot image generation from graphs that enables adapting the model to different scenes and increases the image quality by training on diverse sets of tasks.
no code implementations • 15 Oct 2021 • Patrick Ruhkamp, Daoyi Gao, Hanzhi Chen, Nassir Navab, Benjamin Busam
A novel temporal attention mechanism further processes the local geometric information in a global context across consecutive images.
no code implementations • 8 Oct 2021 • Markus Herb, Matthias Lemberger, Marcel M. Schmitt, Alexander Kurz, Tobias Weiherer, Nassir Navab, Federico Tombari
Accurate and reliable localization is a fundamental requirement for autonomous vehicles to use map information in higher-level tasks such as navigation or planning.
no code implementations • 4 Oct 2021 • Stefano Gasperini, Jan Haug, Mohammad-Ali Nikouei Mahani, Alvaro Marcos-Ramiro, Nassir Navab, Benjamin Busam, Federico Tombari
Estimating the uncertainty of a neural network plays a fundamental role in safety-critical settings.
1 code implementation • NeurIPS 2021 • Yang Zhang, Ashkan Khakzar, Yawei Li, Azade Farshad, Seong Tae Kim, Nassir Navab
We propose a method to identify features with predictive information in the input domain.
no code implementations • 3 Oct 2021 • Michelle Xiao-Lin Foo, Seong Tae Kim, Magdalini Paschali, Leili Goli, Egon Burian, Marcus Makowski, Rickmer Braren, Nassir Navab, Thomas Wendler
Existing automatic and interactive segmentation models for medical images only use data from a single time point (static).
no code implementations • 24 Sep 2021 • Mert Asim Karaoglu, Nikolas Brasch, Marijn Stollenga, Wolfgang Wein, Nassir Navab, Federico Tombari, Alexander Ladikos
The results of our experiments show that the proposed method improves the network's performance on real images by a considerable margin and can be employed in 3D reconstruction pipelines.
1 code implementation • 18 Sep 2021 • Anastasia Makarevich, Azade Farshad, Vasileios Belagiannis, Nassir Navab
In this work, we present MetaMedSeg, a gradient-based meta-learning algorithm that redefines the meta-learning task for the volumetric medical data with the goal to capture the variety between the slices.
no code implementations • 11 Sep 2021 • Ario Sadafi, Asya Makhro, Leonid Livshits, Nassir Navab, Anna Bogdanova, Shadi Albarqouni, Carsten Marr
Sickle cell disease (SCD) is a severe genetic hemoglobin disorder that results in premature destruction of red blood cells.
1 code implementation • ICCV 2021 • Helisa Dhamo, Fabian Manhardt, Nassir Navab, Federico Tombari
Scene graphs are representations of a scene, composed of objects (nodes) and inter-object relationships (edges), proven to be particularly suited for this task, as they allow for semantic control on the generated content.
2 code implementations • ICCV 2021 • Yan Di, Fabian Manhardt, Gu Wang, Xiangyang Ji, Nassir Navab, Federico Tombari
Directly regressing all 6 degrees-of-freedom (6DoF) for the object pose (e. g. the 3D rotation and translation) in a cluttered environment from a single RGB image is a challenging problem.
Ranked #1 on
6D Pose Estimation using RGB
on Occlusion LineMOD
no code implementations • ICCV 2021 • Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari
Scene graphs, composed of nodes as objects and directed-edges as relationships among objects, offer an alternative representation of a scene that is more semantically grounded than images.
no code implementations • 10 Aug 2021 • Markus Krönke, Christine Eilers, Desislava Dimova, Melanie Köhler, Gabriel Buschner, Lilit Mirzojan, Lemonia Konstantinidou, Marcus R. Makowski, James Nagarajah, Nassir Navab, Wolfgang Weber, Thomas Wendler
Conclusion: Tracked 3D ultrasound combined with a CNN segmentation significantly reduces interobserver variability in thyroid volumetry and increases the accuracy of the measurements with shorter acquisition times.
no code implementations • 10 Aug 2021 • Stefano Gasperini, Patrick Koch, Vinzenz Dallabetta, Nassir Navab, Benjamin Busam, Federico Tombari
While self-supervised monocular depth estimation in driving scenarios has achieved comparable performance to supervised approaches, violations of the static world assumption can still lead to erroneous depth predictions of traffic participants, posing a potential safety issue.
no code implementations • 29 Jul 2021 • Matthias Keicher, Hendrik Burwinkel, David Bani-Harouni, Magdalini Paschali, Tobias Czempiel, Egon Burian, Marcus R. Makowski, Rickmer Braren, Nassir Navab, Thomas Wendler
Specifically, we introduce a multimodal similarity metric to build a population graph for clustering patients and an image-based end-to-end Graph Attention Network to process this graph and predict the COVID-19 patient outcomes: admission to ICU, need for ventilation and mortality.
1 code implementation • 26 Jul 2021 • Daniil Pakhomov, Sanchit Hira, Narayani Wagle, Kemar E. Green, Nassir Navab
Derived regions are consistent across different images and coincide with human-defined semantic classes on some datasets.
no code implementations • 9 Jun 2021 • Jakob Weiss, Nassir Navab
In this work, we introduce Deep Direct Volume Rendering (DeepDVR), a generalization of DVR that allows for the integration of deep neural networks into the DVR algorithm.
no code implementations • 9 Jun 2021 • Ege Özsoy, Evin Pınar Örnek, Ulrich Eck, Federico Tombari, Nassir Navab
We then use MSSG to introduce a dynamically generated graphical user interface tool for surgical procedure analysis which could be used for many applications including process optimization, OR design and automatic report generation.