Search Results for author: Nassir Navab

Found 412 papers, 143 papers with code

Think Before Refusal : Triggering Safety Reflection in LLMs to Mitigate False Refusal Behavior

no code implementations22 Mar 2025 Shengyun Si, Xinpeng Wang, Guangyao Zhai, Nassir Navab, Barbara Plank

Recent advancements in large language models (LLMs) have demonstrated that fine-tuning and human alignment can render LLMs harmless.

MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments

1 code implementation4 Mar 2025 Ege Özsoy, Chantal Pellegrini, Tobias Czempiel, Felix Tristram, Kun Yuan, David Bani-Harouni, Ulrich Eck, Benjamin Busam, Matthias Keicher, Nassir Navab

Operating rooms (ORs) are complex, high-stakes environments requiring precise understanding of interactions among medical staff, tools, and equipment for enhancing surgical assistance, situational awareness, and patient safety.

 Ranked #1 on Video Panoptic Segmentation on 4D-OR (using extra training data)

2D Panoptic Segmentation Graph Generation +4

From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine

no code implementations13 Feb 2025 Lukas Buess, Matthias Keicher, Nassir Navab, Andreas Maier, Soroosh Tayebi Arasteh

The field has advanced rapidly, evolving from text-only large language models for tasks such as clinical documentation and decision support to multimodal AI systems capable of integrating diverse data modalities, including imaging, text, and structured data, within a single model.

Diagnostic Drug Discovery +1

GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation

no code implementations6 Feb 2025 Weihang Li, Hongli Xu, Junwen Huang, HyunJun Jung, Peter KT Yu, Nassir Navab, Benjamin Busam

In this paper, we present GCE-Pose, a method that enhances pose estimation for novel instances by integrating category-level global context prior.

Pose Estimation

Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment

no code implementations4 Feb 2025 Yaling Shen, Zhixiong Zhuang, Kun Yuan, Maria-Irina Nicolae, Nassir Navab, Nicolas Padoy, Mario Fritz

Experiments on the IU X-RAY and MIMIC-CXR radiology datasets demonstrate that Adversarial Domain Alignment enables attackers to steal the medical MLLM without any access to medical data.

Data Augmentation

Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis

1 code implementation16 Jan 2025 Tingxuan Chen, Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas Padoy

Conclusion: We propose a text-driven adaptation approach that mitigates the modality gap and handles multiple downstream tasks in surgical workflow analysis, with minimal reliance on large annotated datasets.

Decoder Image Captioning +1

UltraRay: Full-Path Ray Tracing for Enhancing Realism in Ultrasound Simulation

no code implementations10 Jan 2025 Felix Duelmer, Mohammad Farid Azampour, Nassir Navab

We propose a novel ultrasound simulation pipeline that utilizes a ray tracing algorithm to generate echo data, tracing each ray from the transducer through the scene and back to the sensor.

Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis

no code implementations30 Dec 2024 Yousef Yeganeh, Ioannis Charisiadis, Marta Hasny, Martin Hartenberger, Björn Ommer, Nassir Navab, Azade Farshad, Ehsan Adeli

Scaling by training on large datasets has been shown to enhance the quality and fidelity of image generation and manipulation with diffusion models; however, such large datasets are not always accessible in medical imaging due to cost and privacy issues, which contradicts one of the main applications of such models to produce synthetic samples where real data is scarce.

counterfactual Image Generation

Conformable Convolution for Topologically Aware Learning of Complex Anatomical Structures

no code implementations29 Dec 2024 Yousef Yeganeh, Rui Xiao, Goktug Guvercin, Nassir Navab, Azade Farshad

While conventional computer vision emphasizes pixel-level and feature-based objectives, medical image analysis of intricate biological structures necessitates explicit representation of their complex topological properties.

Medical Image Analysis

ESCAPE: Equivariant Shape Completion via Anchor Point Encoding

no code implementations1 Dec 2024 Burak Bekci, Nassir Navab, Federico Tombari, Mahdi Saleh

Shape completion, a crucial task in 3D computer vision, involves predicting and filling the missing regions of scanned or partially observed objects.

Pose Estimation

G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs

no code implementations25 Nov 2024 Kunyi Li, Michael Niemeyer, Zeyu Chen, Nassir Navab, Federico Tombari

By establishing a differentiable connection between the explicit Gaussians and the implicit SDF, our approach enables high-quality surface reconstruction and rendering.

3DGS Novel View Synthesis +1

OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining

no code implementations23 Nov 2024 Ming Hu, Kun Yuan, Yaling Shen, Feilong Tang, Xiaohao Xu, Lin Zhou, Wei Li, Ying Chen, Zhongxing Xu, Zelin Peng, Siyuan Yan, Vinkle Srivastav, Diping Song, Tianbin Li, Danli Shi, Jin Ye, Nicolas Padoy, Nassir Navab, Junjun He, ZongYuan Ge

Surgical practice involves complex visual interpretation, procedural skills, and advanced medical knowledge, making surgical vision-language pretraining (VLP) particularly challenging due to this complexity and the limited availability of annotated data.

Representation Learning Retrieval

Synomaly Noise and Multi-Stage Diffusion: A Novel Approach for Unsupervised Anomaly Detection in Ultrasound Imaging

no code implementations6 Nov 2024 Yuan Bi, Lucie Huang, Ricarda Clarenbach, Reza Ghotbi, Angelos Karlas, Nassir Navab, Zhongliang Jiang

To address these issues, we propose a novel unsupervised anomaly detection framework based on a diffusion model that incorporates a synthetic anomaly (Synomaly) noise function and a multi-stage diffusion process.

counterfactual Unsupervised Anomaly Detection

SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark

1 code implementation30 Oct 2024 HyunJun Jung, Weihang Li, Shun-Cheng Wu, William Bittner, Nikolas Brasch, Jifei Song, Eduardo Pérez-Pellitero, Zhensong Zhang, Arthur Moreau, Nassir Navab, Benjamin Busam

However, using these datasets to evaluate dense geometry tasks, such as depth rendering, can be problematic as the meshes of the dataset are often incomplete and may produce wrong ground truth to evaluate the details.

6D Pose Estimation

KaLDeX: Kalman Filter based Linear Deformable Cross Attention for Retina Vessel Segmentation

1 code implementation28 Oct 2024 Zhihao Zhao, Shahrooz Faghihroohi, Yinzheng Zhao, Junjie Yang, Shipeng Zhong, Kai Huang, Nassir Navab, Boyang Li, M. Ali Nasseri

Methods: To address these issues, we propose a novel network (KaLDeX) for vascular segmentation leveraging a Kalman filter based linear deformable cross attention (LDCA) module, integrated within a UNet++ framework.

Segmentation

VISAGE: Video Synthesis using Action Graphs for Surgery

no code implementations23 Oct 2024 Yousef Yeganeh, Rachmadio Lazuardi, Amir Shamseddin, Emine Dari, Yash Thirani, Nassir Navab, Azade Farshad

The results of our experiments demonstrate high-fidelity video generation for laparoscopy procedures, which enables various applications in SDS.

Video Generation

Neural Semantic Map-Learning for Autonomous Vehicles

no code implementations10 Oct 2024 Markus Herb, Nassir Navab, Federico Tombari

Autonomous vehicles demand detailed maps to maneuver reliably through traffic, which need to be kept up-to-date to ensure a safe operation.

Autonomous Vehicles

Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation

2 code implementations30 Sep 2024 Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas Padoy

Surgical video-language pretraining (VLP) faces unique challenges due to the knowledge domain gap and the scarcity of multi-modal data.

Cross-Modal Retrieval Dynamic Time Warping +1

Physics-Informed Latent Diffusion for Multimodal Brain MRI Synthesis

1 code implementation20 Sep 2024 Sven Lüpke, Yousef Yeganeh, Ehsan Adeli, Nassir Navab, Azade Farshad

Our approach utilizes latent diffusion models and a two-step generative process: first, unobserved physical tissue property maps are synthesized using a latent diffusion model, and then these maps are combined with a physical signal model to generate the final MRI scan.

KLDD: Kalman Filter based Linear Deformable Diffusion Model in Retinal Image Segmentation

no code implementations19 Sep 2024 Zhihao Zhao, Yinzheng Zhao, Junjie Yang, Kai Huang, Nassir Navab, M. Ali Nasseri

To better optimize the coordinate positions of deformable convolution, we employ the Kalman filter to enhance the perception of vascular structures in linear deformable convolution.

Image Segmentation Retinal Vessel Segmentation +2

SURGIVID: Annotation-Efficient Surgical Video Object Discovery

no code implementations12 Sep 2024 Çağhan Köksal, Ghazal Ghazaei, Nassir Navab

Considering the profusion of surgical videos obtained through standardized surgical workflows, we propose an annotation-efficient framework for the semantic segmentation of surgical scenes.

Object Object Discovery +1

MAGDA: Multi-agent guideline-driven diagnostic assistance

no code implementations10 Sep 2024 David Bani-Harouni, Nassir Navab, Matthias Keicher

Large Language Models (LLMs) have the potential to alleviate some pressure from these clinicians by providing insights that can help them in their decision-making.

Diagnostic Language Modelling

Multimodal Analysis of White Blood Cell Differentiation in Acute Myeloid Leukemia Patients using a β-Variational Autoencoder

no code implementations13 Aug 2024 Gizem Mert, Ario Sadafi, Raheleh Salehi, Nassir Navab, Carsten Marr

Biomedical imaging and RNA sequencing with single-cell resolution improves our understanding of white blood cell diseases like leukemia.

PHOCUS: Physics-Based Deconvolution for Ultrasound Resolution Enhancement

1 code implementation7 Aug 2024 Felix Duelmer, Walter Simson, Mohammad Farid Azampour, Magdalena Wysocki, Angelos Karlas, Nassir Navab

Conventionally, deconvolution techniques attempt to rectify the imaging system's dependent PSF, working directly on the radio-frequency (RF) data.

Diagnostic SSIM

Deep Spectral Methods for Unsupervised Ultrasound Image Interpretation

1 code implementation4 Aug 2024 Oleksandra Tmenova, Yordanka Velikova, Mahdi Saleh, Nassir Navab

Ultrasound imaging is challenging to interpret due to non-uniform intensities, low contrast, and inherent artifacts, necessitating extensive training for non-specialists.

Anatomy Clustering +1

Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder

1 code implementation2 Aug 2024 Matan Atad, David Schinz, Hendrik Moeller, Robert Graf, Benedikt Wiestler, Daniel Rueckert, Nassir Navab, Jan S. Kirschke, Matthias Keicher

Counterfactual explanations (CEs) aim to enhance the interpretability of machine learning models by illustrating how alterations in input features would affect the resulting predictions.

counterfactual Image Classification +2

SANGRIA: Surgical Video Scene Graph Optimization for Surgical Workflow Prediction

no code implementations29 Jul 2024 Çağhan Köksal, Ghazal Ghazaei, Felix Holm, Azade Farshad, Nassir Navab

Graph-based holistic scene representations facilitate surgical workflow understanding and have recently demonstrated significant success.

Disentanglement Graph Generation +1

SLoRD: Structural Low-Rank Descriptors for Shape Consistency in Vertebrae Segmentation

1 code implementation11 Jul 2024 Xin You, Yixin Lou, Minghui Zhang, Jie Yang, Nassir Navab, Yun Gu

Specifically, a contour generation network is proposed based on Structural Low-Rank Descriptors for shape consistency, termed SLoRD.

Instance Segmentation Segmentation +1

Diffusion as Sound Propagation: Physics-inspired Model for Ultrasound Image Generation

1 code implementation7 Jul 2024 Marina Domínguez, Yordanka Velikova, Nassir Navab, Mohammad Farid Azampour

However, when it comes to ultrasound (US) imaging, the authenticity of generated data often diminishes due to the oversight of ultrasound physics.

Data Augmentation Image Generation

RaNeuS: Ray-adaptive Neural Surface Reconstruction

1 code implementation14 Jun 2024 Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari

Our objective is to leverage a differentiable radiance field \eg NeRF to reconstruct detailed 3D surfaces in addition to producing the standard novel view renderings.

NeRF Novel View Synthesis +1

Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging

1 code implementation6 Jun 2024 Zhongliang Jiang, Yunfeng Kang, Yuan Bi, Xuesong Li, Chenyang Li, Nassir Navab

Then, a dense skeleton graph-based non-rigid registration is presented to map the intercostal scanning path from a generic template to individual patients.

Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance

no code implementations2 Jun 2024 Jun Li, Tongkun Su, Baoliang Zhao, Faqin Lv, Qiong Wang, Nassir Navab, Ying Hu, Zhongliang Jiang

In this work, we propose a novel framework for automatic ultrasound report generation, leveraging a combination of unsupervised and supervised learning methods to aid the report generation process.

HecVL: Hierarchical Video-Language Pretraining for Zero-shot Surgical Phase Recognition

2 code implementations16 May 2024 Kun Yuan, Vinkle Srivastav, Nassir Navab, Nicolas Padoy

By disentangling embedding spaces of different hierarchical levels, the learned multi-modal representations encode short-term and long-term surgical concepts in the same model.

Contrastive Learning Surgical phase recognition

EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

1 code implementation2 May 2024 Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam

The scheme ensures that the denoising processes are influenced by a holistic understanding of the scene graph, facilitating the generation of globally coherent scenes.

3D Object Retrieval Denoising +2

SpecstatOR: Speckle statistics-based iOCT Segmentation Network for Ophthalmic Surgery

no code implementations30 Apr 2024 Kristina Mach, Hessam Roodaki, Michael Sommersperger, Nassir Navab

This paper presents an innovative approach to intraoperative Optical Coherence Tomography (iOCT) image segmentation in ophthalmic surgery, leveraging statistical analysis of speckle patterns to incorporate statistical pathology-specific prior knowledge.

Image Segmentation Segmentation +1

Real-time guidewire tracking and segmentation in intraoperative x-ray

no code implementations12 Apr 2024 Baochang Zhang, Mai Bui, Cheng Wang, Felix Bourier, Heribert Schunkert, Nassir Navab

For this purpose, real-time and accurate guidewire segmentation and tracking can enhance the visualization of guidewires and provide visual feedback for physicians during the intervention as well as for robot-assisted interventions.

Shape Completion in the Dark: Completing Vertebrae Morphology from 3D Ultrasound

1 code implementation11 Apr 2024 Miruna-Alexandra Gafencu, Yordanka Velikova, Mahdi Saleh, Tamas Ungi, Nassir Navab, Thomas Wendler, Mohammad Farid Azampour

Purpose: Ultrasound (US) imaging, while advantageous for its radiation-free nature, is challenging to interpret due to only partially visible organs and a lack of complete 3D information.

Anatomy

ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling

1 code implementation10 Apr 2024 Ege Özsoy, Chantal Pellegrini, Matthias Keicher, Nassir Navab

This demonstrates ORacle's potential to significantly enhance the scalability and affordability of OR domain modeling and opens a pathway for future advancements in surgical data science.

Data Augmentation Graph Generation +2

Neural Cellular Automata for Lightweight, Robust and Explainable Classification of White Blood Cell Images

no code implementations8 Apr 2024 Michael Deutges, Ario Sadafi, Nassir Navab, Carsten Marr

We test our approach on three datasets of white blood cell images and show that we achieve competitive performance compared to conventional methods.

Classification Image Classification

VibNet: Vibration-Boosted Needle Detection in Ultrasound Images

1 code implementation21 Mar 2024 Dianye Huang, Chenyang Li, Angelos Karlas, Xiangyu Chu, K. W. Samuel Au, Nassir Navab, Zhongliang Jiang

The results obtained on porcine samples demonstrate that VibNet effectively detects needles even when their visibility is severely reduced, with a tip error of $1. 61\pm1. 56~mm$ compared to $8. 15\pm9. 98~mm$ for UNet and $6. 63\pm7. 58~mm$ for WNet, and a needle direction error of $1. 64\pm1. 86^{\circ}$ compared to $9. 29\pm15. 30^{\circ}$ for UNet and $8. 54\pm17. 92^{\circ}$ for WNet.

FLex: Joint Pose and Dynamic Radiance Fields Optimization for Stereo Endoscopic Videos

no code implementations18 Mar 2024 Florian Philipp Stilz, Mert Asim Karaoglu, Felix Tristram, Nassir Navab, Benjamin Busam, Alexander Ladikos

However, the setup has been restricted to a static endoscope, limited deformation, or required an external tracking device to retrieve camera pose information of the endoscopic camera.

Neural Rendering Novel View Synthesis

Robot-Assisted Deep Venous Thrombosis Ultrasound Examination using Virtual Fixture

1 code implementation4 Jan 2024 Dianye Huang, Chenguang Yang, Mingchuan Zhou, Angelos Karlas, Nassir Navab, Zhongliang Jiang

To ensure the biometric measurements obtained in different examinations are comparable, the 6D scanning path is determined in a coarse-to-fine manner using both an external RGBD camera and US images.

ARC Position

Deformable 3D Gaussian Splatting for Animatable Human Avatars

no code implementations22 Dec 2023 HyunJun Jung, Nikolas Brasch, Jifei Song, Eduardo Perez-Pellitero, Yiren Zhou, Zhihao LI, Nassir Navab, Benjamin Busam

ParDy-Human introduces parameter-driven dynamics into 3D Gaussian Splatting where 3D Gaussians are deformed by a human pose model to animate the avatar.

Human Animation Novel View Synthesis

Advancing Surgical VQA with Scene Graph Knowledge

2 code implementations15 Dec 2023 Kun Yuan, Manasi Kattel, Joel L. Lavanchy, Nassir Navab, Vinkle Srivastav, Nicolas Padoy

We highlight that the primary limitation in the current surgical VQA systems is the lack of scene knowledge to answer complex queries.

Question Answering Visual Question Answering

Re-Nerfing: Improving Novel View Synthesis through Novel View Synthesis

no code implementations4 Dec 2023 Felix Tristram, Stefano Gasperini, Nassir Navab, Federico Tombari

This introduces additional multi-view constraints and allows the second model to converge to a better solution.

3D geometry Data Augmentation +3

S2P3: Self-Supervised Polarimetric Pose Prediction

no code implementations2 Dec 2023 Patrick Ruhkamp, Daoyi Gao, Nassir Navab, Benjamin Busam

The novel training paradigm comprises 1) a physical model to extract geometric information of polarized light, 2) a teacher-student knowledge distillation scheme and 3) a self-supervised loss formulation through differentiable rendering and an invertible physical constraint.

Knowledge Distillation Pose Prediction +1

RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance

1 code implementation30 Nov 2023 Chantal Pellegrini, Ege Özsoy, Benjamin Busam, Nassir Navab, Matthias Keicher

Conversational AI tools that can generate and discuss clinically correct radiology reports for a given medical image have the potential to transform radiology.

Diagnostic Language Modeling +3

DNS SLAM: Dense Neural Semantic-Informed SLAM

no code implementations30 Nov 2023 Kunyi Li, Michael Niemeyer, Nassir Navab, Federico Tombari

In this work, we introduce DNS SLAM, a novel neural RGB-D semantic SLAM approach featuring a hybrid representation.

Semantic SLAM

Robust Tumor Segmentation with Hyperspectral Imaging and Graph Neural Networks

no code implementations20 Nov 2023 Mayar Lotfy Mostafa, Anna Alperovich, Tommaso Giannantonio, Bjorn Barz, Xiaohan Zhang, Felix Holm, Nassir Navab, Felix Boehm, Carolin Schwamborn, Thomas K. Hoffmann, Patrick J. Schuler

Despite the limited dataset, the GNN-based model significantly outperforms context-agnostic approaches, accurately distinguishing between healthy and tumor tissues, even in images from previously unseen patients.

Tumor Segmentation

SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation

1 code implementation CVPR 2024 Yamei Chen, Yan Di, Guangyao Zhai, Fabian Manhardt, Chenyangguang Zhang, Ruida Zhang, Federico Tombari, Nassir Navab, Benjamin Busam

Leveraging the advantage of DINOv2 in providing SE(3)-consistent semantic features, we hierarchically extract two types of SE(3)-invariant geometric features to further encapsulate local-to-global object-specific information.

Object Pose Estimation

EyeLS: Shadow-Guided Instrument Landing System for Intraocular Target Approaching in Robotic Eye Surgery

no code implementations15 Nov 2023 Junjie Yang, Zhihao Zhao, Siyuan Shen, Daniel Zapp, Mathias Maier, Kai Huang, Nassir Navab, M. Ali Nasseri

Robotic ophthalmic surgery is an emerging technology to facilitate high-precision interventions such as retina penetration in subretinal injection and removal of floating tissues in retinal detachment depending on the input imaging modalities such as microscopy and intraoperative OCT (iOCT).

VoxNeRF: Bridging Voxel Representation and Neural Radiance Fields for Enhanced Indoor View Synthesis

no code implementations9 Nov 2023 Sen Wang, Qing Cheng, Stefano Gasperini, Wei zhang, Shun-Cheng Wu, Niclas Zeller, Daniel Cremers, Nassir Navab

The generation of high-fidelity view synthesis is essential for robotic navigation and interaction but remains challenging, particularly in indoor environments and real-time scenarios.

Novel View Synthesis

PRISM: Progressive Restoration for Scene Graph-based Image Manipulation

no code implementations3 Nov 2023 Pavel Jahoda, Azade Farshad, Yousef Yeganeh, Ehsan Adeli, Nassir Navab

We take advantage of the outer part of the masked area as they have a direct correlation with the context of the scene.

Denoising Descriptive +2

Dynamic Scene Graph Representation for Surgical Video

no code implementations25 Sep 2023 Felix Holm, Ghazal Ghazaei, Tobias Czempiel, Ege Özsoy, Stefan Saur, Nassir Navab

Surgical videos captured from microscopic or endoscopic imaging devices are rich but complex sources of information, depicting different tools and anatomical structures utilized during an extended amount of time.

AiAReSeg: Catheter Detection and Segmentation in Interventional Ultrasound using Transformers

no code implementations25 Sep 2023 Alex Ranne, Yordanka Velikova, Nassir Navab, Ferdinando Rodriguez y Baena

To date, endovascular surgeries are performed using the golden standard of Fluoroscopy, which uses ionising radiation to visualise catheters and vasculature.

SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs

no code implementations21 Sep 2023 Guangyao Zhai, Xiaoni Cai, Dianye Huang, Yan Di, Fabian Manhardt, Federico Tombari, Nassir Navab, Benjamin Busam

In this paper, we present SG-Bot, a novel rearrangement framework that utilizes a coarse-to-fine scheme with a scene graph as the scene representation.

Object Rearrangement

RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy

no code implementations18 Sep 2023 Mert Asim Karaoglu, Viktoria Markova, Nassir Navab, Benjamin Busam, Alexander Ladikos

While most classical methods achieve rotation-equivariant detection and invariant description by design, many learning-based approaches learn to be robust only up to a certain degree.

Keypoint Detection Self-Supervised Learning

Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction

no code implementations ICCV 2023 Zhiying Leng, Shun-Cheng Wu, Mahdi Saleh, Antonio Montanaro, Hao Yu, Yin Wang, Nassir Navab, Xiaohui Liang, Federico Tombari

In this work, we propose the first precise hand-object reconstruction method in hyperbolic space, namely Dynamic Hyperbolic Attention Network (DHANet), which leverages intrinsic properties of hyperbolic space to learn representative features.

Object Object Reconstruction

BigFUSE: Global Context-Aware Image Fusion in Dual-View Light-Sheet Fluorescence Microscopy with Image Formation Prior

no code implementations5 Sep 2023 Yu Liu, Gesine Muller, Nassir Navab, Carsten Marr, Jan Huisken, Tingying Peng

Light-sheet fluorescence microscopy (LSFM), a planar illumination technique that enables high-resolution imaging of samples, experiences defocused image quality caused by light scattering when photons propagate through thick tissues.

On the Localization of Ultrasound Image Slices within Point Distribution Models

1 code implementation1 Sep 2023 Lennart Bastian, Vincent Bürgin, Ha Young Kim, Alexander Baumann, Benjamin Busam, Mahdi Saleh, Nassir Navab

We demonstrate that our multi-modal registration framework can localize images on the 3D surface topology of a patient-specific organ and the mean shape of an SSM.

3D Reconstruction 3D Shape Representation +3

3D Adversarial Augmentations for Robust Out-of-Domain Predictions

no code implementations29 Aug 2023 Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari

We conduct extensive experiments across a variety of scenarios on data from KITTI, Waymo, and CrashD for 3D object detection, and on data from SemanticKITTI, Waymo, and nuScenes for 3D semantic segmentation.

3D Object Detection 3D Semantic Segmentation +2

A Continual Learning Approach for Cross-Domain White Blood Cell Classification

no code implementations24 Aug 2023 Ario Sadafi, Raheleh Salehi, Armin Gruber, Sayedali Shetab Boushehri, Pascal Giehr, Nassir Navab, Carsten Marr

Here, we propose a rehearsal-based continual learning approach for class incremental and domain incremental scenarios in white blood cell classification.

Classification Continual Learning

A Study of Age and Sex Bias in Multiple Instance Learning based Classification of Acute Myeloid Leukemia Subtypes

no code implementations24 Aug 2023 Ario Sadafi, Matthias Hehr, Nassir Navab, Carsten Marr

To that end, we train multiple MIL models using different levels of sex imbalance in the training set and excluding certain age groups.

Classification Decision Making +1

Multi-Modal Dataset Acquisition for Photometrically Challenging Object

no code implementations21 Aug 2023 HyunJun Jung, Patrick Ruhkamp, Nassir Navab, Benjamin Busam

This paper addresses the limitations of current datasets for 3D vision tasks in terms of accuracy, size, realism, and suitable imaging modalities for photometrically challenging objects.

Object

Polarimetric Information for Multi-Modal 6D Pose Estimation of Photometrically Challenging Objects with Limited Data

no code implementations21 Aug 2023 Patrick Ruhkamp, Daoyi Gao, HyunJun Jung, Nassir Navab, Benjamin Busam

6D pose estimation pipelines that rely on RGB-only or RGB-D data show limitations for photometrically challenging objects with e. g. textureless surfaces, reflections or transparency.

6D Pose Estimation

Robust Monocular Depth Estimation under Challenging Conditions

no code implementations ICCV 2023 Stefano Gasperini, Nils Morbitzer, HyunJun Jung, Nassir Navab, Federico Tombari

While state-of-the-art monocular depth estimation approaches achieve impressive results in ideal settings, they are highly unreliable under challenging illumination and weather conditions, such as at nighttime or in the presence of rain.

Monocular Depth Estimation valid

DISBELIEVE: Distance Between Client Models is Very Essential for Effective Local Model Poisoning Attacks

no code implementations14 Aug 2023 Indu Joshi, Priyank Upadhya, Gaurav Kumar Nayak, Peter Schüffler, Nassir Navab

Leveraging this, we introduce DISBELIEVE, a local model poisoning attack that creates malicious parameters or gradients such that their distance to benign clients' parameters or gradients is low respectively but at the same time their adverse effect on the global model's performance is high.

Federated Learning Medical Image Analysis +2

WarpEM: Dynamic Time Warping for Accurate Catheter Registration in EM-guided Procedures

no code implementations7 Aug 2023 Ardit Ramadani, Peter Ewert, Heribert Schunkert, Nassir Navab

Accurate catheter tracking is crucial during minimally invasive endovascular procedures (MIEP), and electromagnetic (EM) tracking is a widely used technology that serves this purpose.

Dynamic Time Warping Medical Procedure

DefCor-Net: Physics-Aware Ultrasound Deformation Correction

1 code implementation7 Aug 2023 Zhongliang Jiang, Yue Zhou, Dongliang Cao, Nassir Navab

The recovery of morphologically accurate anatomical images from deformed ones is challenging in ultrasound (US) image acquisition, but crucial to accurate and consistent diagnosis, particularly in the emerging field of computer-assisted diagnosis.

Anatomy

LOTUS: Learning to Optimize Task-based US representations

1 code implementation29 Jul 2023 Yordanka Velikova, Mohammad Farid Azampour, Walter Simson, Vanessa Gonzalez Duque, Nassir Navab

Anatomical segmentation of organs in ultrasound images is essential to many clinical applications, particularly for diagnosis and monitoring.

Image Generation Segmentation

DisguisOR: Holistic Face Anonymization for the Operating Room

1 code implementation26 Jul 2023 Lennart Bastian, Tony Danjun Wang, Tobias Czempiel, Benjamin Busam, Nassir Navab

Methods: RGB and depth images from multiple cameras are fused into a 3D point cloud representation of the scene.

Face Anonymization

DISA: DIfferentiable Similarity Approximation for Universal Multimodal Registration

1 code implementation19 Jul 2023 Matteo Ronchetti, Wolfgang Wein, Nassir Navab, Oliver Zettinig, Raphael Prevost

Our method is several orders of magnitude faster than local patch-based metrics and can be directly applied in clinical settings by replacing the similarity measure with the proposed one.

Anatomy Image Registration

Rad-ReStruct: A Novel VQA Benchmark and Method for Structured Radiology Reporting

1 code implementation11 Jul 2023 Chantal Pellegrini, Matthias Keicher, Ege Özsoy, Nassir Navab

However, there is limited research on automating structured reporting, and no public benchmark is available for evaluating and comparing different methods.

Medical Visual Question Answering Question Answering +2

Motion Magnification in Robotic Sonography: Enabling Pulsation-Aware Artery Segmentation

1 code implementation7 Jul 2023 Dianye Huang, Yuan Bi, Nassir Navab, Zhongliang Jiang

To validate the proposed robotic US system for imaging arteries, experiments are carried out on volunteers' carotid and radial arteries.

Motion Magnification Segmentation

Thoracic Cartilage Ultrasound-CT Registration using Dense Skeleton Graph

1 code implementation7 Jul 2023 Zhongliang Jiang, Chenyang Li, Xuesong Li, Nassir Navab

To address this challenge, a graph-based non-rigid registration is proposed to enable transferring planned paths from the atlas to the current setup by explicitly considering subcutaneous bone surface features instead of the skin surface.

Template Matching

Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations

1 code implementation7 Jul 2023 Zhongliang Jiang, Yuan Bi, Mingchuan Zhou, Ying Hu, Michael Burke, Nassir Navab

The results demonstrated that the proposed advanced framework can robustly work on a variety of seen and unseen phantoms as well as in-vivo human carotid data.

Navigate

AutoPaint: A Self-Inpainting Method for Unsupervised Anomaly Detection

no code implementations21 May 2023 Mehdi Astaraki, Francesca De Benetti, Yousef Yeganeh, Iuliana Toma-Dasu, Örjan Smedby, Chunliang Wang, Nassir Navab, Thomas Wendler

This work intends to, first, propose a robust inpainting model to learn the details of healthy anatomies and reconstruct high-resolution images by preserving anatomical constraints.

Unsupervised Anomaly Detection

Self-Supervised Learning for Physiologically-Based Pharmacokinetic Modeling in Dynamic PET

no code implementations17 May 2023 Francesca De Benetti, Walter Simson, Magdalini Paschali, Hasan Sari, Axel Romiger, Kuangyu Shi, Nassir Navab, Thomas Wendler

Dynamic positron emission tomography imaging (dPET) provides temporally resolved images of a tracer enabling a quantitative measure of physiological processes.

Diagnostic Self-Supervised Learning

DopUS-Net: Quality-Aware Robotic Ultrasound Imaging based on Doppler Signal

1 code implementation15 May 2023 Zhongliang Jiang, Felix Duelmer, Nassir Navab

The experimental results demonstrate that the proposed approach with the re-identification process can significantly improve the accuracy and robustness of the segmentation results (dice score: from 0:54 to 0:86; intersection over union: from 0:47 to 0:78).

Image Segmentation Region Proposal +2

Skeleton Graph-based Ultrasound-CT Non-rigid Registration

no code implementations14 May 2023 Zhongliang Jiang, Xuesong Li, Chenyu Zhang, Yuan Bi, Walter Stechele, Nassir Navab

Autonomous ultrasound (US) scanning has attracted increased attention, and it has been seen as a potential solution to overcome the limitations of conventional US examinations, such as inter-operator variations.

Next-generation Surgical Navigation: Marker-less Multi-view 6DoF Pose Estimation of Surgical Instruments

no code implementations5 May 2023 Jonas Hein, Nicola Cavalcanti, Daniel Suter, Lukas Zingg, Fabio Carrillo, Lilian Calvet, Mazda Farshad, Marc Pollefeys, Nassir Navab, Philipp Fürnstahl

Third, we evaluate three state-of-the-art single-view and multi-view methods for the task of 6DoF pose estimation of surgical instruments and analyze the influence of camera configurations, training data, and occlusions on the pose accuracy and generalization ability.

Anatomy Pose Estimation

Incremental 3D Semantic Scene Graph Prediction from RGB Sequences

no code implementations CVPR 2023 Shun-Cheng Wu, Keisuke Tateno, Nassir Navab, Federico Tombari

Our method consists of a novel incremental entity estimation pipeline and a scene graph prediction network.

SCOPE: Structural Continuity Preservation for Medical Image Segmentation

no code implementations28 Apr 2023 Yousef Yeganeh, Azade Farshad, Goktug Guevercin, Amr Abu-zer, Rui Xiao, Yongjian Tang, Ehsan Adeli, Nassir Navab

Although the preservation of shape continuity and physiological anatomy is a natural assumption in the segmentation of medical images, it is often neglected by deep learning methods that mostly aim for the statistical modeling of input data as pixels rather than interconnected structures.

Anatomy Image Segmentation +3

DIAMANT: Dual Image-Attention Map Encoders For Medical Image Segmentation

no code implementations28 Apr 2023 Yousef Yeganeh, Azade Farshad, Peter Weinberger, Seyed-Ahmad Ahmadi, Ehsan Adeli, Nassir Navab

Although purely transformer-based architectures showed promising performance in many computer vision tasks, many hybrid models consisting of CNN and transformer blocks are introduced to fit more specialized tasks.

Image Segmentation Medical Image Segmentation +1

SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis

no code implementations28 Apr 2023 Azade Farshad, Yousef Yeganeh, Yu Chi, Chengzhi Shen, Björn Ommer, Nassir Navab

To address this limitation, we propose a novel guidance approach for the sampling process in the diffusion model that leverages bounding box and segmentation map information at inference time without additional training data.

Image Generation from Scene Graphs Segmentation +1

S3M: Scalable Statistical Shape Modeling through Unsupervised Correspondences

1 code implementation15 Apr 2023 Lennart Bastian, Alexander Baumann, Emily Hoppe, Vincent Bürgin, Ha Young Kim, Mahdi Saleh, Benjamin Busam, Nassir Navab

Statistical shape models (SSMs) are an established way to represent the anatomy of a population with various clinically relevant applications.

Anatomy

Prior-RadGraphFormer: A Prior-Knowledge-Enhanced Transformer for Generating Radiology Graphs from X-Rays

1 code implementation24 Mar 2023 Yiheng Xiong, Jingsong Liu, Kamilia Zaripova, Sahand Sharifzadeh, Matthias Keicher, Nassir Navab

The extraction of structured clinical information from free-text radiology reports in the form of radiology graphs has been demonstrated to be a valuable approach for evaluating the clinical correctness of report-generation methods.

Decision Making Medical Image Analysis +3

LABRAD-OR: Lightweight Memory Scene Graphs for Accurate Bimodal Reasoning in Dynamic Operating Rooms

1 code implementation23 Mar 2023 Ege Özsoy, Tobias Czempiel, Felix Holm, Chantal Pellegrini, Nassir Navab

The holistic representation of surgical scenes as semantic scene graphs (SGG), where entities are represented as nodes and relations between them as edges, is a promising direction for fine-grained semantic OR understanding.

Scene Graph Generation

MI-SegNet: Mutual Information-Based US Segmentation for Unseen Domain Generalization

2 code implementations22 Mar 2023 Yuan Bi, Zhongliang Jiang, Ricarda Clarenbach, Reza Ghotbi, Angelos Karlas, Nassir Navab

We validate the generalizability of the proposed domain-independent segmentation approach on several datasets with varying parameters and machines.

Anatomy Disentanglement +5

Location-Free Scene Graph Generation

1 code implementation20 Mar 2023 Ege Özsoy, Felix Holm, Mahdi Saleh, Tobias Czempiel, Chantal Pellegrini, Nassir Navab, Benjamin Busam

Scene Graph Generation (SGG) is a visual understanding task, aiming to describe a scene as a graph of entities and their relationships with each other.

Graph Generation Image Retrieval +3

Unsupervised Traffic Scene Generation with Synthetic 3D Scene Graphs

no code implementations15 Mar 2023 Artem Savkin, Rachid Ellouze, Nassir Navab, Federico Tombari

Image synthesis driven by computer graphics achieved recently a remarkable realism, yet synthetic image data generated this way reveals a significant domain gap with respect to real-world data.

Autonomous Driving Image Generation +1

BEL: A Bag Embedding Loss for Transformer enhances Multiple Instance Whole Slide Image Classification

no code implementations2 Mar 2023 Daniel Sens, Ario Sadafi, Francesco Paolo Casale, Nassir Navab, Carsten Marr

Recent MIL approaches produce highly informative bag level representations by utilizing the transformer architecture's ability to model the dependencies between instances.

Image Classification Multiple Instance Learning +1

KST-Mixer: Kinematic Spatio-Temporal Data Mixer For Colon Shape Estimation

1 code implementation2 Feb 2023 Masahiro Oda, Kazuhiro Furukawa, Nassir Navab, Kensaku MORI

Kinematic data of a colonoscope and the colon, including positions and directions of their centerlines, are obtained using electromagnetic and depth sensors.

Lidar Upsampling with Sliced Wasserstein Distance

no code implementations31 Jan 2023 Artem Savkin, Yida Wang, Sebastian Wirkert, Nassir Navab, Federico Tombar

This in turn enables our method to employ a one-stage upsampling paradigm without the need for coarse and fine reconstruction.

Autonomous Driving Domain Adaptation +1

Ultra-NeRF: Neural Radiance Fields for Ultrasound Imaging

1 code implementation25 Jan 2023 Magdalena Wysocki, Mohammad Farid Azampour, Christine Eilers, Benjamin Busam, Mehrdad Salehi, Nassir Navab

In our work, we discuss direction-dependent changes in the scene and show that a physics-inspired rendering improves the fidelity of US image synthesis.

Image Generation NeRF +1

TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose Estimation

no code implementations CVPR 2023 Hanzhi Chen, Fabian Manhardt, Nassir Navab, Benjamin Busam

In this paper, we introduce neural texture learning for 6D object pose estimation from synthetic data and a few unlabelled real images.

6D Pose Estimation using RGB

SupeRGB-D: Zero-shot Instance Segmentation in Cluttered Indoor Environments

1 code implementation22 Dec 2022 Evin Pınar Örnek, Aravindhan K Krishnan, Shreekant Gayaka, Cheng-Hao Kuo, Arnie Sen, Nassir Navab, Federico Tombari

We introduce a zero-shot split for Tabletop Objects Dataset (TOD-Z) to enable this study and present a method that uses annotated objects to learn the ``objectness'' of pixels and generalize to unseen object categories in cluttered indoor environments.

Instance Segmentation Object +2

DisPositioNet: Disentangled Pose and Identity in Semantic Image Manipulation

no code implementations10 Nov 2022 Azade Farshad, Yousef Yeganeh, Helisa Dhamo, Federico Tombari, Nassir Navab

Graph representation of objects and their relations in a scene, known as a scene graph, provides a precise and discernible interface to manipulate a scene by modifying the nodes or the edges in the graph.

Disentanglement Image Manipulation

Improved Techniques for the Conditional Generative Augmentation of Clinical Audio Data

no code implementations5 Nov 2022 Mane Margaryan, Matthias Seibold, Indu Joshi, Mazda Farshad, Philipp Fürnstahl, Nassir Navab

In contrast to previously proposed fully convolutional models, the proposed model implements residual Squeeze and Excitation modules in the generator architecture.

Data Augmentation

What can we learn about a generated image corrupting its latent representation?

no code implementations12 Oct 2022 Agnieszka Tomczak, Aarushi Gupta, Slobodan Ilic, Nassir Navab, Shadi Albarqouni

The purpose of this work is to investigate the hypothesis that we can predict image quality based on its latent representation in the GANs bottleneck.

Image-to-Image Translation Liver Segmentation

Segmenting Known Objects and Unseen Unknowns without Prior Knowledge

1 code implementation ICCV 2023 Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari

By doing so, for the first time in panoptic segmentation with unknown objects, our U3HS is trained without unknown categories, reducing assumptions and leaving the settings as unconstrained as in real-life scenarios.

Instance Segmentation Object Detection +3

Towards Autonomous Atlas-based Ultrasound Acquisitions in Presence of Articulated Motion

1 code implementation10 Aug 2022 Zhongliang Jiang, Yuan Gao, Le Xie, Nassir Navab

Robotic ultrasound (US) imaging aims at overcoming some of the limitations of free-hand US examinations, e. g. difficulty in guaranteeing intra- and inter-operator repeatability.

DA$^2$ Dataset: Toward Dexterity-Aware Dual-Arm Grasping

no code implementations31 Jul 2022 Guangyao Zhai, Yu Zheng, Ziwei Xu, Xin Kong, Yong liu, Benjamin Busam, Yi Ren, Nassir Navab, Zhengyou Zhang

In this paper, we introduce DA$^2$, the first large-scale dual-arm dexterity-aware dataset for the generation of optimal bimanual grasping pairs for arbitrary large objects.

Speckle2Speckle: Unsupervised Learning of Ultrasound Speckle Filtering Without Clean Data

1 code implementation31 Jul 2022 Rüdiger Göbl, Christoph Hennersperger, Nassir Navab

To enable this, we make use of realistic ultrasound simulation techniques that allow for instantiation of several independent speckle realizations that represent the exact same tissue, thus allowing for the application of image reconstruction techniques that work with pairs of differently corrupted data.

Image Reconstruction

CloudAttention: Efficient Multi-Scale Attention Scheme For 3D Point Cloud Learning

no code implementations31 Jul 2022 Mahdi Saleh, Yige Wang, Nassir Navab, Benjamin Busam, Federico Tombari

The proposed hierarchical model achieves state-of-the-art shape classification in mean accuracy and yields results on par with the previous segmentation methods while requiring significantly fewer computations.

Scene Segmentation Segmentation

Spotlight on nerves: Portable multispectral optoacoustic imaging of peripheral nerve vascularization and morphology

no code implementations28 Jul 2022 Dominik Jüstel, Hedwig Irl, Florian Hinterwimmer, Christoph Dehner, Walter Simson, Nassir Navab, Gerhard Schneider, Vasilis Ntziachristos

Various morphological and functional parameters of peripheral nerves and their vascular supply are indicative of pathological changes due to injury or disease.

Exploiting Diversity of Unlabeled Data for Label-Efficient Semi-Supervised Active Learning

no code implementations25 Jul 2022 Felix Buchert, Nassir Navab, Seong Tae Kim

By considering the consistency information with the diversity in the consistency-based embedding scheme, the proposed method could select more informative samples for labeling in the semi-supervised learning setting.

Active Learning Diversity +1

Unsupervised pre-training of graph transformers on patient population graphs

2 code implementations21 Jul 2022 Chantal Pellegrini, Nassir Navab, Anees Kazi

We find that our proposed pre-training methods help in modeling the data at a patient and population level and improve performance in different fine-tuning tasks on all datasets.

Language Modeling Language Modelling +3

CACTUSS: Common Anatomical CT-US Space for US examinations

1 code implementation18 Jul 2022 Yordanka Velikova, Walter Simson, Mehrdad Salehi, Mohammad Farid Azampour, Philipp Paprottka, Nassir Navab

Abdominal aortic aneurysm (AAA) is a vascular disease in which a section of the aorta enlarges, weakening its walls and potentially rupturing the vessel.

Diagnostic Image-to-Image Translation +1

Shape-Aware Masking for Inpainting in Medical Imaging

no code implementations12 Jul 2022 Yousef Yeganeh, Azade Farshad, Nassir Navab

Inpainting has recently been proposed as a successful deep learning technique for unsupervised medical image model discovery.

Anatomy Image Reconstruction +1

Adaptive Personlization in Federated Learning for Highly Non-i.i.d. Data

no code implementations7 Jul 2022 Yousef Yeganeh, Azade Farshad, Johann Boschmann, Richard Gaus, Maximilian Frantzen, Nassir Navab

Although most medical centers conduct similar medical imaging tasks, their differences, such as specializations, number of patients, and devices, lead to distinctive data distributions.

Clustering Federated Learning +3

Unsupervised Cross-Domain Feature Extraction for Single Blood Cell Image Classification

1 code implementation1 Jul 2022 Raheleh Salehi, Ario Sadafi, Armin Gruber, Peter Lienemann, Nassir Navab, Shadi Albarqouni, Carsten Marr

Here, we propose a cross-domain adapted autoencoder to extract features in an unsupervised manner on three different datasets of single white blood cells scanned from peripheral blood smears.

Image Classification Prognosis

DeStripe: A Self2Self Spatio-Spectral Graph Neural Network with Unfolded Hessian for Stripe Artifact Removal in Light-sheet Microscopy

no code implementations27 Jun 2022 Yu Liu, Kurt Weiss, Nassir Navab, Carsten Marr, Jan Huisken, Tingying Peng

Light-sheet fluorescence microscopy (LSFM) is a cutting-edge volumetric imaging technique that allows for three-dimensional imaging of mesoscopic samples with decoupled illumination and detection paths.

Denoising Graph Neural Network

U-PET: MRI-based Dementia Detection with Joint Generation of Synthetic FDG-PET Images

no code implementations16 Jun 2022 Marcel Kollovieh, Matthias Keicher, Stephan Wunderlich, Hendrik Burwinkel, Thomas Wendler, Nassir Navab

To this end, we propose a multi-task method based on U-Net that takes T1-weighted MR images as an input to generate synthetic FDG-PET images and classifies the dementia progression of the patient into cognitive normal (CN), cognitive impairment (MCI), and AD.

Virtual embeddings and self-consistency for self-supervised learning

no code implementations13 Jun 2022 Tariq Bdair, Hossam Abdelhamid, Nassir Navab, Shadi Albarqouni

We validate TriMix on eight benchmark datasets consisting of natural and medical images with an improvement of 2. 71% and 0. 41% better than the second-best models for both data types.

Data Augmentation Representation Learning +1

BFS-Net: Weakly Supervised Cell Instance Segmentation from Bright-Field Microscopy Z-Stacks

no code implementations9 Jun 2022 Shervin Dehghani, Benjamin Busam, Nassir Navab, Ali Nasseri

Despite its broad availability, volumetric information acquisition from Bright-Field Microscopy (BFM) is inherently difficult due to the projective nature of the acquisition process.

Instance Segmentation Semantic Segmentation

VesNet-RL: Simulation-based Reinforcement Learning for Real-World US Probe Navigation

1 code implementation10 May 2022 Yuan Bi, Zhongliang Jiang, Yuan Gao, Thomas Wendler, Angelos Karlas, Nassir Navab

The results demonstrate that proposed approach can effectively and accurately navigate the probe towards the longitudinal view of vessels.

Diagnostic Navigate +2

Affective Medical Estimation and Decision Making via Visualized Learning and Deep Learning

1 code implementation9 May 2022 Mohammad Eslami, Solale Tabarestani, Ehsan Adeli, Glyn Elwyn, Tobias Elze, Mengyu Wang, Nazlee Zebardast, Nassir Navab, Malek Adjouadi

With the advent of sophisticated machine learning (ML) techniques and the promising results they yield, especially in medical applications, where they have been investigated for different tasks to enhance the decision-making process.

Decision Making Memorization +2

Y-Net: A Spatiospectral Dual-Encoder Networkfor Medical Image Segmentation

1 code implementation15 Apr 2022 Azade Farshad, Yousef Yeganeh, Peter Gehlbach, Nassir Navab

Automated segmentation of retinal optical coherence tomography (OCT) images has become an important recent direction in machine learning for medical applications.

 Ranked #1 on Retinal OCT Layer Segmentation on Duke SD-OCT (using extra training data)

Image Segmentation Medical Image Segmentation +3

Analyzing the Effects of Handling Data Imbalance on Learned Features from Medical Images by Looking Into the Models

no code implementations4 Apr 2022 Ashkan Khakzar, Yawei Li, Yang Zhang, Mirac Sanisoglu, Seong Tae Kim, Mina Rezaei, Bernd Bischl, Nassir Navab

One challenging property lurking in medical datasets is the imbalanced data distribution, where the frequency of the samples between the different classes is not balanced.

Graph-in-Graph (GiG): Learning interpretable latent graphs in non-Euclidean domain for biological and healthcare applications

no code implementations1 Apr 2022 Kamilia Mullakaeva, Luca Cosmo, Anees Kazi, Seyed-Ahmad Ahmadi, Nassir Navab, Michael M. Bronstein

In this work, we propose Graph-in-Graph (GiG), a neural network architecture for protein classification and brain imaging applications that exploits the graph representation of the input data samples and their latent relation.

Property Prediction

FlexR: Few-shot Classification with Language Embeddings for Structured Reporting of Chest X-rays

no code implementations29 Mar 2022 Matthias Keicher, Kamilia Zaripova, Tobias Czempiel, Kristina Mach, Ashkan Khakzar, Nassir Navab

The automation of chest X-ray reporting has garnered significant interest due to the time-consuming nature of the task.

Intelligent Masking: Deep Q-Learning for Context Encoding in Medical Image Analysis

1 code implementation25 Mar 2022 Mojtaba Bahrami, Mahsa Ghorbani, Nassir Navab

We show that training the agent against the prediction model can significantly improve the semantic features extracted for downstream classification tasks.

Medical Image Analysis Q-Learning +1

Unsupervised Pre-Training on Patient Population Graphs for Patient-Level Predictions

2 code implementations23 Mar 2022 Chantal Pellegrini, Anees Kazi, Nassir Navab

We test our method on two medical datasets of patient records, TADPOLE and MIMIC-III, including imaging and non-imaging features and different prediction tasks.

Disease Prediction Imputation +2

4D-OR: Semantic Scene Graphs for OR Domain Modeling

1 code implementation22 Mar 2022 Ege Özsoy, Evin Pınar Örnek, Ulrich Eck, Tobias Czempiel, Federico Tombari, Nassir Navab

Towards this goal, for the first time, we propose using semantic scene graphs (SSG) to describe and summarize the surgical scene.

Scene Graph Generation

Conditional Generative Data Augmentation for Clinical Audio Datasets

no code implementations22 Mar 2022 Matthias Seibold, Armando Hoch, Mazda Farshad, Nassir Navab, Philipp Fürnstahl

In this work, we propose a novel data augmentation method for clinical audio datasets based on a conditional Wasserstein Generative Adversarial Network with Gradient Penalty (cWGAN-GP), operating on log-mel spectrograms.

Data Augmentation Generative Adversarial Network

Surgical Workflow Recognition: from Analysis of Challenges to Architectural Study

no code implementations17 Mar 2022 Tobias Czempiel, Aidean Sharghi, Magdalini Paschali, Nassir Navab, Omid Mohareri

Algorithmic surgical workflow recognition is an ongoing research field and can be divided into laparoscopic (Internal) and operating room (External) analysis.

Know your sensORs -- A Modality Study For Surgical Action Classification

no code implementations16 Mar 2022 Lennart Bastian, Tobias Czempiel, Christian Heiliger, Konrad Karcz, Ulrich Eck, Benjamin Busam, Nassir Navab

Existing datasets from OR room cameras are thus far limited in size or modalities acquired, leaving it unclear which sensor modalities are best suited for tasks such as recognizing surgical action from videos.

Action Classification Action Recognition +1

From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction

no code implementations15 Mar 2022 Evin Pınar Örnek, Shristi Mudgal, Johanna Wald, Yida Wang, Nassir Navab, Federico Tombari

There have been numerous recently proposed methods for monocular depth prediction (MDP) coupled with the equally rapid evolution of benchmarking tools.

3D geometry Benchmarking +2

GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting

3 code implementations CVPR 2022 Yan Di, Ruida Zhang, Zhiqiang Lou, Fabian Manhardt, Xiangyang Ji, Nassir Navab, Federico Tombari

While 6D object pose estimation has recently made a huge leap forward, most methods can still only handle a single or a handful of different objects, which limits their applications.

 Ranked #1 on 6D Pose Estimation on LineMOD (Mean ADD-S metric)

6D Pose Estimation 6D Pose Estimation using RGB +3

Transformers in Action: Weakly Supervised Action Segmentation

no code implementations14 Jan 2022 John Ridley, Huseyin Coskun, David Joseph Tan, Nassir Navab, Federico Tombari

The video action segmentation task is regularly explored under weaker forms of supervision, such as transcript supervision, where a list of actions is easier to obtain than dense frame-wise labels.

Action Segmentation

A Variational Bayesian Method for Similarity Learning in Non-Rigid Image Registration

1 code implementation CVPR 2022 Daniel Grzech, Mohammad Farid Azampour, Ben Glocker, Julia Schnabel, Nassir Navab, Bernhard Kainz, Loïc le Folgoc

We propose a novel variational Bayesian formulation for diffeomorphic non-rigid registration of medical images, which learns in an unsupervised way a data-specific similarity metric.

Image Registration

Wild ToFu: Improving Range and Quality of Indirect Time-of-Flight Depth with RGB Fusion in Challenging Environments

no code implementations7 Dec 2021 HyunJun Jung, Nikolas Brasch, Ales Leonardis, Nassir Navab, Benjamin Busam

Indirect Time-of-Flight (I-ToF) imaging is a widespread way of depth estimation for mobile devices due to its small size and affordable price.

Depth Estimation Depth Prediction

Object-aware Monocular Depth Prediction with Instance Convolutions

1 code implementation2 Dec 2021 Enis Simsar, Evin Pınar Örnek, Fabian Manhardt, Helisa Dhamo, Nassir Navab, Federico Tombari

With the advent of deep learning, estimating depth from a single RGB image has recently received a lot of attention, being capable of empowering many different applications ranging from path planning for robotics to computational cinematography.

Depth Estimation Depth Prediction +3

MIGS: Meta Image Generation from Scene Graphs

1 code implementation22 Oct 2021 Azade Farshad, Sabrina Musatian, Helisa Dhamo, Nassir Navab

We propose MIGS (Meta Image Generation from Scene Graphs), a meta-learning based approach for few-shot image generation from graphs that enables adapting the model to different scenes and increases the image quality by training on diverse sets of tasks.

Diversity Image Generation from Scene Graphs +2

Semantic Image Alignment for Vehicle Localization

no code implementations8 Oct 2021 Markus Herb, Matthias Lemberger, Marcel M. Schmitt, Alexander Kurz, Tobias Weiherer, Nassir Navab, Federico Tombari

Accurate and reliable localization is a fundamental requirement for autonomous vehicles to use map information in higher-level tasks such as navigation or planning.

Autonomous Vehicles Semantic Segmentation +1

Adversarial Domain Feature Adaptation for Bronchoscopic Depth Estimation

no code implementations24 Sep 2021 Mert Asim Karaoglu, Nikolas Brasch, Marijn Stollenga, Wolfgang Wein, Nassir Navab, Federico Tombari, Alexander Ladikos

The results of our experiments show that the proposed method improves the network's performance on real images by a considerable margin and can be employed in 3D reconstruction pipelines.

3D Reconstruction Depth Estimation

MetaMedSeg: Volumetric Meta-learning for Few-Shot Organ Segmentation

1 code implementation18 Sep 2021 Anastasia Makarevich, Azade Farshad, Vasileios Belagiannis, Nassir Navab

In this work, we present MetaMedSeg, a gradient-based meta-learning algorithm that redefines the meta-learning task for the volumetric medical data with the goal to capture the variety between the slices.

Image Segmentation Medical Image Segmentation +3

Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs

1 code implementation ICCV 2021 Helisa Dhamo, Fabian Manhardt, Nassir Navab, Federico Tombari

Scene graphs are representations of a scene, composed of objects (nodes) and inter-object relationships (edges), proven to be particularly suited for this task, as they allow for semantic control on the generated content.

Object

SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

2 code implementations ICCV 2021 Yan Di, Fabian Manhardt, Gu Wang, Xiangyang Ji, Nassir Navab, Federico Tombari

Directly regressing all 6 degrees-of-freedom (6DoF) for the object pose (e. g. the 3D rotation and translation) in a cluttered environment from a single RGB image is a challenging problem.

6D Pose Estimation 6D Pose Estimation using RGB +1

Unconditional Scene Graph Generation

no code implementations ICCV 2021 Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari

Scene graphs, composed of nodes as objects and directed-edges as relationships among objects, offer an alternative representation of a scene that is more semantically grounded than images.

Anomaly Detection Graph Generation +3

Tracked 3D Ultrasound and Deep Neural Network-based Thyroid Segmentation reduce Interobserver Variability in Thyroid Volumetry

no code implementations10 Aug 2021 Markus Krönke, Christine Eilers, Desislava Dimova, Melanie Köhler, Gabriel Buschner, Lilit Mirzojan, Lemonia Konstantinidou, Marcus R. Makowski, James Nagarajah, Nassir Navab, Wolfgang Weber, Thomas Wendler

Conclusion: Tracked 3D ultrasound combined with a CNN segmentation significantly reduces interobserver variability in thyroid volumetry and increases the accuracy of the measurements with shorter acquisition times.

R4Dyn: Exploring Radar for Self-Supervised Monocular Depth Estimation of Dynamic Scenes

no code implementations10 Aug 2021 Stefano Gasperini, Patrick Koch, Vinzenz Dallabetta, Nassir Navab, Benjamin Busam, Federico Tombari

While self-supervised monocular depth estimation in driving scenarios has achieved comparable performance to supervised approaches, violations of the static world assumption can still lead to erroneous depth predictions of traffic participants, posing a potential safety issue.

Autonomous Vehicles Monocular Depth Estimation

U-GAT: Multimodal Graph Attention Network for COVID-19 Outcome Prediction

no code implementations29 Jul 2021 Matthias Keicher, Hendrik Burwinkel, David Bani-Harouni, Magdalini Paschali, Tobias Czempiel, Egon Burian, Marcus R. Makowski, Rickmer Braren, Nassir Navab, Thomas Wendler

Specifically, we introduce a multimodal similarity metric to build a population graph for clustering patients and an image-based end-to-end Graph Attention Network to process this graph and predict the COVID-19 patient outcomes: admission to ICU, need for ventilation and mortality.

Clustering Decision Making +3

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

1 code implementation26 Jul 2021 Daniil Pakhomov, Sanchit Hira, Narayani Wagle, Kemar E. Green, Nassir Navab

Derived regions are consistent across different images and coincide with human-defined semantic classes on some datasets.

Image Segmentation Segmentation +1

Deep Direct Volume Rendering: Learning Visual Feature Mappings From Exemplary Images

no code implementations9 Jun 2021 Jakob Weiss, Nassir Navab

In this work, we introduce Deep Direct Volume Rendering (DeepDVR), a generalization of DVR that allows for the integration of deep neural networks into the DVR algorithm.

Colorization Inverse Rendering +1

Multimodal Semantic Scene Graphs for Holistic Modeling of Surgical Procedures

no code implementations9 Jun 2021 Ege Özsoy, Evin Pınar Örnek, Ulrich Eck, Federico Tombari, Nassir Navab

We then use MSSG to introduce a dynamically generated graphical user interface tool for surgical procedure analysis which could be used for many applications including process optimization, OR design and automatic report generation.

Cannot find the paper you are looking for? You can Submit a new open access paper.