no code implementations • 7 Sep 2024 • Mohammadmahdi Honarmand, Muhammad Abdullah Jamal, Omid Mohareri
While existing surgical VL models primarily rely on contrastive learning, we propose a more comprehensive approach to capture the intricate temporal dynamics and align video with language.
no code implementations • 5 Aug 2024 • Muhammad Abdullah Jamal, Omid Mohareri
In the second stage, we further pre-train the model using masked autoencoding and denoising/noise prediction used in diffusion models.
no code implementations • 29 Jul 2024 • Muhammad Abdullah Jamal, Omid Mohareri
Surgical scene understanding is a key technical component for enabling intelligent and context aware systems that can transform various aspects of surgical interventions.
no code implementations • 7 Jul 2024 • Idris Hamoud, Alexandros Karargyris, Aidean Sharghi, Omid Mohareri, Nicolas Padoy
Semantic segmentation and activity classification are key components to creating intelligent surgical systems able to understand and assist clinical workflow.
no code implementations • 23 Jan 2024 • Ali Mottaghi, Mohammad Abdullah Jamal, Serena Yeung, Omid Mohareri
Our method's effectiveness is validated through extensive experiments on benchmark datasets such as DomainNet, Office-Home, and VisDA-C, where AdaEmbed consistently outperforms all the baselines, setting a new state of the art for SSDA.
2 code implementations • 31 Dec 2023 • Dimitrios Psychogyios, Emanuele Colleoni, Beatrice van Amsterdam, Chih-Yang Li, Shu-Yu Huang, Yuchong Li, Fucang Jia, Baosheng Zou, Guotai Wang, Yang Liu, Maxence Boels, Jiayu Huo, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin, Mengya Xu, An Wang, Yanan Wu, Long Bai, Hongliang Ren, Atsushi Yamada, Yuriko Harai, Yuto Ishikawa, Kazuyuki Hayashi, Jente Simoens, Pieter DeBacker, Francesco Cisternino, Gabriele Furnari, Alex Mottrie, Federica Ferraguti, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, Soohee Kim, Seung Hyun Lee, Kyu Eun Lee, Hyoun-Joong Kong, Kui Fu, Chao Li, Shan An, Stefanie Krell, Sebastian Bodenstedt, Nicolas Ayobi, Alejandra Perez, Santiago Rodriguez, Juanita Puentes, Pablo Arbelaez, Omid Mohareri, Danail Stoyanov
Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems.
no code implementations • 19 Dec 2023 • Idris Hamoud, Muhammad Abdullah Jamal, Vinkle Srivastav, Didier Mutter, Nicolas Padoy, Omid Mohareri
Surgical robotics holds much promise for improving patient safety and clinician experience in the Operating Room (OR).
no code implementations • 17 Oct 2023 • Adam Schmidt, Omid Mohareri, Simon DiMaio, Michael C. Yip, Septimiu E. Salcudean
In this review, we provide an update to the field of camera-based tracking and scene mapping in surgery and diagnostics in medical computer vision.
no code implementations • 28 Sep 2023 • Adam Schmidt, Omid Mohareri, Simon DiMaio, Septimiu E. Salcudean
STIR comprises hundreds of stereo video clips in both in-vivo and ex-vivo scenes with start and end points labelled in the IR spectrum.
no code implementations • 26 Sep 2023 • Muhammad Abdullah Jamal, Omid Mohareri
We present a new pre-training strategy called M$^{3}$3D ($\underline{M}$ulti-$\underline{M}$odal $\underline{M}$asked $\underline{3D}$) built based on Multi-modal masked autoencoders that can leverage 3D priors and learned cross-modal representations in RGB-D data.
no code implementations • 19 May 2023 • Muhammad Abdullah Jamal, Omid Mohareri
There has been a growing interest in using deep learning models for processing long surgical videos, in order to automatically detect clinical/operational activities and extract metrics that can enable workflow efficiency tools and applications.
no code implementations • 10 May 2023 • Adam Schmidt, Omid Mohareri, Simon DiMaio, Septimiu E. Salcudean
SENDD enables multiple downstream applications that require estimation of 3D motion in stereo endoscopy.
no code implementations • 16 Jul 2022 • Muhammad Abdullah Jamal, Omid Mohareri
Data-driven approaches to assist operating room (OR) workflow analysis depend on large curated datasets that are time consuming and expensive to collect.
no code implementations • 7 Jul 2022 • Ali Mottaghi, Aidean Sharghi, Serena Yeung, Omid Mohareri
We propose a new domain adaptation method to improve the performance of the surgical activity recognition model in a new operating room for which we only have unlabeled videos.
no code implementations • 5 May 2022 • Zhuohong He, Ali Mottaghi, Aidean Sharghi, Muhammad Abdullah Jamal, Omid Mohareri
In this paper, we investigate many state-of-the-art backbones and temporal models to find architectures that yield the strongest performance for surgical activity recognition.
no code implementations • 17 Mar 2022 • Tobias Czempiel, Aidean Sharghi, Magdalini Paschali, Nassir Navab, Omid Mohareri
Algorithmic surgical workflow recognition is an ongoing research field and can be divided into laparoscopic (Internal) and operating room (External) analysis.
no code implementations • 29 Jun 2020 • Aidean Sharghi, Helene Haugerud, Daniel Oh, Omid Mohareri
Automatic recognition of surgical activities in the operating room (OR) is a key technology for creating next generation intelligent surgical devices and workflow monitoring/support systems.
no code implementations • 20 Mar 2020 • Zhaoshuo Li, Amirreza Shaban, Jean-Gabriel Simard, Dinesh Rabindran, Simon DiMaio, Omid Mohareri
Purpose: We describe a 3D multi-view perception system for the da Vinci surgical system to enable Operating room (OR) scene understanding and context awareness.