1 code implementation • 19 Feb 2022 • Xingtong Liu, Zhaoshuo Li, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath
In endoscopy, many applications (e. g., surgical navigation) would benefit from a real-time method that can simultaneously track the endoscope and reconstruct the dense 3D geometry of the observed anatomy from a monocular endoscopic video.
1 code implementation • Conference On Robot Learning (CoRL) 2021 • Andrew Hundt, Aditya Murali, Priyanka Hubli, Ran Liu, Nakul Gopalan, Matthew Gombolay, Gregory D. Hager
Based upon this insight, we propose See-SPOT-Run (SSR), a new computational approach to robot learning that enables a robot to complete a variety of real robot tasks in novel problem domains without task-specific training.
no code implementations • 15 Oct 2021 • Weiyao Wang, Marin Kobilarov, Gregory D. Hager
Recent advances in latent space dynamics model from pixels show promising progress in vision-based model predictive control (MPC).
1 code implementation • CVPR 2021 • Xingtong Liu, Benjamin D. Killeen, Ayushi Sinha, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath
Extracting geometric features from 3D models is a common first step in applications such as 3D registration, tracking, and scene flow estimation.
no code implementations • 20 May 2021 • Will Pryor, Yotam Barnoy, Suraj Raval, Xiaolong Liu, Lamar Mair, Daniel Lerner, Onder Erin, Gregory D. Hager, Yancy Diaz-Mercado, Axel Krieger
Our localization method combines neural network-based segmentation and classical techniques, and we are able to consistently locate our needle with 0. 73 mm RMS error in clean environments and 2. 72 mm RMS error in challenging environments with blood and occlusion.
1 code implementation • 18 May 2021 • Gordon Christie, Kevin Foster, Shea Hagstrom, Gregory D. Hager, Myron Z. Brown
Current methods for Earth observation tasks such as semantic mapping, map alignment, and change detection rely on near-nadir images; however, often the first available images in response to dynamic world events such as natural disasters are oblique.
no code implementations • ICCV 2021 • Tae Soo Kim, Jonathan Jones, Gregory D. Hager
We present a dual-pathway approach for recognizing fine-grained interactions from videos.
no code implementations • 24 Feb 2021 • Daniel Neimark, Omri Bar, Maya Zohar, Gregory D. Hager, Dotan Asselmann
Such pre-training enables TSAN to learn workflow steps of a new laparoscopic procedure type from only a small number of labeled samples from the target procedure.
no code implementations • 3 Dec 2020 • Tae Soo Kim, Gregory D. Hager
We present a general framework for compositional action recognition -- i. e. action recognition where the labels are composed out of simpler components such as subjects, atomic-actions and objects.
no code implementations • 2 Dec 2020 • Jonathan D. Jones, Cathryn Cortesa, Amy Shelton, Barbara Landau, Sanjeev Khudanpur, Gregory D. Hager
In this paper we address the task of recognizing assembly actions as a structure (e. g. a piece of furniture or a toy block tower) is built up from a set of primitive objects.
no code implementations • 30 Nov 2020 • Qihao Liu, Weichao Qiu, Weiyao Wang, Gregory D. Hager, Alan L. Yuille
We propose an unsupervised vision-based system to estimate the joint configurations of the robot arm from a sequence of RGB or RGB-D images without knowing the model a priori, and then adapt it to the task of category-independent articulated object pose estimation.
no code implementations • 16 Nov 2020 • Ji Woong Kim, Changyan He, Muller Urias, Peter Gehlbach, Gregory D. Hager, Iulian Iordachita, Marin Kobilarov
We show that the network can reliably navigate a needle surgical tool to various desired locations within 137 microns accuracy in physical experiments and 94 microns in simulation on average, and generalizes well to unseen situations such as in the presence of auxiliary surgical tools, variable eye backgrounds, and brightness conditions.
no code implementations • 30 Oct 2020 • Lena Maier-Hein, Matthias Eisenmann, Duygu Sarikaya, Keno März, Toby Collins, Anand Malpani, Johannes Fallert, Hubertus Feussner, Stamatia Giannarou, Pietro Mascagni, Hirenkumar Nakawala, Adrian Park, Carla Pugh, Danail Stoyanov, Swaroop S. Vedula, Kevin Cleary, Gabor Fichtinger, Germain Forestier, Bernard Gibaud, Teodor Grantcharov, Makoto Hashizume, Doreen Heckmann-Nötzel, Hannes G. Kenngott, Ron Kikinis, Lars Mündermann, Nassir Navab, Sinan Onogur, Raphael Sznitman, Russell H. Taylor, Minu D. Tizabi, Martin Wagner, Gregory D. Hager, Thomas Neumuth, Nicolas Padoy, Justin Collins, Ines Gockel, Jan Goedeke, Daniel A. Hashimoto, Luc Joyeux, Kyle Lam, Daniel R. Leff, Amin Madani, Hani J. Marcus, Ozanan Meireles, Alexander Seitel, Dogu Teber, Frank Ückert, Beat P. Müller-Stich, Pierre Jannin, Stefanie Speidel
We further complement this technical perspective with (4) a review of currently available SDS products and the translational progress from academia and (5) a roadmap for faster clinical translation and exploitation of the full potential of SDS, based on an international multi-round Delphi process.
no code implementations • 11 Sep 2020 • Haomin Chen, Shun Miao, Daguang Xu, Gregory D. Hager, Adam P. Harrison
To this end, we present a deep HMLC approach for CXR CAD.
1 code implementation • 27 Aug 2020 • David Z. Li, Masaru Ishii, Russell H. Taylor, Gregory D. Hager, Ayushi Sinha
We use three different methods to manipulate these latent representations in order to predict tool presence in each frame.
no code implementations • ECCV 2020 • Haomin Chen, Yirui Wang, Kang Zheng, Weijian Li, Chi-Tung Cheng, Adam P. Harrison, Jing Xiao, Gregory D. Hager, Le Lu, Chien-Hung Liao, Shun Miao
A new contrastive feature learning component in our Siamese network is designed to optimize the deep image features being more salient corresponding to the underlying semantic asymmetries (caused by pelvic fracture occurrences).
1 code implementation • CVPR 2020 • Gordon Christie, Rodrigo Rene Rai Munoz Abujder, Kevin Foster, Shea Hagstrom, Gregory D. Hager, Myron Z. Brown
An object's geocentric pose, defined as the height above ground and orientation with respect to gravity, is a powerful representation of real-world structure for object detection, segmentation, and localization tasks using RGBD images.
no code implementations • 5 Jun 2020 • Mathias Unberath, Kimia Ghobadi, Scott Levin, Jeremiah Hinson, Gregory D. Hager
The COVID-19 crisis has brought about new clinical questions, new workflows, and accelerated distributed healthcare needs.
1 code implementation • CVPR 2020 • Helisa Dhamo, Azade Farshad, Iro Laina, Nassir Navab, Gregory D. Hager, Federico Tombari, Christian Rupprecht
In our work, we address the novel problem of image manipulation from scene graphs, in which a user can edit images by merely applying changes in the nodes or edges of a semantic graph that is generated from the image.
1 code implementation • 18 Mar 2020 • Xingtong Liu, Maia Stiber, Jindan Huang, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath
Reconstructing accurate 3D surface models of sinus anatomy directly from an endoscopic video is a promising avenue for cross-sectional and longitudinal analysis to better understand the relationship between sinus anatomy and surgical outcomes.
1 code implementation • CVPR 2020 • Xingtong Liu, Yiping Zheng, Benjamin Killeen, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath
In direct comparison to recent local and dense descriptors on an in-house sinus endoscopy dataset, we demonstrate that our proposed dense descriptor can generalize to unseen patients and scopes, thereby largely improving the performance of Structure from Motion (SfM) in terms of model density and completeness.
no code implementations • MIDL 2019 • Maya Zohar, Omri Bar, Daniel Neimark, Gregory D. Hager, Dotan Asselmann
Training a deep model to detect out-of-body and non-relevant segments in surgical videos requires suitable labeling.
no code implementations • 9 Dec 2019 • Pengfei Li, Weichao Qiu, Michael Peven, Gregory D. Hager, Alan L. Yuille
Scene context is a powerful constraint on the geometry of objects within the scene in cases, such as surveillance, where the camera geometry is unknown and image quality may be poor.
no code implementations • 8 Dec 2019 • Tae Soo Kim, Jonathan D. Jones, Michael Peven, Zihao Xiao, Jin Bai, Yi Zhang, Weichao Qiu, Alan Yuille, Gregory D. Hager
There are many realistic applications of activity recognition where the set of potential activity descriptions is combinatorially large.
no code implementations • 3 Dec 2019 • Yi Zhang, Xinyue Wei, Weichao Qiu, Zihao Xiao, Gregory D. Hager, Alan Yuille
In this paper, we propose the Randomized Simulation as Augmentation (RSA) framework which augments real-world training data with synthetic data to improve the robustness of action recognition networks.
1 code implementation • 19 Nov 2019 • Michael Peven, Gregory D. Hager, Austin Reiter
In this work, we introduce a novel representation of motion as a voxelized 3D vector field and demonstrate how it can be used to improve performance of action recognition networks.
1 code implementation • 25 Sep 2019 • Andrew Hundt, Benjamin Killeen, Nicholas Greene, Hongtao Wu, Heeyeon Kwon, Chris Paxton, Gregory D. Hager
We are able to create real stacks in 100% of trials with 61% efficiency and real rows in 100% of trials with 59% efficiency by directly loading the simulation-trained model on the real robot with no additional real-world fine-tuning.
no code implementations • 6 Sep 2019 • Xingtong Liu, Ayushi Sinha, Masaru Ishii, Gregory D. Hager, Russell H. Taylor, Mathias Unberath
We present a self-supervised learning-based pipeline for dense 3D reconstruction from full-length monocular endoscopic videos without a priori modeling of anatomy or shading.
no code implementations • 20 Jul 2019 • Robert DiPietro, Gregory D. Hager
Prior work has demonstrated the feasibility of automated activity recognition in robot-assisted surgery from motion data.
2 code implementations • 23 Mar 2019 • Andrew Hundt, Varun Jain, Gregory D. Hager
We have performed an in-depth analysis to identify limitations in a widely used search space and a recent architecture search method, Differentiable Architecture Search (DARTS).
1 code implementation • 20 Feb 2019 • Xingtong Liu, Ayushi Sinha, Masaru Ishii, Gregory D. Hager, Austin Reiter, Russell H. Taylor, Mathias Unberath
We present a self-supervised approach to training convolutional neural networks for dense depth estimation from monocular endoscopy data without a priori modeling of anatomy or shading.
1 code implementation • 21 Nov 2018 • Marc Bosch, Kevin Foster, Gordon Christie, Sean Wang, Gregory D. Hager, Myron Brown
The increasingly common use of incidental satellite images for stereo reconstruction versus rigidly tasked binocular or trinocular coincident collection is helping to enable timely global-scale 3D mapping; however, reliable stereo correspondence from multi-date image pairs remains very challenging due to seasonal appearance differences and scene change.
3 code implementations • 27 Oct 2018 • Andrew Hundt, Varun Jain, Chia-Hung Lin, Chris Paxton, Gregory D. Hager
We show that a mild relaxation of the task and workspace constraints implicit in existing object grasping datasets can cause neural network based grasping algorithms to fail on even a simple block stacking task when executed under more realistic circumstances.
1 code implementation • 28 Jun 2018 • Ayushi Sinha, Masaru Ishii, Russell H. Taylor, Gregory D. Hager, Austin Reiter
Several registration algorithms have been developed, many of which achieve high accuracy.
no code implementations • 8 Jun 2018 • Robert DiPietro, Gregory D. Hager
We show that it is possible to learn meaningful representations of surgical motion, without supervision, by learning to predict the future.
no code implementations • 8 Jun 2018 • Ayushi Sinha, Xingtong Liu, Austin Reiter, Masaru Ishii, Gregory D. Hager, Russell H. Taylor
Clinical examinations that involve endoscopic exploration of the nasal cavity and sinuses often do not have a reference image to provide structural context to the clinician.
1 code implementation • 30 Mar 2018 • Chris Paxton, Yotam Barnoy, Kapil Katyal, Raman Arora, Gregory D. Hager
In this work, we propose a neural network architecture and associated planning algorithm that (1) learns a representation of the world useful for generating prospective futures after the application of high-level actions, (2) uses this generative model to simulate the result of sequences of high-level actions in a variety of environments, and (3) uses this same representation to evaluate these actions and perform tree search to find a sequence of high-level actions in a new environment.
no code implementations • CVPR 2018 • Christian Rupprecht, Iro Laina, Nassir Navab, Gregory D. Hager, Federico Tombari
Interaction and collaboration between humans and intelligent machines has become increasingly important as machine learning methods move into real-world applications that involve end users.
no code implementations • ECCV 2018 • Chi Li, Jin Bai, Gregory D. Hager
To learn discriminative pose features, we integrate three new capabilities into a deep Convolutional Neural Network (CNN): an inference scheme that combines both classification and pose regression based on a uniform tessellation of the Special Euclidean group in three dimensions (SE(3)), the fusion of class priors into the training process via a tiled class map, and an additional regularization using deep supervision with an object mask.
no code implementations • 6 Mar 2018 • Kapil Katyal, Katie Popek, Chris Paxton, Joseph Moore, Kevin Wolfe, Philippe Burlina, Gregory D. Hager
In these situations, the robot's ability to reason about its future motion is often severely limited by sensor field of view (FOV).
no code implementations • 8 Jan 2018 • Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker
In this work, we explore an approach for injecting prior domain structure into neural network training by supervising hidden layers of a CNN with intermediate concepts that normally are not observed in practice.
no code implementations • ICLR 2018 • Sanjeev Kumar, Christian Rupprecht, Federico Tombari, Gregory D. Hager
We introduce a new approach to estimate continuous actions using actor-critic algorithms for reinforcement learning problems.
no code implementations • 8 Nov 2017 • Chris Paxton, Kapil Katyal, Christian Rupprecht, Raman Arora, Gregory D. Hager
Ideally, we would combine the ability of machine learning to leverage big data for learning the semantics of a task, while using techniques from task planning to reliably generalize to new environment.
1 code implementation • 24 Oct 2017 • Wentao Zhu, Xiang Xiang, Trac. D. Tran, Gregory D. Hager, Xiaohui Xie
Mass segmentation provides effective morphological features which are important for mass diagnosis.
2 code implementations • 11 Oct 2017 • Felix Jonathan, Chris Paxton, Gregory D. Hager
Accurate knowledge of object poses is crucial to successful robotic manipulation tasks, and yet most current approaches only work in laboratory settings.
Robotics
no code implementations • 13 Jul 2017 • Gregory D. Hager, Randal Bryant, Eric Horvitz, Maja Mataric, Vasant Honavar
Advances in Artificial Intelligence require progress across all of computer science.
no code implementations • 22 Mar 2017 • Chris Paxton, Vasumathi Raman, Gregory D. Hager, Marin Kobilarov
This paper investigates the ability of neural networks to learn both LTL constraints and control policies in order to generate task plans in complex environments.
Robotics
no code implementations • ICLR 2018 • Robert DiPietro, Christian Rupprecht, Nassir Navab, Gregory D. Hager
Recurrent neural networks (RNNs) have achieved state-of-the-art performance on many diverse tasks, from machine translation to surgical activity recognition, yet training RNNs to capture long-term dependencies remains difficult.
2 code implementations • 22 Feb 2017 • Feng Wang, Xiang Xiang, Chang Liu, Trac. D. Tran, Austin Reiter, Gregory D. Hager, Harry Quon, Jian Cheng, Alan L. Yuille
In this way, the expression intensity regression task can benefit from the rich feature representations trained on a huge amount of data for face verification.
no code implementations • CVPR 2017 • Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker
Monocular 3D object parsing is highly desirable in various scenarios including occlusion reasoning and holistic scene interpretation.
no code implementations • ICCV 2017 • Christian Rupprecht, Iro Laina, Robert DiPietro, Maximilian Baust, Federico Tombari, Nassir Navab, Gregory D. Hager
In future prediction, for example, many distinct outcomes are equally valid.
5 code implementations • CVPR 2017 • Colin Lea, Michael D. Flynn, Rene Vidal, Austin Reiter, Gregory D. Hager
The ability to identify and temporally segment fine-grained human actions throughout a video is crucial for robotics, surveillance, education, and beyond.
no code implementations • 25 Oct 2016 • Seth D. Billings, Ayushi Sinha, Austin Reiter, Simon Leonard, Masaru Ishii, Gregory D. Hager, Russell H. Taylor
Functional endoscopic sinus surgery (FESS) is a surgical procedure used to treat acute cases of sinusitis and other sinus diseases.
1 code implementation • 29 Aug 2016 • Colin Lea, Rene Vidal, Austin Reiter, Gregory D. Hager
The dominant paradigm for video-based action segmentation is composed of two steps: first, for each frame, compute low-level features using Dense Trajectories or a Convolutional Neural Network that encode spatiotemporal information locally, and second, input these features into a classifier that captures high-level temporal relationships, such as a Recurrent Neural Network (RNN).
Ranked #6 on Action Segmentation on JIGSAWS
no code implementations • 8 Aug 2016 • William Gray Roncal, Colin Lea, Akira Baruah, Gregory D. Hager
Our automated approach improves the local subgraph score by more than four times and the full graph score by 60 percent.
3 code implementations • 20 Jun 2016 • Robert DiPietro, Colin Lea, Anand Malpani, Narges Ahmidi, S. Swaroop Vedula, Gyusung I. Lee, Mija R. Lee, Gregory D. Hager
In contrast, we work on recognizing both gestures and longer, higher-level activites, or maneuvers, and we model the mapping from kinematics to gestures/maneuvers with recurrent neural networks.
Ranked #1 on Surgical Skills Evaluation on MISTIC-SIL
no code implementations • 9 Feb 2016 • Colin Lea, Austin Reiter, Rene Vidal, Gregory D. Hager
We propose a model for action segmentation which combines low-level spatiotemporal features with a high-level segmental classifier.
Ranked #7 on Action Segmentation on JIGSAWS
no code implementations • CVPR 2015 • Chi Li, Austin Reiter, Gregory D. Hager
In this paper, we formulate a probabilistic framework for analyzing the performance of pooling.
no code implementations • 18 Dec 2014 • Piyush Poddar, Narges Ahmidi, S. Swaroop Vedula, Lisa Ishii, Gregory D. Hager, Masaru Ishii
Previous work on surgical skill assessment using intraoperative tool motion in the operating room (OR) has focused on highly-structured surgical tasks such as cholecystectomy.
no code implementations • 25 Nov 2014 • William Gray Roncal, Dean M. Kleissas, Joshua T. Vogelstein, Priya Manavalan, Kunal Lillaney, Michael Pekala, Randal Burns, R. Jacob Vogelstein, Carey E. Priebe, Mark A. Chevillet, Gregory D. Hager
Finally, we deploy a reference end-to-end version of the pipeline on a large, publicly available data set.
1 code implementation • 7 Oct 2014 • Xiang Xiang, Minh Dao, Gregory D. Hager, Trac. D. Tran
In this paper, we design a Collaborative-Hierarchical Sparse and Low-Rank (C-HiSLR) model that is natural for recognizing human emotion in visual data.
no code implementations • 14 Mar 2014 • William Gray Roncal, Michael Pekala, Verena Kaynig-Fittkau, Dean M. Kleissas, Joshua T. Vogelstein, Hanspeter Pfister, Randal Burns, R. Jacob Vogelstein, Mark A. Chevillet, Gregory D. Hager
An open challenge problem at the forefront of modern neuroscience is to obtain a comprehensive mapping of the neural pathways that underlie human brain function; an enhanced understanding of the wiring diagram of the brain promises to lead to new breakthroughs in diagnosing and treating neurological disorders.