Search Results for author: Lalithkumar Seenivasan

Found 9 papers, 9 papers with code

Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery

1 code implementation • 19 May 2023 • Long Bai, Mobarakol Islam, Lalithkumar Seenivasan, Hongliang Ren

In this paper, we propose Visual Question Localized-Answering in Robotic Surgery (Surgical-VQLA) to localize the specific surgical area during the answer prediction.

Tasks: Answer Generation, object-detection, +3
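The gated vision-language embedding named in the title can be pictured as a small fusion module in which a learned gate weighs the visual against the textual signal per feature dimension. The module below is an illustrative sketch under that reading (layer sizes, the tanh/sigmoid choices, and all names are hypothetical), not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class GatedVisionLanguageFusion(nn.Module):
    """Illustrative gated fusion of vision and language embeddings.

    Hypothetical module: a sigmoid gate decides, per feature dimension, how
    much of the visual versus textual signal enters the joint embedding.
    """

    def __init__(self, vis_dim: int, txt_dim: int, embed_dim: int):
        super().__init__()
        self.proj_vis = nn.Linear(vis_dim, embed_dim)
        self.proj_txt = nn.Linear(txt_dim, embed_dim)
        self.gate = nn.Linear(2 * embed_dim, embed_dim)

    def forward(self, vis_feat: torch.Tensor, txt_feat: torch.Tensor) -> torch.Tensor:
        v = torch.tanh(self.proj_vis(vis_feat))   # (batch, embed_dim)
        t = torch.tanh(self.proj_txt(txt_feat))   # (batch, embed_dim)
        g = torch.sigmoid(self.gate(torch.cat([v, t], dim=-1)))
        return g * v + (1.0 - g) * t              # gated joint embedding


# Example: fuse pooled CNN features with pooled text-encoder features.
fusion = GatedVisionLanguageFusion(vis_dim=2048, txt_dim=768, embed_dim=512)
joint = fusion(torch.randn(4, 2048), torch.randn(4, 768))  # (4, 512)
```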

SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery

1 code implementation • 19 Apr 2023 • Lalithkumar Seenivasan, Mobarakol Islam, Gokul Kannan, Hongliang Ren

Given the limitations of unidirectional attention in GPT models and their ability to generate coherent long paragraphs, we carefully sequence the word tokens before vision tokens, mimicking the human thought process of understanding the question to infer an answer from an image.

Tasks: Question Answering, Scene Segmentation, +1
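The token ordering described above matters because GPT-style decoders use causal (unidirectional) attention: placing the question's word tokens before the vision tokens lets every vision token attend to the full question. The snippet below is a minimal sketch of that sequencing only, with stand-in embedding tables, projections, and dimensions (all assumptions), not the SurgicalGPT implementation.

```python
import torch
import torch.nn as nn

embed_dim = 768
word_embed = nn.Embedding(30_000, embed_dim)      # stand-in GPT word embedding table
vis_proj = nn.Linear(2048, embed_dim)             # stand-in projection for CNN patch features

question_ids = torch.randint(0, 30_000, (1, 12))  # 12 question tokens
vision_feats = torch.randn(1, 49, 2048)           # 49 visual patch features

word_tokens = word_embed(question_ids)            # (1, 12, 768)
vision_tokens = vis_proj(vision_feats)            # (1, 49, 768)

# Word tokens first, vision tokens second; the sequence would then be fed to a
# causal decoder (e.g. GPT-2 via its `inputs_embeds` argument).
sequence = torch.cat([word_tokens, vision_tokens], dim=1)  # (1, 61, 768)
```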

Paced-Curriculum Distillation with Prediction and Label Uncertainty for Image Segmentation

1 code implementation • 2 Feb 2023 • Mobarakol Islam, Lalithkumar Seenivasan, S. P. Sharan, V. K. Viekash, Bhavesh Gupta, Ben Glocker, Hongliang Ren

Purpose: In curriculum learning, the idea is to train on easier samples first and gradually increase the difficulty, while in self-paced learning, a pacing function defines the speed to adapt the training progress.

Tasks: Image Segmentation, Medical Image Segmentation, +3
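As a concrete illustration of the pacing idea in the Purpose line above, the sketch below grows the fraction of (easiest-first) training samples linearly over epochs, ranking samples by a generic difficulty score such as prediction or label uncertainty. The linear schedule and all names are assumptions for illustration, not the paper's paced-curriculum distillation procedure.

```python
import numpy as np

def linear_pacing(epoch: int, total_epochs: int, start_frac: float = 0.2) -> float:
    """Fraction of the (easiest) training data exposed at a given epoch,
    growing linearly from `start_frac` to 1.0. Illustrative schedule only."""
    return min(1.0, start_frac + (1.0 - start_frac) * epoch / max(1, total_epochs - 1))


def select_curriculum_subset(difficulty: np.ndarray, epoch: int, total_epochs: int) -> np.ndarray:
    """Indices of the easiest samples (lowest difficulty score, e.g. derived
    from prediction/label uncertainty) allowed at this epoch."""
    frac = linear_pacing(epoch, total_epochs)
    n_keep = max(1, int(frac * len(difficulty)))
    return np.argsort(difficulty)[:n_keep]


# Example: 1000 samples scored by some uncertainty measure.
scores = np.random.rand(1000)
idx_first = select_curriculum_subset(scores, epoch=0, total_epochs=50)   # ~20% easiest
idx_last = select_curriculum_subset(scores, epoch=49, total_epochs=50)   # all samples
```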

Task-Aware Asynchronous Multi-Task Model with Class Incremental Contrastive Learning for Surgical Scene Understanding

1 code implementation • 28 Nov 2022 • Lalithkumar Seenivasan, Mobarakol Islam, Mengya Xu, Chwee Ming Lim, Hongliang Ren

Conclusion: The proposed multi-task model was able to adapt to domain shifts, incorporate novel instruments in the target domain, and perform tool-tissue interaction detection and report generation on par with single-task models.

Tasks: Contrastive Learning, Decision Making, +4

Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformer

2 code implementations • 22 Jun 2022 • Lalithkumar Seenivasan, Mobarakol Islam, Adithya K Krishna, Hongliang Ren

This overload often limits their time answering questionnaires from patients, medical students or junior residents related to surgical procedures.

Tasks: Question Answering, Sentence, +1

Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding

2 code implementations • 28 Jan 2022 • Lalithkumar Seenivasan, Sai Mitheran, Mobarakol Islam, Hongliang Ren

Global and local relational reasoning enable scene understanding models to perform human-like scene analysis and understanding.

Tasks: Graph Attention, Knowledge Distillation, +5

Class-Distribution-Aware Calibration for Long-Tailed Visual Recognition

1 code implementation • 11 Sep 2021 • Mobarakol Islam, Lalithkumar Seenivasan, Hongliang Ren, Ben Glocker

In CDA-TS, the scalar temperature value is replaced with the CDA temperature vector encoded with class frequency to compensate for the over-confidence.
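The idea can be illustrated with a small sketch: replace the single scalar of standard temperature scaling with a per-class temperature vector built from class frequencies, so frequent (head) classes are softened more. The frequency encoding below is an assumption for illustration; the paper's exact CDA-TS formulation may differ.

```python
import torch

def cda_temperature_scaling(logits: torch.Tensor,
                            class_counts: torch.Tensor,
                            base_temp: float = 1.5) -> torch.Tensor:
    """Illustrative class-distribution-aware temperature scaling.

    Each class gets its own temperature derived from its training frequency,
    so over-confident predictions for frequent classes are softened more.
    The normalisation below is an assumption, not the paper's exact encoding.
    """
    freq = class_counts.float() / class_counts.sum()   # class frequencies
    temp_vec = base_temp * (1.0 + freq / freq.max())   # per-class temperatures
    return torch.softmax(logits / temp_vec, dim=-1)    # calibrated probabilities


# Example: 5-class long-tailed problem.
counts = torch.tensor([5000, 1200, 300, 80, 20])
probs = cda_temperature_scaling(torch.randn(4, 5), counts)
```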

Learning and Reasoning with the Graph Structure Representation in Robotic Surgery

2 code implementations • 7 Jul 2020 • Mobarakol Islam, Lalithkumar Seenivasan, Lim Chwee Ming, Hongliang Ren

Learning to infer graph representations and performing spatial reasoning in a complex surgical environment can play a vital role in surgical scene understanding in robotic surgery.

Tasks: Edge Classification, Graph Generation, +3
