no code implementations • 21 Apr 2024 • Abhishek Jha, Yogesh Rawat, Shruti Vyas
We propose PV-S3 (Photovoltaic-Semi Supervised Segmentation), a Semi-Supervised Learning approach for semantic segmentation of defects in EL images that reduces reliance on extensive labeling.
1 code implementation • 22 Feb 2024 • Abhishek Jha, Matthew B. Blaschko, Yuki M. Asano, Tinne Tuytelaars
Last couple of years have witnessed a tremendous progress in self-supervised learning (SSL), the success of which can be attributed to the introduction of useful inductive biases in the learning process to learn meaningful visual representations while avoiding collapse.
no code implementations • 12 Feb 2024 • David C. Oluigboa, Bikash Santra, Tejas Sudharshan Mathai, Pritam Mukherjee, Jianfei Liu, Abhishek Jha, Mayank Patel, Karel Pacak, Ronald M. Summers
Pheochromocytomas and Paragangliomas (PPGLs) are rare adrenal and extra-adrenal tumors which have the potential to metastasize.
1 code implementation • 7 Mar 2022 • Abhishek Jha, Badri N. Patro, Luc van Gool, Tinne Tuytelaars
In this paper, we propose a novel regularization for VQA models, Constrained Optimization using Barlow's theory (COB), that improves the information content of the joint space by minimizing the redundancy.
no code implementations • 27 Nov 2021 • Vaishnavi Shrivastava, Radhika Gaonkar, Shashank Gupta, Abhishek Jha
Fine-tuning pre-trained language models improves the quality of commercial reply suggestion systems, but at the cost of unsustainable training times.
1 code implementation • ICCV 2021 • Soroush Seifi, Abhishek Jha, Tinne Tuytelaars
In this paper, we propose the Glimpse-Attend-and-Explore model which: (a) employs self-attention to guide the visual exploration instead of task-specific uncertainty maps; (b) can be used for both dense and sparse prediction tasks; and (c) uses a contrastive stream to further improve the representations learned.
1 code implementation • ACM Multimedia, 2019 2019 • Prajwal K R, Rudrabha Mukhopadhyay, Jerin Philip, Abhishek Jha, Vinay Namboodiri, C. V. Jawahar
As today's digital communication becomes increasingly visual, we argue that there is a need for systems that can automatically translate a video of a person speaking in language A into a target language B with realistic lip synchronization.
Ranked #1 on Talking Face Generation on LRW (using extra training data)
no code implementations • WS 2018 • Revanth Rameshkumar, Peter Bailey, Abhishek Jha, Chris Quirk
We describe the Enron People Assignment (EPA) dataset, in which tasks that are described in emails are associated with the person(s) responsible for carrying out these tasks.
no code implementations • 16 Jun 2018 • Sauradip Nag, Pallab Kumar Ganguly, Sumit Roy, Sourab Jha, Krishna Bose, Abhishek Jha, Kousik Dasgupta
Regional language extraction from a natural scene image is always a challenging proposition due to its dependence on the text information extracted from Image.