no code implementations • 10 Sep 2021 • Shashank Bujimalla, Mahesh Subedar, Omesh Tickoo
PS-NOC is agnostic to model architecture, and primarily focuses on the training approach that uses existing fully paired image-caption data and the images with only the novel object detection labels (partially paired data).
no code implementations • 10 Jun 2021 • Shashank Bujimalla, Mahesh Subedar, Omesh Tickoo
In this paper, we study the impact of motion blur, a common quality flaw in real world images, on a state-of-the-art two-stage image captioning solution, and notice a degradation in solution performance as blur intensity increases.
no code implementations • 6 Apr 2020 • Shashank Bujimalla, Mahesh Subedar, Omesh Tickoo
The "baseline" for the policy-gradients in B-SCST is generated by averaging predictive quality metrics (CIDEr-D) of the captions drawn from the distribution obtained using a Bayesian DNN model.