Search Results for author: Prithwijit Guha

Found 8 papers, 2 papers with code

VQA with Cascade of Self- and Co-Attention Blocks

no code implementations28 Feb 2023 Aakansha Mishra, Ashish Anand, Prithwijit Guha

The use of complex attention modules has improved the performance of the Visual Question Answering (VQA) task.

Question Answering Visual Question Answering

Facial Keypoint Sequence Generation from Audio

no code implementations2 Nov 2020 Prateek Manocha, Prithwijit Guha

To the best of our knowledge, this is the first work that proposes an audio-keypoint dataset and learns a model to output the plausible keypoint sequence to go with audio of any arbitrary length.

IQ-VQA: Intelligent Visual Question Answering

1 code implementation8 Jul 2020 Vatsal Goel, Mohit Chandak, Ashish Anand, Prithwijit Guha

As a part of the cyclic framework, we propose a novel implication generator which can generate implied questions from any question-answer pair.

Question Answering Visual Question Answering

CQ-VQA: Visual Question Answering on Categorized Questions

no code implementations17 Feb 2020 Aakansha Mishra, Ashish Anand, Prithwijit Guha

The second level, referred to as answer predictor (AP), comprises of a set of distinct classifiers corresponding to each question category.

Question Answering Visual Question Answering

Reinforcement Learning via Recurrent Convolutional Neural Networks

1 code implementation9 Jan 2017 Tanmay Shankar, Santosha K. Dwivedy, Prithwijit Guha

Deep Reinforcement Learning has enabled the learning of policies for complex tasks in partially observable environments, without explicitly learning the underlying model of the tasks.

reinforcement-learning Reinforcement Learning (RL)

Overlay Text Extraction From TV News Broadcast

no code implementations2 Apr 2016 Raghvendra Kannao, Prithwijit Guha

In this paper, we present a contrast enhancement based preprocessing stage for overlay text detection and a parameter free edge density based scheme for efficient text band detection.

Optical Character Recognition (OCR) Text Detection

TV News Commercials Detection using Success based Locally Weighted Kernel Combination

no code implementations5 Jul 2015 Raghvendra Kannao, Prithwijit Guha

We adopt a intermediate fusion approach where, a SVM is trained with a weighted linear combination of different kernel functions instead of single kernel function.

An Occlusion Reasoning Scheme for Monocular Pedestrian Tracking in Dynamic Scenes

no code implementations25 Jan 2015 Sourav Garg, Swagat Kumar, Rajesh Ratnakaram, Prithwijit Guha

This paper looks into the problem of pedestrian tracking using a monocular, potentially moving, uncalibrated camera.

Cannot find the paper you are looking for? You can Submit a new open access paper.