Search Results for author: Prithwijit Guha

Found 8 papers, 2 papers with code

VQA with Cascade of Self- and Co-Attention Blocks

no code implementations • 28 Feb 2023 • Aakansha Mishra, Ashish Anand, Prithwijit Guha

The use of complex attention modules has improved performance on the Visual Question Answering (VQA) task.

Question Answering, Visual Question Answering

Facial Keypoint Sequence Generation from Audio

no code implementations • 2 Nov 2020 • Prateek Manocha, Prithwijit Guha

To the best of our knowledge, this is the first work that proposes an audio-keypoint dataset and learns a model that outputs a plausible keypoint sequence for audio of arbitrary length.

IQ-VQA: Intelligent Visual Question Answering

1 code implementation • 8 Jul 2020 • Vatsal Goel, Mohit Chandak, Ashish Anand, Prithwijit Guha

As a part of the cyclic framework, we propose a novel implication generator which can generate implied questions from any question-answer pair.

Question Answering, Visual Question Answering

CQ-VQA: Visual Question Answering on Categorized Questions

no code implementations • 17 Feb 2020 • Aakansha Mishra, Ashish Anand, Prithwijit Guha

The second level, referred to as the answer predictor (AP), comprises a set of distinct classifiers, one for each question category.

Question Answering, Visual Question Answering

Reinforcement Learning via Recurrent Convolutional Neural Networks

1 code implementation • 9 Jan 2017 • Tanmay Shankar, Santosha K. Dwivedy, Prithwijit Guha

Deep Reinforcement Learning has enabled the learning of policies for complex tasks in partially observable environments, without explicitly learning the underlying model of the tasks.

Reinforcement Learning (RL)

Overlay Text Extraction From TV News Broadcast

no code implementations • 2 Apr 2016 • Raghvendra Kannao, Prithwijit Guha

In this paper, we present a contrast-enhancement-based preprocessing stage for overlay text detection and a parameter-free, edge-density-based scheme for efficient text band detection.

Optical Character Recognition (OCR), Text Detection

TV News Commercials Detection using Success based Locally Weighted Kernel Combination

no code implementations • 5 Jul 2015 • Raghvendra Kannao, Prithwijit Guha

We adopt an intermediate fusion approach in which an SVM is trained with a weighted linear combination of different kernel functions instead of a single kernel function.

An Occlusion Reasoning Scheme for Monocular Pedestrian Tracking in Dynamic Scenes

no code implementations • 25 Jan 2015 • Sourav Garg, Swagat Kumar, Rajesh Ratnakaram, Prithwijit Guha

This paper looks into the problem of pedestrian tracking using a monocular, potentially moving, uncalibrated camera.
