Search Results for author: Yaman Kumar

Found 23 papers, 8 papers with code

Exploring Graph Neural Networks for Indian Legal Judgment Prediction

no code implementations19 Oct 2023 Mann Khatri, Mirza Yusuf, Yaman Kumar, Rajiv Ratn Shah, Ponnurangam Kumaraguru

We explored various embeddings as model features, while nodes such as time nodes and judicial acts were added and pruned to evaluate the model's performance.

Fairness Link Prediction +1

Get It Scored Using AutoSAS -- An Automated System for Scoring Short Answers

no code implementations21 Dec 2020 Yaman Kumar, Swati Aggarwal, Debanjan Mahata, Rajiv Ratn Shah, Ponnurangam Kumaraguru, Roger Zimmermann

In this paper, we present a fast, scalable, and accurate approach towards automated Short Answer Scoring (SAS).

LIFI: Towards Linguistically Informed Frame Interpolation

1 code implementation30 Oct 2020 Aradhya Neeraj Mathur, Devansh Batra, Yaman Kumar, Rajiv Ratn Shah, Roger Zimmermann

We also release several datasets to test computer vision video generation models of their speech understanding.

Video Generation

"Notic My Speech" -- Blending Speech Patterns With Multimedia

no code implementations12 Jun 2020 Dhruva Sahrawat, Yaman Kumar, Shashwat Aggarwal, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann

To close the gap between speech understanding and multimedia video applications, in this paper, we show the initial experiments by modelling the perception on visual speech and showing its use case on video compression.

speech-recognition Video Compression +1

audino: A Modern Annotation Tool for Audio and Speech

1 code implementation9 Jun 2020 Manraj Singh Grover, Pakhi Bamdev, Ratin Kumar Brala, Yaman Kumar, Mika Hama, Rajiv Ratn Shah

The tool allows audio data and their corresponding annotations to be uploaded and assigned to a user through a key-based API.

Action Detection Activity Detection +4

Multi-modal Automated Speech Scoring using Attention Fusion

no code implementations17 May 2020 Manraj Singh Grover, Yaman Kumar, Sumit Sarin, Payman Vafaee, Mika Hama, Rajiv Ratn Shah

In this study, we propose a novel multi-modal end-to-end neural approach for automated assessment of non-native English speakers' spontaneous speech using attention fusion.

An Annotated Dataset of Discourse Modes in Hindi Stories

no code implementations LREC 2020 Swapnil Dhanwal, Hritwik Dutta, Hitesh Nankani, Nilay Shrivastava, Yaman Kumar, Junyi Jessy Li, Debanjan Mahata, Rakesh Gosangi, Haimin Zhang, Rajiv Ratn Shah, Am Stent, a

In this paper, we present a new corpus consisting of sentences from Hindi short stories annotated for five different discourse modes argumentative, narrative, descriptive, dialogic and informative.

Descriptive Sentence

Touchless Typing using Head Movement-based Gestures

1 code implementation24 Jan 2020 Shivam Rustagi, Aakash Garg, Pranay Raj Anand, Rajesh Kumar, Yaman Kumar, Rajiv Ratn Shah

The modified GRU-based model outperforms the standard CNN-RNN and Conv3D models for three of the four scenarios.

Human-Computer Interaction I.2.7

Keyphrase Extraction from Scholarly Articles as Sequence Labeling using Contextualized Embeddings

no code implementations19 Oct 2019 Dhruva Sahrawat, Debanjan Mahata, Mayank Kulkarni, Haimin Zhang, Rakesh Gosangi, Amanda Stent, Agniv Sharma, Yaman Kumar, Rajiv Ratn Shah, Roger Zimmermann

In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where the words in the input text are represented using deep contextualized embeddings.

Keyphrase Extraction Word Embeddings

BHAAV- A Text Corpus for Emotion Analysis from Hindi Stories

1 code implementation9 Oct 2019 Yaman Kumar, Debanjan Mahata, Sagar Aggarwal, Anmol Chugh, Rajat Maheshwari, Rajiv Ratn Shah

In this paper, we introduce the first and largest Hindi text corpus, named BHAAV, which means emotions in Hindi, for analyzing emotions that a writer expresses through his characters in a story, as perceived by a narrator/reader.

Emotion Recognition Sentence

MobiVSR: A Visual Speech Recognition Solution for Mobile Devices

no code implementations10 May 2019 Nilay Shrivastava, Astitwa Saxena, Yaman Kumar, Rajiv Ratn Shah, Debanjan Mahata, Amanda Stent

Visual speech recognition (VSR) is the task of recognizing spoken language from video input only, without any audio.

Lip Reading Quantization +2

Suggestion Mining from Online Reviews using ULMFiT

1 code implementation19 Apr 2019 Sarthak Anand, Debanjan Mahata, Kartik Aggarwal, Laiba Mehnaz, Simra Shahid, Haimin Zhang, Yaman Kumar, Rajiv Ratn Shah, Karan Uppal

In this paper we present our approach and the system description for Sub Task A of SemEval 2019 Task 9: Suggestion Mining from Online Reviews and Forums.

General Classification Language Modelling +4

Harnessing GANs for Zero-shot Learning of New Classes in Visual Speech Recognition

1 code implementation29 Jan 2019 Yaman Kumar, Dhruva Sahrawat, Shubham Maheshwari, Debanjan Mahata, Amanda Stent, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann

To solve this problem, we present a novel approach to zero-shot learning by generating new classes using Generative Adversarial Networks (GANs), and show how the addition of unseen class samples increases the accuracy of a VSR system by a significant margin of 27% and allows it to handle speaker-independent out-of-vocabulary phrases.

speech-recognition Visual Speech Recognition +1

Kiki Kills: Identifying Dangerous Challenge Videos from Social Media

no code implementations2 Dec 2018 Nupur Baghel, Yaman Kumar, Paavini Nanda, Rajiv Ratn Shah, Debanjan Mahata, Roger Zimmermann

There has been upsurge in the number of people participating in challenges made popular through social media channels.

Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed

no code implementations2 Jul 2018 Yaman Kumar, Mayank Aggarwal, Pratham Nawal, Shin'ichi Satoh, Rajiv Ratn Shah, Roger Zimmerman

Recently, research has started venturing into generating (audio) speech from silent video sequences but there have been no developments thus far in dealing with divergent views and poses of a speaker.

Sound Audio and Speech Processing

Cannot find the paper you are looking for? You can Submit a new open access paper.