no code implementations • ICON 2021 • Loitongbam Sanayai Meetei, Laishram Rahul, Alok Singh, Salam Michael Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay
Based on this dataset, a benchmark evaluation is reported for the Manipuri-English Speech-to-Text translation using two approaches: 1) a pipeline model consisting of ASR (Automatic Speech Recognition) and Machine translation, and 2) an end-to-end Speech-to-Text translation.
no code implementations • ICON 2021 • Alok Singh, Loitongbam Sanayai Meetei, Salam Michael Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay
Describing a video is a challenging yet attractive task since it falls into the intersection of computer vision and natural language generation.
Ranked #1 on Video Captioning on Hindi MSR-VTT
no code implementations • ICON 2021 • Salam Michael Singh, Loitongbam Sanayai Meetei, Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay
In recent times, machine translation models have learned to perform implicit bridging between language pairs never seen explicitly during training, showing that transfer learning helps languages with constrained resources.
no code implementations • 2 Sep 2024 • Priyanka Chudasama, Anil Surisetty, Aakarsh Malhotra, Alok Singh
Classification tasks present challenges due to class imbalances and evolving data distributions.
no code implementations • 18 May 2022 • Deepak Chaurasiya, Anil Surisetty, Nitish Kumar, Alok Singh, Vikrant Dey, Aakarsh Malhotra, Gaurav Dhama, Ankur Arora
We further create an open-source repository for 14 embedding-based EA methods and present an analysis to motivate further research in the field of EA.
1 code implementation • 15 Jul 2021 • Joseph Palermo, Johnny Ye, Alok Singh
We convert the DeepMind Mathematics Dataset into a reinforcement learning environment by interpreting it as a program synthesis problem.
no code implementations • Multimedia Tools and Applications 2021 • Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay
The model is tested on the Hindi Visual Genome dataset to validate the proposed approach's performance, and cross-verification is carried out for English captions with the Flickr dataset.
no code implementations • journal 2021 • Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay
In recent times, active research has been under way to bridge the gap between computer vision and natural language.
Ranked #2 on Video Captioning on Hindi MSR-VTT
no code implementations • 30 Nov 2020 • Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay
In this work, we report a comprehensive survey on the phases of video description approaches, the dataset for video description, evaluation metrics, open competitions for motivating the research on the video description, open challenges in this field, and future research directions.
no code implementations • 7 Jun 2020 • Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay
Video captioning is the process of summarising the content, events and actions of a video into a short textual form, which can be helpful in many research areas such as video-guided machine translation, video sentiment analysis and providing aid to individuals in need.
Ranked #10 on Video Captioning on VATEX
1 code implementation • Signal, Image and Video Processing 2019 • Alok Singh, Dalton Meitei Thounaojam, Saptarshi Chakraborty
Much research has been done on shot boundary detection, but existing approaches have yet to efficiently handle videos with sudden illumination changes and object/camera motion effects.
1 code implementation • 30 Jun 2019 • Jason Mancuso, Tomasz Kisielewski, David Lindner, Alok Singh
We show that if the reward corruption in a CRMDP is sufficiently "spiky", the environment is solvable.
no code implementations • 17 Apr 2018 • Alok Singh, Eric Stephan, Malachi Schram, Ilkay Altintas
In this vision paper, we outline our approach to leveraging Deep Learning algorithms to discover solutions to unique problems that arise in a system with computational infrastructure that is spread over a wide area.
no code implementations • 15 Nov 2017 • Alok Singh, Mai Nguyen, Shweta Purawat, Daniel Crawl, Ilkay Altintas
The central idea of this work is to train resource-centric machine learning agents to capture complex relationships between a set of program instructions and their performance metrics when executed on a specific resource.