1 code implementation • 6 Oct 2022 • Madhav Agarwal, Rudrabha Mukhopadhyay, Vinay Namboodiri, C V Jawahar
The identity-aware generator takes the source image and the warped motion features as input to generate a high-quality output with fine-grained details.
1 code implementation • 24 Jun 2021 • Parul Kapoor, Rudrabha Mukhopadhyay, Sindhu B Hegde, Vinay Namboodiri, C V Jawahar
Since the current datasets are inadequate for generating sign language directly from speech, we collect and release the first Indian sign language dataset comprising speech-level annotations, text transcripts, and the corresponding sign-language videos.
1 code implementation • 7 Oct 2022 • Madhav Agarwal, Anchit Gupta, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C V Jawahar
We use a state-of-the-art face reenactment network to detect key points in the non-pivot frames and transmit them to the receiver.
1 code implementation • Findings (ACL) 2021 • Zeeshan Khan, Kartheek Akella, Vinay P. Namboodiri, C V Jawahar
We propose a novel adaptation strategy, where we iteratively prune and retrain the redundant parameters of an MNMT to improve bilingual representations while retaining the multilinguality.
1 code implementation • 4 May 2021 • Thrupthi Ann John, Vineeth N Balasubramanian, C V Jawahar
As Deep Neural Network models for face processing tasks approach human-like performance, their deployment in critical applications such as law enforcement and access control has seen an upswing, where any failure may have far-reaching consequences.
no code implementations • ICON 2020 • Kartheek Akella, Sai Himal Allu, Sridhar Suresh Ragupathi, Aman Singhal, Zeeshan Khan, Vinay P. Namboodiri, C V Jawahar
In this paper, we address the task of improving pair-wise machine translation for specific low-resource Indian languages.
no code implementations • 2 Nov 2021 • Bipasha Sen, Aditya Agarwal, Rudrabha Mukhopadhyay, Vinay Namboodiri, C V Jawahar
Apart from evaluating our approach on the ALS patient, we also extend it to people with hearing impairment relying extensively on lip movements to communicate.
no code implementations • ICON 2020 • Binu Jasim, Vinay Namboodiri, C V Jawahar
Back-translation augments parallel data by translating monolingual sentences on the target side into the source language.
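To make the back-translation idea in the snippet above concrete, here is a minimal sketch of the data-augmentation step only. It is not the authors' implementation: the function `back_translate`, the `Pair` alias, and the toy word-lookup translator `toy_target_to_source` are all hypothetical names introduced for illustration, and a real system would use a trained target-to-source translation model in place of the lookup.

```python
from typing import Callable, List, Tuple

Pair = Tuple[str, str]  # (source sentence, target sentence)


def back_translate(
    parallel: List[Pair],
    monolingual_target: List[str],
    target_to_source: Callable[[str], str],
) -> List[Pair]:
    """Return the parallel data plus synthetic pairs.

    Each monolingual target sentence is translated into the source
    language, and the (synthetic source, real target) pair is appended.
    The input list is not modified.
    """
    synthetic = [(target_to_source(t), t) for t in monolingual_target]
    return parallel + synthetic


# Hypothetical stand-in for a trained target-to-source model:
# a word-level lookup that leaves unknown words unchanged.
TOY_LEXICON = {"bonjour": "hello", "monde": "world"}


def toy_target_to_source(sentence: str) -> str:
    return " ".join(TOY_LEXICON.get(word, word) for word in sentence.split())


parallel = [("hello", "bonjour")]
augmented = back_translate(parallel, ["bonjour monde"], toy_target_to_source)
```

The synthetic source side is machine-generated while the target side is genuine text, which is why the augmented pairs are typically used to train the source-to-target direction.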
no code implementations • 13 Nov 2021 • Sachin Raja, Ajoy Mondal, C V Jawahar
Tables in unstructured business documents are tough to parse due to the high diversity of layouts, varying alignments of contents, and the presence of empty cells.
no code implementations • 12 Mar 2024 • Harsh Lunia, Ajoy Mondal, C V Jawahar
Several benchmark datasets and substantial work on deep learning models are available for Latin languages to meet this need.