no code implementations • 17 Feb 2024 • Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth
Large Language Models (LLMs), excel in natural language understanding, but their capability for complex mathematical reasoning with an amalgamation of structured tables and unstructured text is uncertain.
no code implementations • 28 May 2023 • Akshay Nambi, Vaibhav Balloli, Mercy Ranjit, Tanuja Ganu, Kabir Ahuja, Sunayana Sitaram, Kalika Bali
Our results show substantial advancements in multilingual understanding and generation across a diverse range of languages.
no code implementations • 5 May 2023 • Mercy Ranjit, Gopinath Ganapathy, Ranjit Manuel, Tanuja Ganu
We propose Retrieval Augmented Generation (RAG) as an approach for automated radiology report writing that leverages multimodally aligned embeddings from a contrastively pretrained vision language model for retrieval of relevant candidate radiology text for an input radiology image and a general domain generative model like OpenAI text-davinci-003, gpt-3. 5-turbo and gpt-4 for report generation using the relevant radiology text retrieved.
1 code implementation • 22 Mar 2023 • Kabir Ahuja, Harshita Diddee, Rishav Hada, Millicent Ochieng, Krithika Ramesh, Prachi Jain, Akshay Nambi, Tanuja Ganu, Sameer Segal, Maxamed Axmed, Kalika Bali, Sunayana Sitaram
Most studies on generative LLMs have been restricted to English and it is unclear how capable these models are at understanding and generating text in other languages.
no code implementations • 16 Nov 2022 • Anukriti Kumar, Tanuja Ganu, Saikat Guha
Infographics are often an integral component of scientific documents for reporting qualitative or quantitative findings as they make it much simpler to comprehend the underlying complex information.
no code implementations • 31 Oct 2022 • Pragya Srivastava, Tanuja Ganu, Saikat Guha
We present very early results on using GPT-3 to perform question answering on tabular data.
1 code implementation • 27 Oct 2022 • Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali
Leveraging shared learning through Massively Multilingual Models, state-of-the-art machine translation models are often able to adapt to the paucity of data for low-resource languages.
no code implementations • 21 Jun 2022 • Peya Mowar, Tanuja Ganu, Saikat Guha
Visual cues such as structure, emphasis, and icons play an important role in efficient information foraging by sighted individuals and make for a pleasurable reading experience.
no code implementations • 21 Jun 2022 • Anukriti Kumar, Tanuja Ganu, Saikat Guha
Printed documents continue to be a challenge for blind, low-vision, and other print-disabled (BLV) individuals.
no code implementations • 21 Jun 2022 • Vishal Agarwal, Tanuja Ganu, Saikat Guha
Accessing daily news content still remains a big challenge for people with print-impairment including blind and low-vision due to opacity of printed content and hindrance from online sources.
Instance Segmentation Optical Character Recognition (OCR) +2
no code implementations • 17 Oct 2021 • Anirudh Srinivasan, Sunayana Sitaram, Tanuja Ganu, Sandipan Dandapat, Kalika Bali, Monojit Choudhury
Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages.
1 code implementation • EACL 2021 • Mohd Sanad Zaki Rizvi, Anirudh Srinivasan, Tanuja Ganu, Monojit Choudhury, Sunayana Sitaram
Code-mixing is common in multilingual communities around the world, and processing it is challenging due to the lack of labeled and unlabeled data.
1 code implementation • 31 Mar 2020 • Harshad Khadilkar, Tanuja Ganu, Deva P Seetharam
In the context of the ongoing Covid-19 pandemic, several reports and studies have attempted to model and predict the spread of the disease.