Search Results for author: Tanuja Ganu

Found 13 papers, 4 papers with code

Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering

no code implementations17 Feb 2024 Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth

Large Language Models (LLMs), excel in natural language understanding, but their capability for complex mathematical reasoning with an amalgamation of structured tables and unstructured text is uncertain.

Arithmetic Reasoning Mathematical Reasoning +2

Breaking Language Barriers with a LEAP: Learning Strategies for Polyglot LLMs

no code implementations28 May 2023 Akshay Nambi, Vaibhav Balloli, Mercy Ranjit, Tanuja Ganu, Kabir Ahuja, Sunayana Sitaram, Kalika Bali

Our results show substantial advancements in multilingual understanding and generation across a diverse range of languages.

Question Answering Retrieval

Retrieval Augmented Chest X-Ray Report Generation using OpenAI GPT models

no code implementations5 May 2023 Mercy Ranjit, Gopinath Ganapathy, Ranjit Manuel, Tanuja Ganu

We propose Retrieval Augmented Generation (RAG) as an approach for automated radiology report writing that leverages multimodally aligned embeddings from a contrastively pretrained vision language model for retrieval of relevant candidate radiology text for an input radiology image and a general domain generative model like OpenAI text-davinci-003, gpt-3. 5-turbo and gpt-4 for report generation using the relevant radiology text retrieved.

Instruction Following Language Modelling +1

MEGA: Multilingual Evaluation of Generative AI

1 code implementation22 Mar 2023 Kabir Ahuja, Harshita Diddee, Rishav Hada, Millicent Ochieng, Krithika Ramesh, Prachi Jain, Akshay Nambi, Tanuja Ganu, Sameer Segal, Maxamed Axmed, Kalika Bali, Sunayana Sitaram

Most studies on generative LLMs have been restricted to English and it is unclear how capable these models are at understanding and generating text in other languages.

Benchmarking

ChartParser: Automatic Chart Parsing for Print-Impaired

no code implementations16 Nov 2022 Anukriti Kumar, Tanuja Ganu, Saikat Guha

Infographics are often an integral component of scientific documents for reporting qualitative or quantitative findings as they make it much simpler to comprehend the underlying complex information.

Optical Character Recognition (OCR)

Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models

1 code implementation27 Oct 2022 Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali

Leveraging shared learning through Massively Multilingual Models, state-of-the-art machine translation models are often able to adapt to the paucity of data for low-resource languages.

Knowledge Distillation Machine Translation +1

Towards Optimizing OCR for Accessibility

no code implementations21 Jun 2022 Peya Mowar, Tanuja Ganu, Saikat Guha

Visual cues such as structure, emphasis, and icons play an important role in efficient information foraging by sighted individuals and make for a pleasurable reading experience.

Optical Character Recognition (OCR)

Document Navigability: A Need for Print-Impaired

no code implementations21 Jun 2022 Anukriti Kumar, Tanuja Ganu, Saikat Guha

Printed documents continue to be a challenge for blind, low-vision, and other print-disabled (BLV) individuals.

Broken News: Making Newspapers Accessible to Print-Impaired

no code implementations21 Jun 2022 Vishal Agarwal, Tanuja Ganu, Saikat Guha

Accessing daily news content still remains a big challenge for people with print-impairment including blind and low-vision due to opacity of printed content and hindrance from online sources.

Instance Segmentation Optical Character Recognition (OCR) +2

Predicting the Performance of Multilingual NLP Models

no code implementations17 Oct 2021 Anirudh Srinivasan, Sunayana Sitaram, Tanuja Ganu, Sandipan Dandapat, Kalika Bali, Monojit Choudhury

Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages.

Multilingual NLP

GCM: A Toolkit for Generating Synthetic Code-mixed Text

1 code implementation EACL 2021 Mohd Sanad Zaki Rizvi, Anirudh Srinivasan, Tanuja Ganu, Monojit Choudhury, Sunayana Sitaram

Code-mixing is common in multilingual communities around the world, and processing it is challenging due to the lack of labeled and unlabeled data.

Optimising Lockdown Policies for Epidemic Control using Reinforcement Learning

1 code implementation31 Mar 2020 Harshad Khadilkar, Tanuja Ganu, Deva P Seetharam

In the context of the ongoing Covid-19 pandemic, several reports and studies have attempted to model and predict the spread of the disease.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.