Search Results for author: Prateek Chhikara

Found 7 papers, 1 papers with code

Towards Perceiving Small Visual Details in Zero-shot Visual Question Answering with Multimodal LLMs

2 code implementations • 24 Oct 2023 • Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski

In particular, we show that their zero-shot accuracy in answering visual questions is very sensitive to the size of the visual subject of the question, declining up to 46% with size.

Question Answering Visual Question Answering

Paper
Code

FIRE: Food Image to REcipe generation

no code implementations • 28 Aug 2023 • Prateek Chhikara, Dhiraj Chaurasia, Yifan Jiang, Omkar Masur, Filip Ilievski

Food computing has emerged as a prominent multidisciplinary field of research in recent years.

Language Modelling Large Language Model +2

Paper
Add Code

Privacy Aware Question-Answering System for Online Mental Health Risk Assessment

no code implementations • 9 Jun 2023 • Prateek Chhikara, Ujjwal Pasupulety, John Marshall, Dhiraj Chaurasia, Shweta Kumari

Pre-trained Language Models (LMs) can assess users' social media data and classify them in terms of their mental health risk.

Question Answering

Paper
Add Code

Using Visual Cropping to Enhance Fine-Detail Question Answering of BLIP-Family Models

no code implementations • 31 May 2023 • Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski

As our initial analysis of BLIP-family models revealed difficulty with answering fine-detail questions, we investigate the following question: Can visual cropping be employed to improve the performance of state-of-the-art visual question answering models on fine-detail questions?

Question Answering Visual Question Answering

Paper
Add Code

Knowledge-enhanced Agents for Interactive Text Games

no code implementations • 8 May 2023 • Prateek Chhikara, Jiarui Zhang, Filip Ilievski, Jonathan Francis, Kaixin Ma

We experiment with four models on the 10 tasks in the ScienceWorld text-based game environment, to illustrate the impact of knowledge injection on various model configurations and challenging task settings.

Instruction Following Knowledge Graphs +5

Paper
Add Code

DIGITOUR: Automatic Digital Tours for Real-Estate Properties

no code implementations • 17 Jan 2023 • Prateek Chhikara, Harshul Kuhar, Anil Goyal, Chirag Sharma

A virtual or digital tour is a form of virtual reality technology which allows a user to experience a specific location remotely.

TAG

Paper
Add Code

RE-Tagger: A light-weight Real-Estate Image Classifier

no code implementations • 12 Jul 2022 • Prateek Chhikara, Anil Goyal, Chirag Sharma

Real-estate image tagging is one of the essential use-cases to save efforts involved in manual annotation and enhance the user experience.

Image Classification Transfer Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.