2 code implementations • 24 Oct 2023 • Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski
In particular, we show that their zero-shot accuracy in answering visual questions is very sensitive to the size of the visual subject of the question, declining by up to 46% with size.
no code implementations • 28 Aug 2023 • Prateek Chhikara, Dhiraj Chaurasia, Yifan Jiang, Omkar Masur, Filip Ilievski
Food computing has emerged as a prominent multidisciplinary field of research in recent years.
no code implementations • 9 Jun 2023 • Prateek Chhikara, Ujjwal Pasupulety, John Marshall, Dhiraj Chaurasia, Shweta Kumari
Pre-trained Language Models (LMs) can assess users' social media data and classify them in terms of their mental health risk.
no code implementations • 31 May 2023 • Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara, Filip Ilievski
As our initial analysis of BLIP-family models revealed difficulty with answering fine-detail questions, we investigate the following question: Can visual cropping be employed to improve the performance of state-of-the-art visual question answering models on fine-detail questions?
no code implementations • 8 May 2023 • Prateek Chhikara, Jiarui Zhang, Filip Ilievski, Jonathan Francis, Kaixin Ma
We experiment with four models on the 10 tasks in the ScienceWorld text-based game environment, to illustrate the impact of knowledge injection on various model configurations and challenging task settings.
no code implementations • 17 Jan 2023 • Prateek Chhikara, Harshul Kuhar, Anil Goyal, Chirag Sharma
A virtual or digital tour is a form of virtual reality technology that allows a user to experience a specific location remotely.
no code implementations • 12 Jul 2022 • Prateek Chhikara, Anil Goyal, Chirag Sharma
Real-estate image tagging is an essential use case that reduces the effort involved in manual annotation and enhances the user experience.