Search Results for author: Rohan Wadhawan

Found 4 papers, 0 papers with code

ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models

no code implementations • 24 Jan 2024 • Rohan Wadhawan, Hritik Bansal, Kai-Wei Chang, Nanyun Peng

Our findings reveal a significant performance gap of 30. 8% between the best-performing LMM, GPT-4V(ision), and human capabilities using human evaluation indicating substantial room for improvement in context-sensitive text-rich visual reasoning.

Visual Reasoning

Paper
Add Code

Multi-Attributed and Structured Text-to-Face Synthesis

no code implementations • 25 Aug 2021 • Rohan Wadhawan, Tanuj Drall, Shubham Singh, Shampa Chakraverty

Generative Adversarial Networks (GANs) have revolutionized image synthesis through many applications like face generation, photograph editing, and image super-resolution.

Descriptive Face Generation +3

Paper
Add Code

Landmark-Aware and Part-based Ensemble Transfer Learning Network for Facial Expression Recognition from Static images

no code implementations • 22 Apr 2021 • Rohan Wadhawan, Tapan K. Gandhi

Facial Expression Recognition from static images is a challenging problem in computer vision applications.

Computational Efficiency Ensemble Learning +4

Paper
Add Code

Intelligent Monitoring of Stress Induced by Water Deficiency in Plants using Deep Learning

no code implementations • 16 Apr 2021 • Shiva Azimi, Rohan Wadhawan, Tapan K. Gandhi

Our model has achieved ceiling level classification performance of 98. 52% on JG-62 and 97. 78% on Pusa-372 chickpea plant data and has outperformed the best reported time-invariant technique by at least 14% for both JG-62 and Pusa-372 species, to the best of our knowledge.

Plant Phenotyping

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.