no code implementations • 24 Jan 2024 • Rohan Wadhawan, Hritik Bansal, Kai-Wei Chang, Nanyun Peng
Our findings reveal a significant performance gap of 30. 8% between the best-performing LMM, GPT-4V(ision), and human capabilities using human evaluation indicating substantial room for improvement in context-sensitive text-rich visual reasoning.
no code implementations • 25 Aug 2021 • Rohan Wadhawan, Tanuj Drall, Shubham Singh, Shampa Chakraverty
Generative Adversarial Networks (GANs) have revolutionized image synthesis through many applications like face generation, photograph editing, and image super-resolution.
no code implementations • 22 Apr 2021 • Rohan Wadhawan, Tapan K. Gandhi
Facial Expression Recognition from static images is a challenging problem in computer vision applications.
no code implementations • 16 Apr 2021 • Shiva Azimi, Rohan Wadhawan, Tapan K. Gandhi
Our model has achieved ceiling level classification performance of 98. 52% on JG-62 and 97. 78% on Pusa-372 chickpea plant data and has outperformed the best reported time-invariant technique by at least 14% for both JG-62 and Pusa-372 species, to the best of our knowledge.