no code implementations • EACL (LANTERN) 2021 • Sreyasi Nag Chowdhury, Rajarshi Bhowmik, Hareesh Ravi, Gerard de Melo, Simon Razniewski, Gerhard Weikum
Modern web content - news articles, blog posts, educational resources, marketing brochures - is predominantly multimodal.
no code implementations • 28 Feb 2023 • Wonwoong Cho, Hareesh Ravi, Midhun Harikumar, Vinh Khuc, Krishna Kumar Singh, Jingwan Lu, David I. Inouye, Ajinkya Kale
Second, we propose timestep-dependent weight scheduling for content and style features to further improve the performance.
no code implementations • 23 Feb 2023 • Pranav Aggarwal, Hareesh Ravi, Naveen Marri, Sachin Kelkar, Fengbin Chen, Vinh Khuc, Midhun Harikumar, Ritiz Tambi, Sudharshan Reddy Kakumanu, Purvak Lapsiya, Alvin Ghouas, Sarah Saber, Malavika Ramprasad, Baldo Faieta, Ajinkya Kale
We observe that Diffusion Prior can be used in a memory and compute efficient way to constrain the generation to a specific domain without altering the larger Diffusion Decoder.
no code implementations • 15 Feb 2023 • Hareesh Ravi, Sachin Kelkar, Midhun Harikumar, Ajinkya Kale
We combine this with structure preserving edits on the image decoder using existing approaches such as reverse DDIM to perform text guided image editing.
1 code implementation • 22 Sep 2021 • Malihe Alikhani, Fangda Han, Hareesh Ravi, Mubbasir Kapadia, Vladimir Pavlovic, Matthew Stone
Common image-text joint understanding techniques presume that images and the associated text can universally be characterized by a single implicit model.
2 code implementations • ICCV 2021 • Hareesh Ravi, Kushal Kafle, Scott Cohen, Jonathan Brandt, Mubbasir Kapadia
Visual storytelling and story comprehension are uniquely human skills that play a central role in how we learn about and experience the world.
1 code implementation • 9 Oct 2020 • Honglu Zhou, Hareesh Ravi, Carlos M. Muniz, Vahid Azizi, Linda Ness, Gerard de Melo, Mubbasir Kapadia
Given its crucial role, there is a need to better understand and model the dynamics of GitHub as a social platform.
1 code implementation • CVPR 2018 • Hareesh Ravi, Lezi Wang, Carlos Muniz, Leonid Sigal, Dimitris Metaxas, Mubbasir Kapadia
We propose an end-to-end network for the visual illustration of a sequence of sentences forming a story.