no code implementations • 23 Feb 2023 • Pranav Aggarwal, Hareesh Ravi, Naveen Marri, Sachin Kelkar, Fengbin Chen, Vinh Khuc, Midhun Harikumar, Ritiz Tambi, Sudharshan Reddy Kakumanu, Purvak Lapsiya, Alvin Ghouas, Sarah Saber, Malavika Ramprasad, Baldo Faieta, Ajinkya Kale
We observe that Diffusion Prior can be used in a memory and compute efficient way to constrain the generation to a specific domain without altering the larger Diffusion Decoder.
no code implementations • 10 Mar 2022 • Dan Ruta, Andrew Gilbert, Pranav Aggarwal, Naveen Marri, Ajinkya Kale, Jo Briggs, Chris Speed, Hailin Jin, Baldo Faieta, Alex Filipkowski, Zhe Lin, John Collomosse
We present StyleBabel, a unique open access dataset of natural language captions and free-form tags describing the artistic style of over 135K digital artworks, collected via a novel participatory method from experts studying at specialist art and design schools.
2 code implementations • 15 Sep 2021 • Pranav Aggarwal, Ritiz Tambi, Ajinkya Kale
There has been a recent spike in interest in multi-modal Language and Vision problems.
1 code implementation • 24 Nov 2020 • Pranav Aggarwal, Ajinkya Kale
There has been a recent spike in interest in multi-modal Language and Vision problems.
no code implementations • 4 Oct 2020 • Aashish Kumar Misraa, Ajinkya Kale, Pranav Aggarwal, Ali Aminian
Most real world applications of image retrieval such as Adobe Stock, which is a marketplace for stock photography and illustrations, need a way for users to find images which are both visually (i. e. aesthetically) and conceptually (i. e. containing the same salient objects) as a query image.
no code implementations • 30 May 2019 • Pranav Aggarwal, Zhe Lin, Baldo Faieta, Saeid Motiian
In this paper, we propose a new method for learning text-visual embedding using both image titles and click-through data from an image search engine.
no code implementations • 4 Dec 2017 • Yueru Chen, Pranav Aggarwal, Jongmoo Choi, C. -C. Jay Kuo
A drone monitoring system that integrates deep-learning-based detection and tracking modules is proposed in this work.