Search Results for author: Suraj Kothawade

Found 31 papers, 9 papers with code

Imagen 3

2 code implementations13 Aug 2024 Imagen-Team-Google, :, Jason Baldridge, Jakob Bauer, Mukul Bhutani, Nicole Brichtova, Andrew Bunner, Lluis Castrejon, Kelvin Chan, YiChang Chen, Sander Dieleman, Yuqing Du, Zach Eaton-Rosen, Hongliang Fei, Nando de Freitas, Yilin Gao, Evgeny Gladchenko, Sergio Gómez Colmenarejo, Mandy Guo, Alex Haig, Will Hawkins, Hexiang Hu, Huilian Huang, Tobenna Peter Igwe, Siavash Khodadadeh, Yelin Kim, Ksenia Konyushkova, Karol Langner, Eric Lau, Rory Lawton, Shixin Luo, Soňa Mokrá, Henna Nandwani, Yasumasa Onoe, Aäron van den Oord, Zarana Parekh, Jordi Pont-Tuset, Hang Qi, Rui Qian, Deepak Ramachandran, Poorva Rane, Abdullah Rashwan, Robert Riachi, Hansa Srinivasan, Srivatsan Srinivasan, Robin Strudel, Benigno Uria, Oliver Wang, Su Wang, Austin Waters, Chris Wolff, Auriel Wright, Zhisheng Xiao, Hao Xiong, Keyang Xu, Marc van Zee, Junlin Zhang, Katie Zhang, Wenlei Zhou, Konrad Zolna, Ola Aboubakar, Canfer Akbulut, Oscar Akerlund, Isabela Albuquerque, Nina Anderson, Marco Andreetto, Lora Aroyo, Ben Bariach, David Barker, Sherry Ben, Dana Berman, Courtney Biles, Irina Blok, Pankil Botadra, Jenny Brennan, Karla Brown, John Buckley, Rudy Bunel, Elie Bursztein, Christina Butterfield, Ben Caine, Viral Carpenter, Norman Casagrande, Ming-Wei Chang, Solomon Chang, Shamik Chaudhuri, Tony Chen, John Choi, Dmitry Churbanau, Nathan Clement, Matan Cohen, Forrester Cole, Mikhail Dektiarev, Vincent Du, Praneet Dutta, Tom Eccles, Ndidi Elue, Ashley Feden, Shlomi Fruchter, Frankie Garcia, Roopal Garg, Weina Ge, Ahmed Ghazy, Bryant Gipson, Andrew Goodman, Dawid Górny, Sven Gowal, Khyatti Gupta, Yoni Halpern, Yena Han, Susan Hao, Jamie Hayes, Jonathan Heek, Amir Hertz, Ed Hirst, Emiel Hoogeboom, Tingbo Hou, Heidi Howard, Mohamed Ibrahim, Dirichi Ike-Njoku, Joana Iljazi, Vlad Ionescu, William Isaac, Reena Jana, Gemma Jennings, Donovon Jenson, Xuhui Jia, Kerry Jones, Xiaoen Ju, Ivana Kajic, Christos Kaplanis, Burcu Karagol Ayan, Jacob Kelly, Suraj Kothawade, Christina Kouridi, Ira Ktena, Jolanda Kumakaw, Dana Kurniawan, Dmitry Lagun, Lily Lavitas, Jason Lee, Tao Li, Marco Liang, Maggie Li-Calis, Yuchi Liu, Javier Lopez Alberca, Matthieu Kim Lorrain, Peggy Lu, Kristian Lum, Yukun Ma, Chase Malik, John Mellor, Thomas Mensink, Inbar Mosseri, Tom Murray, Aida Nematzadeh, Paul Nicholas, Signe Nørly, João Gabriel Oliveira, Guillermo Ortiz-Jimenez, Michela Paganini, Tom Le Paine, Roni Paiss, Alicia Parrish, Anne Peckham, Vikas Peswani, Igor Petrovski, Tobias Pfaff, Alex Pirozhenko, Ryan Poplin, Utsav Prabhu, Yuan Qi, Matthew Rahtz, Cyrus Rashtchian, Charvi Rastogi, Amit Raul, Ali Razavi, Sylvestre-Alvise Rebuffi, Susanna Ricco, Felix Riedel, Dirk Robinson, Pankaj Rohatgi, Bill Rosgen, Sarah Rumbley, MoonKyung Ryu, Anthony Salgado, Tim Salimans, Sahil Singla, Florian Schroff, Candice Schumann, Tanmay Shah, Eleni Shaw, Gregory Shaw, Brendan Shillingford, Kaushik Shivakumar, Dennis Shtatnov, Zach Singer, Evgeny Sluzhaev, Valerii Sokolov, Thibault Sottiaux, Florian Stimberg, Brad Stone, David Stutz, Yu-Chuan Su, Eric Tabellion, Shuai Tang, David Tao, Kurt Thomas, Gregory Thornton, Andeep Toor, Cristian Udrescu, Aayush Upadhyay, Cristina Vasconcelos, Alex Vasiloff, Andrey Voynov, Amanda Walker, Luyu Wang, Miaosen Wang, Simon Wang, Stanley Wang, Qifei Wang, Yuxiao Wang, Ágoston Weisz, Olivia Wiles, Chenxia Wu, Xingyu Federico Xu, Andrew Xue, Jianbo Yang, Luo Yu, Mete Yurtoglu, Ali Zand, Han Zhang, Jiageng Zhang, Catherine Zhao, Adilet Zhaxybay, Miao Zhou, Shengqi Zhu, Zhenkai Zhu, Dawn Bloxwich, Mahyar Bordbar, Luis C. Cobo, Eli Collins, Shengyang Dai, Tulsee Doshi, Anca Dragan, Douglas Eck, Demis Hassabis, Sissie Hsiao, Tom Hume, Koray Kavukcuoglu, Helen King, Jack Krawczyk, Yeqing Li, Kathy Meier-Hellstern, Andras Orban, Yury Pinsky, Amar Subramanya, Oriol Vinyals, Ting Yu, Yori Zwols

We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts.

Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning

1 code implementation16 Jul 2024 Yanting Miao, William Loh, Suraj Kothawade, Pascal Poupart, Abdullah Rashwan, Yeqing Li

In this work, we present the $\lambda$-Harmonic reward function, which provides a reliable reward signal and enables early stopping for faster training and effective regularization.

Model Selection Text-to-Image Generation

Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis

no code implementations8 Jul 2024 Emaad Khwaja, Abdullah Rashwan, Ting Chen, Oliver Wang, Suraj Kothawade, Yeqing Li

We present a one-shot text-to-image diffusion model that can generate high-resolution images from natural language descriptions.

Image Generation Super-Resolution

Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation

no code implementations7 Feb 2024 Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha

This paper introduces a novel approach to leverage the generalizability of Diffusion Models for Source-Free Domain Adaptation (DM-SFDA).

Source-Free Domain Adaptation Unsupervised Domain Adaptation

Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation

no code implementations2 Oct 2023 Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha

Domain Adaptation (DA) is a method for enhancing a model's performance on a target domain with inadequate annotated data by applying the information the model has acquired from a related source domain with sufficient labeled data.

Source-Free Domain Adaptation

SCoRe: Submodular Combinatorial Representation Learning

no code implementations29 Sep 2023 Anay Majee, Suraj Kothawade, KrishnaTeja Killamsetty, Rishabh Iyer

In this paper we introduce the SCoRe (Submodular Combinatorial Representation Learning) framework, a novel approach in representation learning that addresses inter-class bias and intra-class variance.

Autonomous Navigation Contrastive Learning +6

Two-Step Active Learning for Instance Segmentation with Uncertainty and Diversity Sampling

no code implementations28 Sep 2023 Ke Yu, Stephen Albro, Giulia Desalvo, Suraj Kothawade, Abdullah Rashwan, Sasan Tavakkol, Kayhan Batmanghelich, Xiaoqi Yin

Training high-quality instance segmentation models requires an abundance of labeled images with instance masks and classifications, which is often expensive to procure.

Active Learning Diversity +4

Beyond Active Learning: Leveraging the Full Potential of Human Interaction via Auto-Labeling, Human Correction, and Human Verification

no code implementations2 Jun 2023 Nathan Beck, KrishnaTeja Killamsetty, Suraj Kothawade, Rishabh Iyer

Active Learning (AL) is a human-in-the-loop framework to interactively and adaptively label data instances, thereby enabling significant gains in model performance compared to random sampling.

Active Learning

STREAMLINE: Streaming Active Learning for Realistic Multi-Distributional Settings

1 code implementation18 May 2023 Nathan Beck, Suraj Kothawade, Pradeep Shenoy, Rishabh Iyer

However, learning unbiased models depends on building a dataset that is representative of a diverse range of realistic scenarios for a given task.

Active Learning Autonomous Vehicles +3

DIAGNOSE: Avoiding Out-of-distribution Data using Submodular Information Measures

no code implementations4 Oct 2022 Suraj Kothawade, Akshit Srivastava, Venkat Iyer, Ganesh Ramakrishnan, Rishabh Iyer

Avoiding out-of-distribution (OOD) data is critical for training supervised machine learning models in the medical imaging domain.

Active Learning

CLINICAL: Targeted Active Learning for Imbalanced Medical Image Classification

no code implementations4 Oct 2022 Suraj Kothawade, Atharv Savarkar, Venkat Iyer, Lakshman Tamil, Ganesh Ramakrishnan, Rishabh Iyer

It is often the case that a suboptimal performance is obtained on some classes due to the natural class imbalance issue that comes with medical data.

Active Learning Image Classification +1

Object-Level Targeted Selection via Deep Template Matching

no code implementations5 Jul 2022 Suraj Kothawade, Donna Roy, Michele Fenzi, Elmar Haussmann, Jose M. Alvarez, Christoph Angerer

Existing semantic image retrieval methods often focus on mining for larger sized geographical landmarks, and/or require extra labeled data, such as images/image-pairs with similar objects, for mining images with generic objects.

Autonomous Driving Image Retrieval +3

BASIL: Balanced Active Semi-supervised Learning for Class Imbalanced Datasets

no code implementations10 Mar 2022 Suraj Kothawade, Pavan Kumar Reddy, Ganesh Ramakrishnan, Rishabh Iyer

This issue is further pronounced in SSL methods, as they would use this biased model to obtain psuedo-labels (on the unlabeled data) during training.

Active Learning

PLATINUM: Semi-Supervised Model Agnostic Meta-Learning using Submodular Mutual Information

1 code implementation30 Jan 2022 Changbin Li, Suraj Kothawade, Feng Chen, Rishabh Iyer

Meta learning has proven to be able to learn a parametrized model for FSC by training on various other classification tasks.

Meta-Learning

AUTO-DISCERN: Autonomous Driving Using Common Sense Reasoning

no code implementations17 Oct 2021 Suraj Kothawade, Vinaya Khandelwal, Kinjal Basu, Huaduo Wang, Gopal Gupta

That is, while machine learning technology is good for observing and automatically understanding the surroundings of an automobile, driving decisions are better automated via commonsense reasoning rather than machine learning.

Autonomous Driving BIG-bench Machine Learning +3

DITTO: Data-efficient and Fair Targeted Subset Selection for ASR Accent Adaptation

no code implementations10 Oct 2021 Suraj Kothawade, Anmol Mekala, Chandra Sekhara D, Mayank Kothyari, Rishabh Iyer, Ganesh Ramakrishnan, Preethi Jyothi

To address this problem, we propose DITTO (Data-efficient and faIr Targeted subseT selectiOn) that uses Submodular Mutual Information (SMI) functions as acquisition functions to find the most informative set of utterances matching a target accent within a fixed budget.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Submodular Mutual Information for Targeted Data Subset Selection

no code implementations30 Apr 2021 Suraj Kothawade, Vishal Kaushal, Ganesh Ramakrishnan, Jeff Bilmes, Rishabh Iyer

With the rapid growth of data, it is becoming increasingly difficult to train or improve deep learning models with the right subset of data.

Active Learning Image Classification

PRISM: A Rich Class of Parameterized Submodular Information Measures for Guided Subset Selection

1 code implementation27 Feb 2021 Suraj Kothawade, Vishal Kaushal, Ganesh Ramakrishnan, Jeff Bilmes, Rishabh Iyer

Examples of such problems include: i)targeted learning, where the goal is to find subsets with rare classes or rare attributes on which the model is underperforming, and ii)guided summarization, where data (e. g., image collection, text, document or video) is summarized for quicker human consumption with specific additional user intent.

Image Classification

Deep Submodular Networks for Extractive Data Summarization

no code implementations16 Oct 2020 Suraj Kothawade, Jiten Girdhar, Chandrashekhar Lavania, Rishabh Iyer

Unfortunately, these models only learn the relative importance of the different submodular functions (such as diversity, representation or importance), but cannot learn more complex feature representations, which are often required for state-of-the-art performance.

Data Summarization Diversity

Realistic Video Summarization through VISIOCITY: A New Benchmark and Evaluation Framework

no code implementations29 Jul 2020 Vishal Kaushal, Suraj Kothawade, Rishabh Iyer, Ganesh Ramakrishnan

Thirdly, we demonstrate that in the presence of multiple ground truth summaries (due to the highly subjective nature of the task), learning from a single combined ground truth summary using a single loss function is not a good idea.

Benchmarking Video Summarization

Content Based Image Retrieval from AWiFS Images Repository of IRS Resourcesat-2 Satellite Based on Water Bodies and Burnt Areas

no code implementations26 Sep 2018 Suraj Kothawade, Kunjan Mhaske, Sahil Sharma, Furkhan Shaikh

Satellite Remote Sensing Technology is becoming a major milestone in the prediction of weather anomalies, natural disasters as well as finding alternative resources in proximity using multiple multi-spectral sensors emitting electromagnetic waves at distinct wavelengths.

Content-Based Image Retrieval Retrieval +1

Vis-DSS: An Open-Source toolkit for Visual Data Selection and Summarization

1 code implementation24 Sep 2018 Rishabh Iyer, Pratik Dubal, Kunal Dargan, Suraj Kothawade, Rohan Mahadev, Vishal Kaushal

With increasing amounts of visual data being created in the form of videos and images, visual data selection and summarization are becoming ever increasing problems.

Active Learning Video Summarization

Deployment of Customized Deep Learning based Video Analytics On Surveillance Cameras

no code implementations27 May 2018 Pratik Dubal, Rohan Mahadev, Suraj Kothawade, Kunal Dargan, Rishabh Iyer

To our knowledge, this is the first work which provides a comprehensive evaluation of different deep learning models on various real-world customer deployment scenarios of surveillance video analytics.

Attribute Deep Learning +3

Cannot find the paper you are looking for? You can Submit a new open access paper.