Search Results for author: Ashwin Swaminathan

Found 11 papers, 0 papers with code

Mixed-Query Transformer: A Unified Image Segmentation Architecture

no code implementations • 6 Apr 2024 • Pei Wang, Zhaowei Cai, Hao Yang, Ashwin Swaminathan, R. Manmatha, Stefano Soatto

Existing unified image segmentation models either employ a unified architecture across multiple tasks but use separate weights tailored to each dataset, or apply a single set of weights to multiple datasets but are limited to a single task.

Data Augmentation Image Segmentation +2

Paper
Add Code

On the Scalability of Diffusion-based Text-to-Image Generation

no code implementations • 3 Apr 2024 • Hao Li, Yang Zou, Ying Wang, Orchid Majumder, Yusheng Xie, R. Manmatha, Ashwin Swaminathan, Zhuowen Tu, Stefano Ermon, Stefano Soatto

On the data scaling side, we show the quality and diversity of the training set matters more than simply dataset size.

Denoising Text-to-Image Generation

Paper
Add Code

CPR: Retrieval Augmented Generation for Copyright Protection

no code implementations • 27 Mar 2024 • Aditya Golatkar, Alessandro Achille, Luca Zancato, Yu-Xiang Wang, Ashwin Swaminathan, Stefano Soatto

To reduce risks of leaking private information contained in the retrieved set, we introduce Copy-Protected generation with Retrieval (CPR), a new method for RAG with strong copyright protection guarantees in a mixed-private setting for diffusion models. CPR allows to condition the output of diffusion models on a set of retrieved images, while also guaranteeing that unique identifiable information about those example is not exposed in the generated outputs.

Image Generation Machine Unlearning +1

Paper
Add Code

Multi-Modal Hallucination Control by Visual Information Grounding

no code implementations • 20 Mar 2024 • Alessandro Favero, Luca Zancato, Matthew Trager, Siddharth Choudhary, Pramuditha Perera, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

In particular, we show that as more tokens are generated, the reliance on the visual prompt decreases, and this behavior strongly correlates with the emergence of hallucinations.

Hallucination Visual Question Answering (VQA)

Paper
Add Code

Fast Sparse View Guided NeRF Update for Object Reconfigurations

no code implementations • 16 Mar 2024 • Ziqi Lu, Jianbo Ye, Xiaohan Fei, Xiaolong Li, Jiawei Mo, Ashwin Swaminathan, Stefano Soatto

Neural Radiance Field (NeRF), as an implicit 3D scene representation, lacks inherent ability to accommodate changes made to the initial static scene.

Paper
Add Code

A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D

no code implementations • 29 Feb 2024 • Xiaohan Fei, Chethan Parameshwara, Jiawei Mo, Xiaolong Li, Ashwin Swaminathan, Cj Taylor, Paolo Favaro, Stefano Soatto

However, the SDS method is also the source of several artifacts, such as the Janus problem, the misalignment between the text prompt and the generated 3D model, and 3D model inaccuracies.

Image Generation Text to 3D

Paper
Add Code

Training Data Protection with Compositional Diffusion Models

no code implementations • 2 Aug 2023 • Aditya Golatkar, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

We introduce Compartmentalized Diffusion Models (CDM), a method to train different diffusion models (or prompts) on distinct data sources and arbitrarily compose them at inference time.

Continual Learning Memorization +1

Paper
Add Code

Towards Visual Foundational Models of Physical Scenes

no code implementations • 6 Jun 2023 • Chethan Parameshwara, Alessandro Achille, Xiaolong Li, Jiawei Mo, Matthew Trager, Ashwin Swaminathan, Cj Taylor, Dheera Venkatraman, Xiaohan Fei, Stefano Soatto

We describe a first step towards learning general-purpose visual representations of physical scenes using only image prediction as a training criterion.

Paper
Add Code

SAFE: Machine Unlearning With Shard Graphs

no code implementations • ICCV 2023 • Yonatan Dukler, Benjamin Bowman, Alessandro Achille, Aditya Golatkar, Ashwin Swaminathan, Stefano Soatto

We present Synergy Aware Forgetting Ensemble (SAFE), a method to adapt large models on a diverse collection of data while minimizing the expected cost to remove the influence of training samples from the trained model.

Machine Unlearning

Paper
Add Code

Learning Expressive Prompting With Residuals for Vision Transformers

no code implementations • CVPR 2023 • Rajshekhar Das, Yonatan Dukler, Avinash Ravichandran, Ashwin Swaminathan

Prompt learning is an efficient approach to adapt transformers by inserting learnable set of parameters into the input and intermediate representations of a pre-trained model.

Few-Shot Learning Image Classification +2

Paper
Add Code

A Meta-Learning Approach to Predicting Performance and Data Requirements

no code implementations • CVPR 2023 • Achin Jain, Gurumurthy Swaminathan, Paolo Favaro, Hao Yang, Avinash Ravichandran, Hrayr Harutyunyan, Alessandro Achille, Onkar Dabeer, Bernt Schiele, Ashwin Swaminathan, Stefano Soatto

The PPL improves the performance estimation on average by 37% across 16 classification and 33% across 10 detection datasets, compared to the power law.

Meta-Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.