Search Results for author: Ashwin Swaminathan

Found 11 papers, 0 papers with code

Mixed-Query Transformer: A Unified Image Segmentation Architecture

no code implementations6 Apr 2024 Pei Wang, Zhaowei Cai, Hao Yang, Ashwin Swaminathan, R. Manmatha, Stefano Soatto

Existing unified image segmentation models either employ a unified architecture across multiple tasks but use separate weights tailored to each dataset, or apply a single set of weights to multiple datasets but are limited to a single task.

Data Augmentation Image Segmentation +2

CPR: Retrieval Augmented Generation for Copyright Protection

no code implementations27 Mar 2024 Aditya Golatkar, Alessandro Achille, Luca Zancato, Yu-Xiang Wang, Ashwin Swaminathan, Stefano Soatto

To reduce risks of leaking private information contained in the retrieved set, we introduce Copy-Protected generation with Retrieval (CPR), a new method for RAG with strong copyright protection guarantees in a mixed-private setting for diffusion models. CPR allows to condition the output of diffusion models on a set of retrieved images, while also guaranteeing that unique identifiable information about those example is not exposed in the generated outputs.

Image Generation Machine Unlearning +1

Multi-Modal Hallucination Control by Visual Information Grounding

no code implementations20 Mar 2024 Alessandro Favero, Luca Zancato, Matthew Trager, Siddharth Choudhary, Pramuditha Perera, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

In particular, we show that as more tokens are generated, the reliance on the visual prompt decreases, and this behavior strongly correlates with the emergence of hallucinations.

Hallucination Visual Question Answering (VQA)

Fast Sparse View Guided NeRF Update for Object Reconfigurations

no code implementations16 Mar 2024 Ziqi Lu, Jianbo Ye, Xiaohan Fei, Xiaolong Li, Jiawei Mo, Ashwin Swaminathan, Stefano Soatto

Neural Radiance Field (NeRF), as an implicit 3D scene representation, lacks inherent ability to accommodate changes made to the initial static scene.

A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D

no code implementations29 Feb 2024 Xiaohan Fei, Chethan Parameshwara, Jiawei Mo, Xiaolong Li, Ashwin Swaminathan, Cj Taylor, Paolo Favaro, Stefano Soatto

However, the SDS method is also the source of several artifacts, such as the Janus problem, the misalignment between the text prompt and the generated 3D model, and 3D model inaccuracies.

Image Generation Text to 3D

Training Data Protection with Compositional Diffusion Models

no code implementations2 Aug 2023 Aditya Golatkar, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

We introduce Compartmentalized Diffusion Models (CDM), a method to train different diffusion models (or prompts) on distinct data sources and arbitrarily compose them at inference time.

Continual Learning Memorization +1

Towards Visual Foundational Models of Physical Scenes

no code implementations6 Jun 2023 Chethan Parameshwara, Alessandro Achille, Xiaolong Li, Jiawei Mo, Matthew Trager, Ashwin Swaminathan, Cj Taylor, Dheera Venkatraman, Xiaohan Fei, Stefano Soatto

We describe a first step towards learning general-purpose visual representations of physical scenes using only image prediction as a training criterion.

SAFE: Machine Unlearning With Shard Graphs

no code implementations ICCV 2023 Yonatan Dukler, Benjamin Bowman, Alessandro Achille, Aditya Golatkar, Ashwin Swaminathan, Stefano Soatto

We present Synergy Aware Forgetting Ensemble (SAFE), a method to adapt large models on a diverse collection of data while minimizing the expected cost to remove the influence of training samples from the trained model.

Machine Unlearning

Learning Expressive Prompting With Residuals for Vision Transformers

no code implementations CVPR 2023 Rajshekhar Das, Yonatan Dukler, Avinash Ravichandran, Ashwin Swaminathan

Prompt learning is an efficient approach to adapt transformers by inserting learnable set of parameters into the input and intermediate representations of a pre-trained model.

Few-Shot Learning Image Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.