Search Results for author: Pramuditha Perera

Found 22 papers, 7 papers with code

Multi-Modal Hallucination Control by Visual Information Grounding

no code implementations • 20 Mar 2024 • Alessandro Favero, Luca Zancato, Matthew Trager, Siddharth Choudhary, Pramuditha Perera, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

In particular, we show that as more tokens are generated, the reliance on the visual prompt decreases, and this behavior strongly correlates with the emergence of hallucinations.

Hallucination Visual Question Answering (VQA)

Paper
Add Code

Meaning Representations from Trajectories in Autoregressive Models

1 code implementation • 23 Oct 2023 • Tian Yu Liu, Matthew Trager, Alessandro Achille, Pramuditha Perera, Luca Zancato, Stefano Soatto

We propose to extract meaning representations from autoregressive language models by considering the distribution of all possible trajectories extending an input text.

Paper
Code

Prompt Algebra for Task Composition

no code implementations • 1 Jun 2023 • Pramuditha Perera, Matthew Trager, Luca Zancato, Alessandro Achille, Stefano Soatto

We investigate whether prompts learned independently for different tasks can be later combined through prompt algebra to obtain a model that supports composition of tasks.

Attribute Classification

Paper
Add Code

Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge

no code implementations • 30 May 2023 • Xingyu Fu, Sheng Zhang, Gukyeong Kwon, Pramuditha Perera, Henghui Zhu, Yuhao Zhang, Alexander Hanbo Li, William Yang Wang, Zhiguo Wang, Vittorio Castelli, Patrick Ng, Dan Roth, Bing Xiang

The open-ended Visual Question Answering (VQA) task requires AI models to jointly reason over visual and natural language inputs using world knowledge.

Answer Selection Visual Question Answering +1

Paper
Add Code

Benchmarking Diverse-Modal Entity Linking with Generative Models

no code implementations • 27 May 2023 • Sijia Wang, Alexander Hanbo Li, Henry Zhu, Sheng Zhang, Chung-Wei Hang, Pramuditha Perera, Jie Ma, William Wang, Zhiguo Wang, Vittorio Castelli, Bing Xiang, Patrick Ng

Entities can be expressed in diverse formats, such as texts, images, or column names and cell values in tables.

Benchmarking Entity Linking +1

Paper
Add Code

Train/Test-Time Adaptation with Retrieval

no code implementations • CVPR 2023 • Luca Zancato, Alessandro Achille, Tian Yu Liu, Matthew Trager, Pramuditha Perera, Stefano Soatto

Second, we apply ${\rm T^3AR}$ for test-time adaptation and show that exploiting a pool of external images at test-time leads to more robust representations over existing methods on DomainNet-126 and VISDA-C, especially when few adaptation data are available (up to 8%).

Retrieval Test-time Adaptation

Paper
Add Code

Linear Spaces of Meanings: Compositional Structures in Vision-Language Models

no code implementations • ICCV 2023 • Matthew Trager, Pramuditha Perera, Luca Zancato, Alessandro Achille, Parminder Bhatia, Stefano Soatto

These vectors can be seen as "ideal words" for generating concepts directly within the embedding space of the model.

Disentanglement Retrieval

Paper
Add Code

À-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting

no code implementations • 15 Feb 2023 • Benjamin Bowman, Alessandro Achille, Luca Zancato, Matthew Trager, Pramuditha Perera, Giovanni Paolini, Stefano Soatto

During inference, models can be assembled based on arbitrary selections of data sources, which we call "\`a-la-carte learning".

Continual Learning

Paper
Add Code

A-La-Carte Prompt Tuning (APT): Combining Distinct Data via Composable Prompting

no code implementations • CVPR 2023 • Benjamin Bowman, Alessandro Achille, Luca Zancato, Matthew Trager, Pramuditha Perera, Giovanni Paolini, Stefano Soatto

During inference, models can be assembled based on arbitrary selections of data sources, which we call a-la-carte learning.

Continual Learning

Paper
Add Code

Open-set Adversarial Defense with Clean-Adversarial Mutual Learning

1 code implementation • 12 Feb 2022 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel

This paper proposes an Open-Set Defense Network with Clean-Adversarial Mutual Learning (OSDN-CAML) as a solution to the OSAD problem.

Adversarial Defense Denoising +2

Paper
Code

Federated Generalized Face Presentation Attack Detection

no code implementations • 14 Apr 2021 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel

A face presentation attack detection model with good generalization can be obtained when it is trained with face images from different input distributions and different types of spoof attacks.

Disentanglement Face Presentation Attack Detection +2

Paper
Add Code

A Joint Representation Learning and Feature Modeling Approach for One-class Recognition

no code implementations • 24 Jan 2021 • Pramuditha Perera, Vishal Patel

First, we learn generative features using the one-class data with a generative framework.

General Classification One-Class Classification +1

Paper
Add Code

One-Class Classification: A Survey

no code implementations • 8 Jan 2021 • Pramuditha Perera, Poojan Oza, Vishal M. Patel

One-Class Classification (OCC) is a special case of multi-class classification, where data observed during training is from a single positive class.

Classification General Classification +2

Paper
Add Code

Open-set Adversarial Defense

1 code implementation • ECCV 2020 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel

In this paper, we show that open-set recognition systems are vulnerable to adversarial attacks.

Adversarial Defense Denoising +1

Paper
Code

Anomaly Detection-Based Unknown Face Presentation Attack Detection

1 code implementation • 11 Jul 2020 • Yashasvi Baweja, Poojan Oza, Pramuditha Perera, Vishal M. Patel

Anomaly detection-based spoof attack detection is a recent development in face Presentation Attack Detection (fPAD), where a spoof detector is learned using only non-attacked images of users.

Anomaly Detection Face Presentation Attack Detection +1

Paper
Code

Quickest Intruder Detection for Multiple User Active Authentication

no code implementations • 21 Jun 2020 • Pramuditha Perera, Julian Fierrez, Vishal M. Patel

In this paper, we investigate how to detect intruders with low latency for Active Authentication (AA) systems with multiple-users.

Change Detection

Paper
Add Code

Generative-Discriminative Feature Representations for Open-Set Recognition

no code implementations • CVPR 2020 • Pramuditha Perera, Vlad I. Morariu, Rajiv Jain, Varun Manjunatha, Curtis Wigington, Vicente Ordonez, Vishal M. Patel

We address the problem of open-set recognition, where the goal is to determine if a given sample belongs to one of the classes used for training a model (known classes).

Open Set Learning

Paper
Add Code

Federated Face Presentation Attack Detection

no code implementations • 29 May 2020 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel

A face presentation attack detection model with good generalization can be obtained when it is trained with face images from different input distributions and different types of spoof attacks.

Face Anti-Spoofing Face Presentation Attack Detection +2

Paper
Add Code

OCGAN: One-class Novelty Detection Using GANs with Constrained Latent Representations

no code implementations • CVPR 2019 • Pramuditha Perera, Ramesh Nallapati, Bing Xiang

The key contribution of our work is our proposal to explicitly constrain the latent space to exclusively represent the given class.

Ranked #6 on Anomaly Detection on Hyper-Kvasir Dataset

Anomaly Detection Denoising +1

Paper
Add Code

Deep Transfer Learning for Multiple Class Novelty Detection

1 code implementation • CVPR 2019 • Pramuditha Perera, Vishal M. Patel

We show that thresholding the maximal activation of the proposed network can be used to identify novel objects effectively.

Novelty Detection Transfer Learning

Paper
Code

Learning Deep Features for One-Class Classification

5 code implementations • 16 Jan 2018 • Pramuditha Perera, Vishal M. Patel

We propose a deep learning-based solution for the problem of feature learning in one-class classification.

Descriptive General Classification +3

185

Paper
Code

In2I : Unsupervised Multi-Image-to-Image Translation Using Generative Adversarial Networks

1 code implementation • 26 Nov 2017 • Pramuditha Perera, Mahdi Abavisani, Vishal M. Patel

In unsupervised image-to-image translation, the goal is to learn the mapping between an input image and an output image using a set of unpaired training images.

Ranked #1 on Multimodal Unsupervised Image-To-Image Translation on EPFL NIR-VIS

Generative Adversarial Network Multimodal Unsupervised Image-To-Image Translation +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.