no code implementations • 20 Mar 2024 • Alessandro Favero, Luca Zancato, Matthew Trager, Siddharth Choudhary, Pramuditha Perera, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto
In particular, we show that as more tokens are generated, the reliance on the visual prompt decreases, and this behavior strongly correlates with the emergence of hallucinations.
1 code implementation • 23 Oct 2023 • Tian Yu Liu, Matthew Trager, Alessandro Achille, Pramuditha Perera, Luca Zancato, Stefano Soatto
We propose to extract meaning representations from autoregressive language models by considering the distribution of all possible trajectories extending an input text.
no code implementations • 1 Jun 2023 • Pramuditha Perera, Matthew Trager, Luca Zancato, Alessandro Achille, Stefano Soatto
We investigate whether prompts learned independently for different tasks can be later combined through prompt algebra to obtain a model that supports composition of tasks.
no code implementations • 30 May 2023 • Xingyu Fu, Sheng Zhang, Gukyeong Kwon, Pramuditha Perera, Henghui Zhu, Yuhao Zhang, Alexander Hanbo Li, William Yang Wang, Zhiguo Wang, Vittorio Castelli, Patrick Ng, Dan Roth, Bing Xiang
The open-ended Visual Question Answering (VQA) task requires AI models to jointly reason over visual and natural language inputs using world knowledge.
no code implementations • 27 May 2023 • Sijia Wang, Alexander Hanbo Li, Henry Zhu, Sheng Zhang, Chung-Wei Hang, Pramuditha Perera, Jie Ma, William Wang, Zhiguo Wang, Vittorio Castelli, Bing Xiang, Patrick Ng
Entities can be expressed in diverse formats, such as texts, images, or column names and cell values in tables.
no code implementations • CVPR 2023 • Luca Zancato, Alessandro Achille, Tian Yu Liu, Matthew Trager, Pramuditha Perera, Stefano Soatto
Second, we apply ${\rm T^3AR}$ for test-time adaptation and show that exploiting a pool of external images at test-time leads to more robust representations over existing methods on DomainNet-126 and VISDA-C, especially when few adaptation data are available (up to 8%).
no code implementations • ICCV 2023 • Matthew Trager, Pramuditha Perera, Luca Zancato, Alessandro Achille, Parminder Bhatia, Stefano Soatto
These vectors can be seen as "ideal words" for generating concepts directly within the embedding space of the model.
no code implementations • 15 Feb 2023 • Benjamin Bowman, Alessandro Achille, Luca Zancato, Matthew Trager, Pramuditha Perera, Giovanni Paolini, Stefano Soatto
During inference, models can be assembled based on arbitrary selections of data sources, which we call "\`a-la-carte learning".
no code implementations • CVPR 2023 • Benjamin Bowman, Alessandro Achille, Luca Zancato, Matthew Trager, Pramuditha Perera, Giovanni Paolini, Stefano Soatto
During inference, models can be assembled based on arbitrary selections of data sources, which we call a-la-carte learning.
1 code implementation • 12 Feb 2022 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel
This paper proposes an Open-Set Defense Network with Clean-Adversarial Mutual Learning (OSDN-CAML) as a solution to the OSAD problem.
no code implementations • 14 Apr 2021 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel
A face presentation attack detection model with good generalization can be obtained when it is trained with face images from different input distributions and different types of spoof attacks.
no code implementations • 24 Jan 2021 • Pramuditha Perera, Vishal Patel
First, we learn generative features using the one-class data with a generative framework.
no code implementations • 8 Jan 2021 • Pramuditha Perera, Poojan Oza, Vishal M. Patel
One-Class Classification (OCC) is a special case of multi-class classification, where data observed during training is from a single positive class.
1 code implementation • ECCV 2020 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel
In this paper, we show that open-set recognition systems are vulnerable to adversarial attacks.
1 code implementation • 11 Jul 2020 • Yashasvi Baweja, Poojan Oza, Pramuditha Perera, Vishal M. Patel
Anomaly detection-based spoof attack detection is a recent development in face Presentation Attack Detection (fPAD), where a spoof detector is learned using only non-attacked images of users.
no code implementations • 21 Jun 2020 • Pramuditha Perera, Julian Fierrez, Vishal M. Patel
In this paper, we investigate how to detect intruders with low latency for Active Authentication (AA) systems with multiple-users.
no code implementations • CVPR 2020 • Pramuditha Perera, Vlad I. Morariu, Rajiv Jain, Varun Manjunatha, Curtis Wigington, Vicente Ordonez, Vishal M. Patel
We address the problem of open-set recognition, where the goal is to determine if a given sample belongs to one of the classes used for training a model (known classes).
no code implementations • 29 May 2020 • Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel
A face presentation attack detection model with good generalization can be obtained when it is trained with face images from different input distributions and different types of spoof attacks.
no code implementations • CVPR 2019 • Pramuditha Perera, Ramesh Nallapati, Bing Xiang
The key contribution of our work is our proposal to explicitly constrain the latent space to exclusively represent the given class.
Ranked #6 on Anomaly Detection on Hyper-Kvasir Dataset
1 code implementation • CVPR 2019 • Pramuditha Perera, Vishal M. Patel
We show that thresholding the maximal activation of the proposed network can be used to identify novel objects effectively.
5 code implementations • 16 Jan 2018 • Pramuditha Perera, Vishal M. Patel
We propose a deep learning-based solution for the problem of feature learning in one-class classification.
1 code implementation • 26 Nov 2017 • Pramuditha Perera, Mahdi Abavisani, Vishal M. Patel
In unsupervised image-to-image translation, the goal is to learn the mapping between an input image and an output image using a set of unpaired training images.
Generative Adversarial Network Multimodal Unsupervised Image-To-Image Translation +2