Search Results for author: Aman Gupta

Found 12 papers, 0 papers with code

A Precise Characterization of SGD Stability Using Loss Surface Geometry

no code implementations22 Jan 2024 Gregory Dexter, Borja Ocejo, Sathiya Keerthi, Aman Gupta, Ayan Acharya, Rajiv Khanna

In this paper, we delve deeper into the relationship between linear stability and sharpness.

MultiSlot ReRanker: A Generic Model-based Re-Ranking Framework in Recommendation Systems

no code implementations11 Jan 2024 Qiang Charles Xiao, Ajith Muralidharan, Birjodh Tiwana, Johnson Jia, Fedor Borisyuk, Aman Gupta, Dawn Woodard

In this paper, we propose a generic model-based re-ranking framework, MultiSlot ReRanker, which simultaneously optimizes relevance, diversity, and freshness.

OpenAI Gym Recommendation Systems +1

QuantEase: Optimization-based Quantization for Language Models

no code implementations5 Sep 2023 Kayhan Behdin, Ayan Acharya, Aman Gupta, Qingquan Song, Siyu Zhu, Sathiya Keerthi, Rahul Mazumder

Particularly noteworthy is our outlier-aware algorithm's capability to achieve near or sub-3-bit quantization of LLMs with an acceptable drop in accuracy, obviating the need for non-uniform quantization or grouping techniques, improving upon methods such as SpQR by up to two times in terms of perplexity.

Quantization

mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

no code implementations19 Feb 2023 Kayhan Behdin, Qingquan Song, Aman Gupta, Sathiya Keerthi, Ayan Acharya, Borja Ocejo, Gregory Dexter, Rajiv Khanna, David Durfee, Rahul Mazumder

Modern deep learning models are over-parameterized, where different optima can result in widely varying generalization performance.

Image Classification

Heterogeneous Calibration: A post-hoc model-agnostic framework for improved generalization

no code implementations10 Feb 2022 David Durfee, Aman Gupta, Kinjal Basu

We introduce the notion of heterogeneous calibration that applies a post-hoc model-agnostic transformation to model outputs for improving AUC performance on binary classification tasks.

Binary Classification

Logit Attenuating Weight Normalization

no code implementations12 Aug 2021 Aman Gupta, Rohan Ramanath, Jun Shi, Anika Ramachandran, Sirou Zhou, Mingzhou Zhou, S. Sathiya Keerthi

Over-parameterized deep networks trained using gradient-based optimizers are a popular choice for solving classification and ranking problems.

Image Classification Recommendation Systems

Transitioning from Real to Synthetic data: Quantifying the bias in model

no code implementations10 May 2021 Aman Gupta, Deepak Bhatt, Anubha Pandey

This study aims to establish a trade-off between bias and fairness in the models trained using synthetic data.

Fairness Synthetic Data Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.