Search Results for author: Shashank Gupta

Found 29 papers, 13 papers with code

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

1 code implementation11 Sep 2024 Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot

To advance towards this goal, we introduce SUPER, the first benchmark designed to evaluate the capability of LLMs in setting up and executing tasks from research repositories.

Practical and Robust Safety Guarantees for Advanced Counterfactual Learning to Rank

no code implementations29 Jul 2024 Shashank Gupta, Harrie Oosterhuis, Maarten de Rijke

Our experiments show that both our novel safe doubly robust method and PRPO provide higher performance than the existing safe inverse propensity scoring approach.

counterfactual Learning-To-Rank

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

1 code implementation26 Jul 2024 Harsh Trivedi, Tushar Khot, Mareike Hartmann, Ruskin Manku, Vinty Dong, Edward Li, Shashank Gupta, Ashish Sabharwal, Niranjan Balasubramanian

To remedy this gap, we built $\textbf{AppWorld Engine}$, a high-quality execution environment (60K lines of code) of 9 day-to-day apps operable via 457 APIs and populated with realistic digital activities simulating the lives of ~100 fictitious users.

Benchmarking Code Generation

Optimal Baseline Corrections for Off-Policy Contextual Bandits

1 code implementation9 May 2024 Shashank Gupta, Olivier Jeunen, Harrie Oosterhuis, Maarten de Rijke

The foundation of our framework is the derivation of an equivalent baseline correction for all of the existing control variates.

Decision Making Multi-Armed Bandits +1

A First Look at Selection Bias in Preference Elicitation for Recommendation

1 code implementation1 May 2024 Shashank Gupta, Harrie Oosterhuis, Maarten de Rijke

Despite the fact that the extreme sparsity of preference elicitation interactions make them severely more prone to selection bias than natural interactions, the effect of selection bias in preference elicitation on the resulting recommendations has not been studied yet.

Recommendation Systems Selection bias

LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

1 code implementation29 Apr 2024 Parshin Shojaee, Kazem Meidani, Shashank Gupta, Amir Barati Farimani, Chandan K Reddy

Mathematical equations have been unreasonably effective in describing complex natural phenomena across various scientific disciplines.

Equation Discovery Large Language Model +3

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

no code implementations25 Apr 2024 Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, HaoNing Wu, Yixuan Gao, Yuqin Cao, ZiCheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, Jianquan Yang, Weigang Wang, Xi Fang, Xiaoxin Lv, Jun Yan, Tianwu Zhi, Yabin Zhang, Yaohui Li, Yang Li, Jingwen Xu, Jianzhao Liu, Yiting Liao, Junlin Li, Zihao Yu, Yiting Lu, Xin Li, Hossein Motamednia, S. Farhad Hosseini-Benvidi, Fengbin Guan, Ahmad Mahmoudi-Aznaveh, Azadeh Mansouri, Ganzorig Gankhuyag, Kihwan Yoon, Yifang Xu, Haotian Fan, Fangyuan Kong, Shiling Zhao, Weifeng Dong, Haibing Yin, Li Zhu, Zhiling Wang, Bingchen Huang, Avinab Saha, Sandeep Mishra, Shashank Gupta, Rajesh Sureddi, Oindrila Saha, Luigi Celona, Simone Bianco, Paolo Napoletano, Raimondo Schettini, Junfeng Yang, Jing Fu, Wei zhang, Wenzhi Cao, Limei Liu, Han Peng, Weijun Yuan, Zhan Li, Yihang Cheng, Yifan Deng, Haohui Li, Bowen Qu, Yao Li, Shuqing Luo, Shunzhou Wang, Wei Gao, Zihao Lu, Marcos V. Conde, Xinrui Wang, Zhibo Chen, Ruling Liao, Yan Ye, Qiulin Wang, Bing Li, Zhaokun Zhou, Miao Geng, Rui Chen, Xin Tao, Xiaoyu Liang, Shangkun Sun, Xingyuan Ma, Jiaze Li, Mengduo Yang, Haoran Xu, Jie zhou, Shiding Zhu, Bohan Yu, Pengfei Chen, Xinrui Xu, Jiabin Shen, Zhichao Duan, Erfan Asadi, Jiahe Liu, Qi Yan, Youran Qu, Xiaohui Zeng, Lele Wang, Renjie Liao

A total of 196 participants have registered in the video track.

Image Quality Assessment Image Restoration +2

Exploring Explainability in Video Action Recognition

no code implementations13 Apr 2024 Avinab Saha, Shashank Gupta, Sravan Kumar Ankireddy, Karl Chahine, Joydeep Ghosh

To address these, we introduce Video-TCAV, by building on TCAV for Image Classification tasks, which aims to quantify the importance of specific concepts in the decision-making process of Video Action Recognition models.

Action Recognition Classification +2

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

1 code implementation8 Nov 2023 Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot

Our experiments with ChatGPT-3. 5 show that this bias is ubiquitous - 80% of our personas demonstrate bias; it is significant - some datasets show performance drops of 70%+; and can be especially harmful for certain groups - some personas suffer statistically significant drops on 80%+ of the datasets.

Fairness Math

Top K Relevant Passage Retrieval for Biomedical Question Answering

1 code implementation8 Aug 2023 Shashank Gupta

In this work, we work on the existing DPR framework for the biomedical domain and retrieve answers from the Pubmed articles which is a reliable source to answer medical questions.

Passage Retrieval Question Answering +2

Recent Advances in the Foundations and Applications of Unbiased Learning to Rank

no code implementations4 May 2023 Shashank Gupta, Philipp Hager, Jin Huang, Ali Vardasbi, Harrie Oosterhuis

This tutorial provides both an introduction to the core concepts of the field and an overview of recent advancements in its foundations along with several applications of its methods.

Fairness Learning-To-Rank

Safe Deployment for Counterfactual Learning to Rank with Exposure-Based Risk Minimization

1 code implementation26 Apr 2023 Shashank Gupta, Harrie Oosterhuis, Maarten de Rijke

For the CLTR field, our novel exposure-based risk minimization method enables practitioners to adopt CLTR methods in a safer manner that mitigates many of the risks attached to previous methods.

counterfactual Learning-To-Rank

Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners

no code implementations16 Apr 2022 Shashank Gupta, Subhabrata Mukherjee, Krishan Subudhi, Eduardo Gonzalez, Damien Jose, Ahmed H. Awadallah, Jianfeng Gao

Traditional multi-task learning (MTL) methods use dense networks that use the same set of shared weights across several different tasks.

Multi-Task Learning

Knowledge Infused Decoding

1 code implementation ICLR 2022 Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah

Hence, they tend to suffer from counterfactual or hallucinatory generation when used in knowledge-intensive natural language generation (NLG) tasks.

counterfactual Question Answering +1

Exploring Low-Cost Transformer Model Compression for Large-Scale Commercial Reply Suggestions

no code implementations27 Nov 2021 Vaishnavi Shrivastava, Radhika Gaonkar, Shashank Gupta, Abhishek Jha

Fine-tuning pre-trained language models improves the quality of commercial reply suggestion systems, but at the cost of unsustainable training times.

Model Compression

Performance of Dense Coding and Teleportation for Random States --Augmentation via Pre-processing

no code implementations10 Dec 2020 Rivu Gupta, Shashank Gupta, Shiladitya Mal, Aditi Sen De

The local pre-processing employed here is based on positive operator valued measurements along with classical communication and we show that unlike dense coding with two-qubit random states, senders' operations are always helpful to probabilistically enhance the capabilities of implementing dense coding as well as teleportation.

Quantum Physics

Feature Extraction Functions for Neural Logic Rule Learning

no code implementations14 Aug 2020 Shashank Gupta, Antonio Robles-Kelly, Mohamed Reda Bouadjenek

Combining symbolic human knowledge with neural networks provides a rule-based ante-hoc explanation of the output.

Sentiment Analysis Sentiment Classification

Genuine Einstein-Podolsky-Rosen steering of three-qubit states by multiple sequential observers

no code implementations7 Jul 2020 Shashank Gupta, Ananda G. Maity, Debarshi Das, Arup Roy, A. S. Majumdar

We investigate the possibility of multiple use of a single copy of three-qubit states for genuine tripartite Einstein-Podolsky-Rosen (EPR) steering.

Quantum Physics

On Application of Bayesian Parametric and Non-parametric Methods for User Cohorting in Product Search

no code implementations WS 2020 Shashank Gupta

To the best of our knowledge, this is the first work that presents a comparative study of various Bayesian clustering methods in the context of product search.

Clustering

Semi-Supervised Recurrent Neural Network for Adverse Drug Reaction Mention Extraction

no code implementations6 Sep 2017 Shashank Gupta, Sachin Pawar, Nitin Ramrakhiyani, Girish Palshikar, Vasudeva Varma

Current methods in ADR mention extraction relies on supervised learning methods, which suffers from labeled data scarcity problem.

Deep Learning for Hate Speech Detection in Tweets

1 code implementation1 Jun 2017 Pinkesh Badjatiya, Shashank Gupta, Manish Gupta, Vasudeva Varma

Hate speech detection on Twitter is critical for applications like controversial event extraction, building AI chatterbots, content recommendation, and sentiment analysis.

16k Event Extraction +3

Cannot find the paper you are looking for? You can Submit a new open access paper.