1 code implementation • 11 Sep 2024 • Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot
To advance towards this goal, we introduce SUPER, the first benchmark designed to evaluate the capability of LLMs in setting up and executing tasks from research repositories.
no code implementations • 29 Jul 2024 • Shashank Gupta, Harrie Oosterhuis, Maarten de Rijke
Our experiments show that both our novel safe doubly robust method and PRPO provide higher performance than the existing safe inverse propensity scoring approach.
1 code implementation • 26 Jul 2024 • Harsh Trivedi, Tushar Khot, Mareike Hartmann, Ruskin Manku, Vinty Dong, Edward Li, Shashank Gupta, Ashish Sabharwal, Niranjan Balasubramanian
To remedy this gap, we built $\textbf{AppWorld Engine}$, a high-quality execution environment (60K lines of code) of 9 day-to-day apps operable via 457 APIs and populated with realistic digital activities simulating the lives of ~100 fictitious users.
1 code implementation • 9 May 2024 • Shashank Gupta, Olivier Jeunen, Harrie Oosterhuis, Maarten de Rijke
The foundation of our framework is the derivation of an equivalent baseline correction for all of the existing control variates.
1 code implementation • 1 May 2024 • Shashank Gupta, Harrie Oosterhuis, Maarten de Rijke
Despite the fact that the extreme sparsity of preference elicitation interactions make them severely more prone to selection bias than natural interactions, the effect of selection bias in preference elicitation on the resulting recommendations has not been studied yet.
1 code implementation • 29 Apr 2024 • Parshin Shojaee, Kazem Meidani, Shashank Gupta, Amir Barati Farimani, Chandan K Reddy
Mathematical equations have been unreasonably effective in describing complex natural phenomena across various scientific disciplines.
no code implementations • 25 Apr 2024 • Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, HaoNing Wu, Yixuan Gao, Yuqin Cao, ZiCheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, Jianquan Yang, Weigang Wang, Xi Fang, Xiaoxin Lv, Jun Yan, Tianwu Zhi, Yabin Zhang, Yaohui Li, Yang Li, Jingwen Xu, Jianzhao Liu, Yiting Liao, Junlin Li, Zihao Yu, Yiting Lu, Xin Li, Hossein Motamednia, S. Farhad Hosseini-Benvidi, Fengbin Guan, Ahmad Mahmoudi-Aznaveh, Azadeh Mansouri, Ganzorig Gankhuyag, Kihwan Yoon, Yifang Xu, Haotian Fan, Fangyuan Kong, Shiling Zhao, Weifeng Dong, Haibing Yin, Li Zhu, Zhiling Wang, Bingchen Huang, Avinab Saha, Sandeep Mishra, Shashank Gupta, Rajesh Sureddi, Oindrila Saha, Luigi Celona, Simone Bianco, Paolo Napoletano, Raimondo Schettini, Junfeng Yang, Jing Fu, Wei zhang, Wenzhi Cao, Limei Liu, Han Peng, Weijun Yuan, Zhan Li, Yihang Cheng, Yifan Deng, Haohui Li, Bowen Qu, Yao Li, Shuqing Luo, Shunzhou Wang, Wei Gao, Zihao Lu, Marcos V. Conde, Xinrui Wang, Zhibo Chen, Ruling Liao, Yan Ye, Qiulin Wang, Bing Li, Zhaokun Zhou, Miao Geng, Rui Chen, Xin Tao, Xiaoyu Liang, Shangkun Sun, Xingyuan Ma, Jiaze Li, Mengduo Yang, Haoran Xu, Jie zhou, Shiding Zhu, Bohan Yu, Pengfei Chen, Xinrui Xu, Jiabin Shen, Zhichao Duan, Erfan Asadi, Jiahe Liu, Qi Yan, Youran Qu, Xiaohui Zeng, Lele Wang, Renjie Liao
A total of 196 participants have registered in the video track.
no code implementations • 13 Apr 2024 • Avinab Saha, Shashank Gupta, Sravan Kumar Ankireddy, Karl Chahine, Joydeep Ghosh
To address these, we introduce Video-TCAV, by building on TCAV for Image Classification tasks, which aims to quantify the importance of specific concepts in the decision-making process of Video Action Recognition models.
1 code implementation • 22 Nov 2023 • Shashank Gupta, Xuguang Ai, Ramakanth Kavuluru
Our contribution is also the first to conduct E2ERE for the RareDis dataset.
1 code implementation • 8 Nov 2023 • Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot
Our experiments with ChatGPT-3. 5 show that this bias is ubiquitous - 80% of our personas demonstrate bias; it is significant - some datasets show performance drops of 70%+; and can be especially harmful for certain groups - some personas suffer statistically significant drops on 80%+ of the datasets.
1 code implementation • 8 Aug 2023 • Shashank Gupta
In this work, we work on the existing DPR framework for the biomedical domain and retrieve answers from the Pubmed articles which is a reliable source to answer medical questions.
no code implementations • 4 May 2023 • Shashank Gupta, Philipp Hager, Jin Huang, Ali Vardasbi, Harrie Oosterhuis
This tutorial provides both an introduction to the core concepts of the field and an overview of recent advancements in its foundations along with several applications of its methods.
1 code implementation • 26 Apr 2023 • Shashank Gupta, Harrie Oosterhuis, Maarten de Rijke
For the CLTR field, our novel exposure-based risk minimization method enables practitioners to adopt CLTR methods in a safer manner that mitigates many of the risks attached to previous methods.
3 code implementations • NeurIPS 2023 • Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Shashank Gupta, Bodhisattwa Prasad Majumder, Katherine Hermann, Sean Welleck, Amir Yazdanbakhsh, Peter Clark
Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement.
no code implementations • 16 Apr 2022 • Shashank Gupta, Subhabrata Mukherjee, Krishan Subudhi, Eduardo Gonzalez, Damien Jose, Ahmed H. Awadallah, Jianfeng Gao
Traditional multi-task learning (MTL) methods use dense networks that use the same set of shared weights across several different tasks.
1 code implementation • ICLR 2022 • Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah
Hence, they tend to suffer from counterfactual or hallucinatory generation when used in knowledge-intensive natural language generation (NLG) tasks.
Ranked #2 on Question Answering on KILT: ELI5
no code implementations • 3 Jan 2022 • Parnian Afshar, Arash Mohammadi, Konstantinos N. Plataniotis, Keyvan Farahani, Justin Kirby, Anastasia Oikonomou, Amir Asif, Leonard Wee, Andre Dekker, Xin Wu, Mohammad Ariful Haque, Shahruk Hossain, Md. Kamrul Hasan, Uday Kamal, Winston Hsu, Jhih-Yuan Lin, M. Sohel Rahman, Nabil Ibtehaz, Sh. M. Amir Foisol, Kin-Man Lam, Zhong Guang, Runze Zhang, Sumohana S. Channappayya, Shashank Gupta, Chander Dev
Lung cancer is one of the deadliest cancers, and in part its effective diagnosis and treatment depend on the accurate delineation of the tumor.
no code implementations • 27 Nov 2021 • Vaishnavi Shrivastava, Radhika Gaonkar, Shashank Gupta, Abhishek Jha
Fine-tuning pre-trained language models improves the quality of commercial reply suggestion systems, but at the cost of unsustainable training times.
no code implementations • 10 Dec 2020 • Rivu Gupta, Shashank Gupta, Shiladitya Mal, Aditi Sen De
The local pre-processing employed here is based on positive operator valued measurements along with classical communication and we show that unlike dense coding with two-qubit random states, senders' operations are always helpful to probabilistically enhance the capabilities of implementing dense coding as well as teleportation.
Quantum Physics
no code implementations • 14 Aug 2020 • Shashank Gupta, Antonio Robles-Kelly, Mohamed Reda Bouadjenek
Combining symbolic human knowledge with neural networks provides a rule-based ante-hoc explanation of the output.
no code implementations • SEMEVAL 2020 • Anirudh Anil Ojha, Rohin Garg, Shashank Gupta, Ashutosh Modi
This paper describes our efforts in tackling Task 5 of SemEval-2020.
no code implementations • 7 Jul 2020 • Shashank Gupta, Ananda G. Maity, Debarshi Das, Arup Roy, A. S. Majumdar
We investigate the possibility of multiple use of a single copy of three-qubit states for genuine tripartite Einstein-Podolsky-Rosen (EPR) steering.
Quantum Physics
no code implementations • WS 2020 • Shashank Gupta
To the best of our knowledge, this is the first work that presents a comparative study of various Bayesian clustering methods in the context of product search.
1 code implementation • LREC 2018 • Daniel Khashabi, Mark Sammons, Ben Zhou, Tom Redman, Christos Christodoulopoulos, Vivek Srikumar, Nicholas Rizzolo, Lev Ratinov, Guanheng Luo, Quang Do, Chen-Tse Tsai, Subhro Roy, Stephen Mayhew, Zhili Feng, John Wieting, Xiaodong Yu, Yangqiu Song, Shashank Gupta, Shyam Upadhyay, Naveen Arivazhagan, Qiang Ning, Shaoshi Ling, Dan Roth
no code implementations • 14 Feb 2018 • Shashank Gupta, Manish Gupta, Vasudeva Varma, Sachin Pawar, Nitin Ramrakhiyani, Girish K. Palshikar
Adverse drug reactions (ADRs) are one of the leading causes of mortality in health care.
no code implementations • 14 Feb 2018 • Shashank Gupta, Manish Gupta, Vasudeva Varma, Sachin Pawar, Nitin Ramrakhiyani, Girish K. Palshikar
Adverse drug reactions (ADRs) are one of the leading causes of mortality in health care.
no code implementations • 6 Sep 2017 • Shashank Gupta, Sachin Pawar, Nitin Ramrakhiyani, Girish Palshikar, Vasudeva Varma
Current methods in ADR mention extraction relies on supervised learning methods, which suffers from labeled data scarcity problem.
no code implementations • RANLP 2017 • Divyansh Kaushik, Shashank Gupta, Chakradhar Raju, Reuben Dias, Sanjib Ghosh
The purpose of this research is to address the problem of extracting information from travel itineraries and discuss the challenges faced in the process.
1 code implementation • 1 Jun 2017 • Pinkesh Badjatiya, Shashank Gupta, Manish Gupta, Vasudeva Varma
Hate speech detection on Twitter is critical for applications like controversial event extraction, building AI chatterbots, content recommendation, and sentiment analysis.