no code implementations • 12 Mar 2025 • Mouly Dewan, Jiqun Liu, Chirag Shah
In the information retrieval (IR) domain, evaluation plays a crucial role in optimizing search experiences and supporting diverse user intents.
no code implementations • 25 Jan 2025 • Harshita Chopra, Chirag Shah
The ability to identify and acquire missing information is a critical component of effective decision making and problem solving.
no code implementations • 8 Jan 2025 • Kirandeep Kaur, Manya Chadha, Vinayak Gupta, Chirag Shah
Our results on three real-world datasets show a significant reduction in weak users and improved robustness to subpopulations without disproportionately escalating costs.
no code implementations • 19 Dec 2024 • Chirag Shah, Ryen W. White
In the midst of the growing integration of Artificial Intelligence (AI) into various aspects of our lives, agents are experiencing a resurgence.
1 code implementation • 19 Oct 2024 • Sahil Verma, Royi Rassin, Arnav Das, Gantavya Bhatt, Preethi Seshadri, Chirag Shah, Jeff Bilmes, Hannaneh Hajishirzi, Yanai Elazar
We seek to determine the point at which a model was trained on enough instances to imitate a concept -- the imitation threshold.
no code implementations • 25 Jul 2024 • Ruoxi Shang, Gary Hsieh, Chirag Shah
Trust is not just a cognitive issue but also an emotional one, yet the research in human-AI interactions has primarily focused on the cognitive route of trust development.
1 code implementation • 21 Jun 2024 • Julia Kharchenko, Tanya Roosta, Aman Chadha, Chirag Shah
We prompt different LLMs with a series of advice requests based on 5 Hofstede Cultural Dimensions -- a quantifiable way of representing the values of a country.
no code implementations • 9 Jun 2024 • Maryam Amirizaniani, Elias Martin, Maryna Sivachenko, Afra Mashhadi, Chirag Shah
Theory of Mind (ToM) reasoning entails recognizing that other individuals possess their own intentions, emotions, and thoughts, which is vital for guiding one's own thought processes.
no code implementations • 21 May 2024 • Chirag Shah, Ryen W. White
While our focus is search and chat, with learnings from insights from a survey of over 100 individuals who have recently performed common tasks on these two modalities, we also present a more general vision for the future of information interaction using multiple modalities and the emergent capabilities of GenAI.
no code implementations • 1 May 2024 • Kirandeep Kaur, Chirag Shah
Our results on three real-world datasets show a significant reduction in weak users and improved robustness of RSs to sub-populations $(\approx12\%)$ and overall performance without disproportionately escalating costs.
no code implementations • 19 Mar 2024 • Siddharth Suri, Scott Counts, Leijie Wang, Chacha Chen, Mengting Wan, Tara Safavi, Jennifer Neville, Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Sathish Manivannan, Nagu Rangan, Longqi Yang
Until recently, search engines were the predominant method for people to access online information.
no code implementations • 18 Mar 2024 • Mengting Wan, Tara Safavi, Sujay Kumar Jauhar, Yujin Kim, Scott Counts, Jennifer Neville, Siddharth Suri, Chirag Shah, Ryen W White, Longqi Yang, Reid Andersen, Georg Buscher, Dhruv Joshi, Nagu Rangan
Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application.
no code implementations • 12 Mar 2024 • Preetam Prabhu Srikar Dammu, Himanshu Naidu, Mouly Dewan, Youngmin Kim, Tanya Roosta, Aman Chadha, Chirag Shah
In the midst of widespread misinformation and disinformation through social media and the proliferation of AI-generated texts, it has become increasingly difficult for people to validate and trust information they encounter.
no code implementations • 14 Feb 2024 • Maryam Amirizaniani, Jihan Yao, Adrian Lavergne, Elizabeth Snell Okada, Aman Chadha, Tanya Roosta, Chirag Shah
A case study using questions from the TruthfulQA dataset demonstrates that we can generate a reliable set of probes from one LLM that can be used to audit inconsistencies in a different LLM.
no code implementations • 14 Feb 2024 • Maryam Amirizaniani, Elias Martin, Tanya Roosta, Aman Chadha, Chirag Shah
AuditLLM's primary function is to audit a given LLM by deploying multiple probes derived from a single question, thus detecting any inconsistencies in the model's comprehension or performance.
no code implementations • 1 Jan 2024 • Chirag Shah
We need a more scientific approach to using LLMs in our research.
no code implementations • 25 Nov 2023 • Sahil Verma, Gantavya Bhatt, Avi Schwarzschild, Soumye Singhal, Arnav Mohanty Das, Chirag Shah, John P Dickerson, Pin-Yu Chen, Jeff Bilmes
We find that CleanCLIP, even with extensive hyperparameter tuning, is ineffective in poison removal when stronger pre-training objectives are used.
no code implementations • 3 Nov 2023 • Preetam Prabhu Srikar Dammu, Chirag Shah
This undesirable property is the root cause of the manifestation of spurious correlations, which render models unreliable and prone to failure in the presence of distribution shifts.
no code implementations • 30 Oct 2023 • Preetam Prabhu Srikar Dammu, Yunhe Feng, Chirag Shah
Our new method is able to (1) identify weak decision boundaries for such classes; (2) construct search queries for Google as well as text for generating images through DALL-E 2 and Stable Diffusion; and (3) show how these newly captured training samples could alleviate population bias issue.
no code implementations • 16 Sep 2023 • Sarkar Snigdha Sarathi Das, Chirag Shah, Mengting Wan, Jennifer Neville, Longqi Yang, Reid Andersen, Georg Buscher, Tara Safavi
The traditional Dialogue State Tracking (DST) problem aims to track user preferences and intents in user-agent conversations.
no code implementations • 14 Sep 2023 • Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Scott Counts, Sarkar Snigdha Sarathi Das, Ali Montazer, Sathish Manivannan, Jennifer Neville, Xiaochuan Ni, Nagu Rangan, Tara Safavi, Siddharth Suri, Mengting Wan, Leijie Wang, Longqi Yang
However, using LLMs to generate a user intent taxonomy and apply it for log analysis can be problematic for two main reasons: (1) such a taxonomy is not externally validated; and (2) there may be an undesirable feedback loop.
no code implementations • 28 Aug 2023 • Muhammad Rahman, Sachi Figliolini, Joyce Kim, Eivy Cedeno, Charles Kleier, Chirag Shah, Aman Chadha
It is difficult to find good resources or schedule an appointment with a career counselor to help with editing a resume for a specific role.
no code implementations • 28 Aug 2023 • Sahil Verma, Ashudeep Singh, Varich Boonsanong, John P. Dickerson, Chirag Shah
To the best of our knowledge, this work is the first to conceptualize and empirically test a generalized framework for generating recourses for recommender systems.
1 code implementation • 8 May 2023 • Souvick Ghosh, Satanu Ghosh, Chirag Shah
Then, the speech acts were fed to the model to predict the corresponding system-level search actions.
no code implementations • 12 Jan 2023 • Chirag Shah, Ryen W. White, Paul Thomas, Bhaskar Mitra, Shawon Sarkar, Nicholas Belkin
For decades, scholars made a case for the role that a user's task plays in how and why that user engages in search and what a search system should do to assist.
no code implementations • 27 Nov 2022 • Sahil Verma, Chirag Shah, John P. Dickerson, Anurag Beniwal, Narayanan Sadagopan, Arjun Seshadri
We evaluate RecXplainer on five real-world and large-scale recommendation datasets using five different kinds of recommender systems to demonstrate the efficacy of RecXplainer in capturing users' preferences over item attributes and using them to explain recommendations.
no code implementations • 17 Aug 2022 • Bingbing Wen, Yunhe Feng, Yongfeng Zhang, Chirag Shah
Current explanation generation models are found to exaggerate certain emotions without accurately capturing the underlying tone or the meaning.
no code implementations • 17 Aug 2022 • Bingbing Wen, Xiaoning Bu, Chirag Shah
To the best of our knowledge, this is the first framework for explainable conversational recommendation on real-world datasets.
no code implementations • 16 Jun 2021 • Ruoyuan Gao, Yingqiang Ge, Chirag Shah
We believe our work opens up a new direction of pursuing a metric for evaluating and implementing the FAIR systems.
no code implementations • 6 May 2021 • Bin Han, Chirag Shah, Daniel Saelid
We found out that users prefer results that are more consistent and relevant to the search queries.
no code implementations • 3 Nov 2020 • Yunhe Feng, Daniel Saelid, Ke Li, Ruoyuan Gao, Chirag Shah
The results showed that our runs performed below par for re-ranking task, but above average for retrieval.
no code implementations • 20 Oct 2020 • Sahil Verma, Varich Boonsanong, Minh Hoang, Keegan E. Hines, John P. Dickerson, Chirag Shah
Machine learning plays a role in many deployed decision systems, often in ways that are difficult or impossible to understand by human stakeholders.
no code implementations • 16 Jul 2020 • Sahil Verma, Ruoyuan Gao, Chirag Shah
Several recent works have highlighted how search and recommender systems exhibit bias along different dimensions.
no code implementations • 3 Jun 2020 • Zuohui Fu, Yikun Xian, Ruoyuan Gao, Jieyu Zhao, Qiaoying Huang, Yingqiang Ge, Shuyuan Xu, Shijie Geng, Chirag Shah, Yongfeng Zhang, Gerard de Melo
There has been growing attention on fairness considerations recently, especially in the context of intelligent decision making systems.
no code implementations • 12 Feb 2019 • Daniel Hienert, Dagmar Kern, Matthew Mitsui, Chirag Shah, Nicholas J. Belkin
In Interactive Information Retrieval (IIR) experiments the user's gaze motion on web pages is often recorded with eye tracking.