no code implementations • GWC 2016 • Diptesh Kanojia, Shehzaad Dhuliawala, Pushpak Bhattacharyya
Our contribution is three fold: (1) We develop a system, which, given a synset in English, finds an appropriate image for the synset.
no code implementations • 30 Jan 2025 • Jack Lanchantin, Angelica Chen, Shehzaad Dhuliawala, Ping Yu, Jason Weston, Sainbayar Sukhbaatar, Ilia Kulikov
In this work we introduce Diverse Preference Optimization (DivPO), an optimization method which learns to generate much more diverse responses than standard pipelines, while maintaining the quality of the generations.
no code implementations • 14 Nov 2024 • Shehzaad Dhuliawala, Ilia Kulikov, Ping Yu, Asli Celikyilmaz, Jason Weston, Sainbayar Sukhbaatar, Jack Lanchantin
During language model decoding, it is known that using higher temperature sampling gives more creative responses, while lower temperatures are more factually accurate.
1 code implementation • 24 Jul 2024 • Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan
We present ALT (ALignment with Textual feedback), an approach that aligns language models with user preferences expressed in text.
1 code implementation • 23 May 2024 • Zhijing Jin, Nils Heil, Jiarui Liu, Shehzaad Dhuliawala, Yahang Qi, Bernhard Schölkopf, Rada Mihalcea, Mrinmaya Sachan
This work systematically studies IP through a rigorous mathematical formulation, a multi-perspective moral reasoning framework, and a set of case studies.
1 code implementation • 20 Oct 2023 • Shehzaad Dhuliawala, Vilém Zouhar, Mennatallah El-Assady, Mrinmaya Sachan
In a human-AI collaboration, users build a mental model of the AI system based on its reliability and how it presents its decision, e. g. its presentation of system confidence and an explanation of the output.
1 code implementation • 20 Sep 2023 • Shehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, Roberta Raileanu, Xian Li, Asli Celikyilmaz, Jason Weston
Generation of plausible yet incorrect factual information, termed hallucination, is an unsolved issue in large language models.
1 code implementation • 17 May 2023 • Shehzaad Dhuliawala, Mrinmaya Sachan, Carl Allen
We present a latent variable model for classification that provides a novel probabilistic interpretation of neural network softmax classifiers.
1 code implementation • 23 Feb 2023 • Mian Zhong, Shehzaad Dhuliawala, Niklas Stoehr
We cast victim count extraction as a question answering (QA) task with a regression or classification objective.
1 code implementation • 21 Jan 2023 • Vilém Zouhar, Shehzaad Dhuliawala, Wangchunshu Zhou, Nico Daheim, Tom Kocmi, Yuchen Eleanor Jiang, Mrinmaya Sachan
Machine translation quality estimation (QE) predicts human judgements of a translation hypothesis without seeing the reference.
no code implementations • Findings (ACL) 2022 • Shehzaad Dhuliawala, Leonard Adolphs, Rajarshi Das, Mrinmaya Sachan
We show that calibrating such complex systems which contain discrete retrieval and deep reading components is challenging and current calibration techniques fail to scale to these settings.
no code implementations • ICLR 2022 • Mattia Atzeni, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan
Text-based games (TBG) have emerged as promising environments for driving research in grounded language understanding and studying problems like generalization and sample efficiency.
Deep Reinforcement Learning
Out-of-Distribution Generalization
+3
1 code implementation • 2 Oct 2021 • Vaibhav Adlakha, Shehzaad Dhuliawala, Kaheer Suleman, Harm de Vries, Siva Reddy
On average, a conversation in our dataset spans 13 question-answer turns and involves four topics (documents).
1 code implementation • 4 Aug 2021 • Leonard Adolphs, Shehzaad Dhuliawala, Thomas Hofmann
We apply this approach of querying by example to the LAMA probe and obtain substantial improvements of up to 37. 8% for BERT-large on the T-REx data when providing only 10 demonstrations--even outperforming a baseline that queries the model with up to 40 paraphrases of the question.
1 code implementation • AKBC 2020 • Rajarshi Das, Ameya Godbole, Shehzaad Dhuliawala, Manzil Zaheer, Andrew McCallum
We present a surprisingly simple yet accurate approach to reasoning in knowledge graphs (KGs) that requires \emph{no training}, and is reminiscent of case-based reasoning in classical artificial intelligence (AI).
no code implementations • 6 Nov 2019 • Timothy J. Hazen, Shehzaad Dhuliawala, Daniel Boies
This paper explores domain adaptation for enabling question answering (QA) systems to answer questions posed against documents in new specialized domains.
no code implementations • WS 2019 • Rajarshi Das, Ameya Godbole, Manzil Zaheer, Shehzaad Dhuliawala, Andrew McCallum
This paper describes our submission to the shared task on {``}Multi-hop Inference Explanation Regeneration{''} in TextGraphs workshop at EMNLP 2019 (Jansen and Ustalov, 2019).
1 code implementation • ICLR 2019 • Rajarshi Das, Shehzaad Dhuliawala, Manzil Zaheer, Andrew McCallum
This paper introduces a new framework for open-domain question answering in which the retriever and the reader iteratively interact with each other.
8 code implementations • ICLR 2018 • Rajarshi Das, Shehzaad Dhuliawala, Manzil Zaheer, Luke Vilnis, Ishan Durugkar, Akshay Krishnamurthy, Alex Smola, Andrew McCallum
Knowledge bases (KB), both automatically and manually constructed, are often incomplete --- many valid facts can be inferred from the KB by synthesizing existing information.
no code implementations • LREC 2016 • Shehzaad Dhuliawala, Diptesh Kanojia, Pushpak Bhattacharyya
We present a WordNet like structured resource for slang words and neologisms on the internet.