Search Results for author: Muhammad Khalifa

Found 17 papers, 9 papers with code

A Distributional Approach to Controlled Text Generation

1 code implementation ICLR 2021 Muhammad Khalifa, Hady Elsahar, Marc Dymetman

From that optimal representation we then train a target controlled Autoregressive LM through an adaptive distributional variant of Policy Gradient.

Text Generation

GRACE: Discriminator-Guided Chain-of-Thought Reasoning

1 code implementation24 May 2023 Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

To address this issue, we propose Guiding chain-of-thought ReAsoning with a CorrectnEss Discriminator (GRACE), a stepwise decoding approach that steers the decoding process towards producing correct reasoning steps.

GSM8K Math

Few-shot Reranking for Multi-hop QA via Language Model Prompting

2 code implementations25 May 2022 Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

To alleviate the need for a large number of labeled question-document pairs for retriever training, we propose PromptRank, which relies on large language models prompting for multi-hop path reranking.

Open-Domain Question Answering Passage Re-Ranking +2

BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases

2 code implementations19 May 2023 Xin Liu, Muhammad Khalifa, Lu Wang

Energy-based models (EBMs) have gained popularity for controlled text generation due to their high applicability to a wide range of constraints.

Text Generation

LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses

1 code implementation30 Oct 2023 Xin Liu, Muhammad Khalifa, Lu Wang

For evaluation, we construct CaT, a benchmark consisting of eight text generation tasks, covering responses ranging from short phrases to paragraphs.

Language Modelling Text Generation

Self-Training Pre-Trained Language Models for Zero- and Few-Shot Multi-Dialectal Arabic Sequence Labeling

1 code implementation EACL 2021 Muhammad Khalifa, Muhammad Abdul-Mageed, Khaled Shaalan

We propose to self-train pre-trained language models in zero- and few-shot scenarios to improve performance on data-scarce varieties using only resources from data-rich ones.

Language Modelling NER +2

Exploring Demonstration Ensembling for In-context Learning

1 code implementation17 Aug 2023 Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

The standard approach for ICL is to prompt the LM with concatenated demonstrations followed by the test input.

In-Context Learning

Source-Aware Training Enables Knowledge Attribution in Language Models

1 code implementation1 Apr 2024 Muhammad Khalifa, David Wadden, Emma Strubell, Honglak Lee, Lu Wang, Iz Beltagy, Hao Peng

We investigate the problem of intrinsic source citation, where LLMs are required to cite the pretraining source supporting a generated response.

Data Augmentation

Semantic Source Code Search: A Study of the Past and a Glimpse at the Future

no code implementations15 Aug 2019 Muhammad Khalifa

With the recent explosion in the size and complexity of source codebases and software projects, the need for efficient source code search engines has increased dramatically.

Code Search Information Retrieval +2

Extracting Synonyms from Bilingual Dictionaries

no code implementations EACL (GWC) 2021 Mustafa Jarrar, Eman Karajah, Muhammad Khalifa, Khaled Shaalan

We present our progress in developing a novel algorithm to extract synonyms from bilingual dictionaries.

Translation

Zero-Resource Multi-Dialectal Arabic Natural Language Understanding

no code implementations14 Apr 2021 Muhammad Khalifa, Hesham Hassan, Aly Fahmy

In this paper, we investigate the zero-shot performance on Dialectal Arabic (DA) when fine-tuning a PLM on modern standard Arabic (MSA) data only -- identifying a significant performance drop when evaluating such models on DA.

named-entity-recognition Named Entity Recognition +6

A Bag of Tricks for Dialogue Summarization

no code implementations EMNLP 2021 Muhammad Khalifa, Miguel Ballesteros, Kathleen McKeown

Dialogue summarization comes with its own peculiar challenges as opposed to news or scientific articles summarization.

Language Modelling Multi-Task Learning +1

Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

no code implementations11 Oct 2022 Muhammad Khalifa, Yogarshi Vyas, Shuai Wang, Graham Horwood, Sunil Mallya, Miguel Ballesteros

The standard classification setting where categories are fixed during both training and testing falls short in dynamic environments where new document categories could potentially emerge.

Classification Document Classification +1

Novel Chapter Abstractive Summarization using Spinal Tree Aware Sub-Sentential Content Selection

no code implementations9 Nov 2022 Hardy Hardy, Miguel Ballesteros, Faisal Ladhak, Muhammad Khalifa, Vittorio Castelli, Kathleen McKeown

Summarizing novel chapters is a difficult task due to the input length and the fact that sentences that appear in the desired summaries draw content from multiple places throughout the chapter.

Abstractive Text Summarization Extractive Summarization

Cannot find the paper you are looking for? You can Submit a new open access paper.