Search Results for author: Muhammad Khalifa

Found 19 papers, 10 papers with code

Learning to Reason via Program Generation, Emulation, and Search

1 code implementation25 May 2024 Nathaniel Weir, Muhammad Khalifa, Linlu Qiu, Orion Weller, Peter Clark

CoGEX works by (1) training LMs to generate their own pseudo-programs, (2) teaching them to emulate their generated program's execution, including those leaf functions, allowing the LM's knowledge to fill in the execution gaps; and (3) using them to search over many programs to find an optimal one.

Code Generation In-Context Learning +1

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

no code implementations26 Apr 2024 Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang

Self-correction has emerged as a promising solution to boost the reasoning performance of large language models (LLMs), where LLMs refine their solutions using self-generated critiques that pinpoint the errors.


Source-Aware Training Enables Knowledge Attribution in Language Models

1 code implementation1 Apr 2024 Muhammad Khalifa, David Wadden, Emma Strubell, Honglak Lee, Lu Wang, Iz Beltagy, Hao Peng

We investigate the problem of intrinsic source citation, where LLMs are required to cite the pretraining source supporting a generated response.

Data Augmentation

LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses

1 code implementation30 Oct 2023 Xin Liu, Muhammad Khalifa, Lu Wang

For evaluation, we construct CaT, a benchmark consisting of eight text generation tasks, covering responses ranging from short phrases to paragraphs.

Language Modelling Text Generation

Exploring Demonstration Ensembling for In-context Learning

1 code implementation17 Aug 2023 Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

The standard approach for ICL is to prompt the LM with concatenated demonstrations followed by the test input.

In-Context Learning

GRACE: Discriminator-Guided Chain-of-Thought Reasoning

1 code implementation24 May 2023 Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

To address this issue, we propose Guiding chain-of-thought ReAsoning with a CorrectnEss Discriminator (GRACE), a stepwise decoding approach that steers the decoding process towards producing correct reasoning steps.

GSM8K Math

BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases

2 code implementations19 May 2023 Xin Liu, Muhammad Khalifa, Lu Wang

Energy-based models (EBMs) have gained popularity for controlled text generation due to their high applicability to a wide range of constraints.

Text Generation

Novel Chapter Abstractive Summarization using Spinal Tree Aware Sub-Sentential Content Selection

no code implementations9 Nov 2022 Hardy Hardy, Miguel Ballesteros, Faisal Ladhak, Muhammad Khalifa, Vittorio Castelli, Kathleen McKeown

Summarizing novel chapters is a difficult task due to the input length and the fact that sentences that appear in the desired summaries draw content from multiple places throughout the chapter.

Abstractive Text Summarization Extractive Summarization

Contrastive Training Improves Zero-Shot Classification of Semi-structured Documents

no code implementations11 Oct 2022 Muhammad Khalifa, Yogarshi Vyas, Shuai Wang, Graham Horwood, Sunil Mallya, Miguel Ballesteros

The standard classification setting where categories are fixed during both training and testing falls short in dynamic environments where new document categories could potentially emerge.

Classification Document Classification +1

Few-shot Reranking for Multi-hop QA via Language Model Prompting

2 code implementations25 May 2022 Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

To alleviate the need for a large number of labeled question-document pairs for retriever training, we propose PromptRank, which relies on large language models prompting for multi-hop path reranking.

Open-Domain Question Answering Passage Re-Ranking +2

A Bag of Tricks for Dialogue Summarization

no code implementations EMNLP 2021 Muhammad Khalifa, Miguel Ballesteros, Kathleen McKeown

Dialogue summarization comes with its own peculiar challenges as opposed to news or scientific articles summarization.

Language Modelling Multi-Task Learning +1

Zero-Resource Multi-Dialectal Arabic Natural Language Understanding

no code implementations14 Apr 2021 Muhammad Khalifa, Hesham Hassan, Aly Fahmy

In this paper, we investigate the zero-shot performance on Dialectal Arabic (DA) when fine-tuning a PLM on modern standard Arabic (MSA) data only -- identifying a significant performance drop when evaluating such models on DA.

named-entity-recognition Named Entity Recognition +6

Self-Training Pre-Trained Language Models for Zero- and Few-Shot Multi-Dialectal Arabic Sequence Labeling

1 code implementation EACL 2021 Muhammad Khalifa, Muhammad Abdul-Mageed, Khaled Shaalan

We propose to self-train pre-trained language models in zero- and few-shot scenarios to improve performance on data-scarce varieties using only resources from data-rich ones.

Language Modelling NER +2

A Distributional Approach to Controlled Text Generation

2 code implementations ICLR 2021 Muhammad Khalifa, Hady Elsahar, Marc Dymetman

From that optimal representation we then train a target controlled Autoregressive LM through an adaptive distributional variant of Policy Gradient.

Text Generation

Extracting Synonyms from Bilingual Dictionaries

no code implementations EACL (GWC) 2021 Mustafa Jarrar, Eman Karajah, Muhammad Khalifa, Khaled Shaalan

We present our progress in developing a novel algorithm to extract synonyms from bilingual dictionaries.


Semantic Source Code Search: A Study of the Past and a Glimpse at the Future

no code implementations15 Aug 2019 Muhammad Khalifa

With the recent explosion in the size and complexity of source codebases and software projects, the need for efficient source code search engines has increased dramatically.

Code Search Information Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.