Search Results for author: Hung Le

Found 61 papers, 25 papers with code

LAVIS: A Library for Language-Vision Intelligence

1 code implementation • 15 Sep 2022 • Dongxu Li, Junnan Li, Hung Le, Guangsen Wang, Silvio Savarese, Steven C. H. Hoi

We introduce LAVIS, an open-source deep learning library for LAnguage-VISion research and applications.

8,648

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

1 code implementation • 3 Jan 2024 • David Junhao Zhang, Dongxu Li, Hung Le, Mike Zheng Shou, Caiming Xiong, Doyen Sahoo

This work presents Moonshot, a new video generation model that conditions simultaneously on multimodal inputs of image and text.

Image Animation Video Editing +1

8,648

CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning

2 code implementations • 5 Jul 2022 • Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Silvio Savarese, Steven C. H. Hoi

To address the limitations, we propose "CodeRL", a new framework for program synthesis tasks through pretrained LMs and deep reinforcement learning (RL).

Ranked #1 on Code Generation on APPS

Code Generation Program Synthesis +2

2,580

CodeT5+: Open Code Large Language Models for Code Understanding and Generation

1 code implementation • 13 May 2023 • Yue Wang, Hung Le, Akhilesh Deepak Gotmare, Nghi D. Q. Bui, Junnan Li, Steven C. H. Hoi

To address these limitations, we propose "CodeT5+", a family of encoder-decoder LLMs for code in which component modules can be flexibly combined to suit a wide range of downstream code tasks.

Ranked #1 on Code Search on CodeXGLUE - AdvTest

Arithmetic Reasoning Code Completion +4

2,580

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

1 code implementation • 31 May 2023 • Nghi D. Q. Bui, Hung Le, Yue Wang, Junnan Li, Akhilesh Deepak Gotmare, Steven C. H. Hoi

In this paper, we present CodeTF, an open-source Transformer-based library for state-of-the-art Code LLMs and code intelligence.

1,413

OmniXAI: A Library for Explainable AI

2 code implementations • 1 Jun 2022 • Wenzhuo Yang, Hung Le, Tanmay Laud, Silvio Savarese, Steven C. H. Hoi

We introduce OmniXAI (short for Omni eXplainable AI), an open-source Python library of eXplainable AI (XAI), which offers omni-way explainable AI capabilities and various interpretable machine learning techniques to address the pain points of understanding and interpreting the decisions made by machine learning (ML) in practice.

counterfactual Counterfactual Explanation +5

803

URLNet: Learning a URL Representation with Deep Learning for Malicious URL Detection

3 code implementations • 9 Feb 2018 • Hung Le, Quang Pham, Doyen Sahoo, Steven C. H. Hoi

This approach allows the model to capture several types of semantic information, which was not possible by the existing models.

BIG-bench Machine Learning Feature Engineering +1

146

Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems

1 code implementation • ACL 2019 • Hung Le, Doyen Sahoo, Nancy F. Chen, Steven C. H. Hoi

Developing Video-Grounded Dialogue Systems (VGDS), where a dialogue is conducted based on visual and audio aspects of a given video, is significantly more challenging than traditional image or text-grounded dialogue systems because (1) the feature space of videos spans multiple picture frames, making it difficult to obtain semantic information; and (2) a dialogue agent must perceive and process information from different modalities (audio, video, caption, etc.)

Ranked #4 on Response Generation on SIMMC2.0

Dialogue State Tracking Response Generation

Self-Attentive Associative Memory

1 code implementation • ICML 2020 • Hung Le, Truyen Tran, Svetha Venkatesh

Heretofore, neural networks with external memory are restricted to single memory with lossy representations of memory interactions.

Ranked #1 on Question Answering on bAbi

Memorization Question Answering +1

Non-Autoregressive Dialog State Tracking

1 code implementation • ICLR 2020 • Hung Le, Richard Socher, Steven C. H. Hoi

Recent efforts in Dialogue State Tracking (DST) for task-oriented dialogues have progressed toward open-vocabulary or generation-based approaches where the models can generate slot value candidates from the dialogue history itself.

Ranked #13 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.0

dialog state tracking Dialogue State Tracking +2

Variational Memory Encoder-Decoder

1 code implementation • NeurIPS 2018 • Hung Le, Truyen Tran, Thin Nguyen, Svetha Venkatesh

Introducing variability while maintaining coherence is a core task in learning to generate utterances in conversation.

CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

1 code implementation • 13 Oct 2023 • Hung Le, Hailin Chen, Amrita Saha, Akash Gokul, Doyen Sahoo, Shafiq Joty

We find that by naturally encouraging the LLM to reuse the previously developed and verified sub-modules, CodeChain can significantly boost both modularity as well as correctness of the generated solutions, achieving relative pass@1 improvements of 35% on APPS and 76% on CodeContests.

Ranked #2 on Code Generation on CodeContests (Test Set pass@1 metric)

Code Generation

Learning to Remember More with Less Memorization

1 code implementation • ICLR 2019 • Hung Le, Truyen Tran, Svetha Venkatesh

Memory-augmented neural networks consisting of a neural controller and an external memory have shown potential in long-term sequential learning.

Ranked #5 on Text Classification on Yahoo! Answers

Memorization Sentiment Analysis +2

Memory and attention in deep learning

1 code implementation • 3 Jul 2021 • Hung Le

Artificial neural networks model neurons and synapses in the brain by interconnecting computational units via weights, which is a typical class of machine learning algorithms that resembles memory structure.

Neural Stored-program Memory

1 code implementation • ICLR 2020 • Hung Le, Truyen Tran, Svetha Venkatesh

Neural networks powered with external memory simulate computer behaviors.

Ranked #5 on Question Answering on bAbi (Mean Error Rate metric)

continual few-shot learning Few-Shot Learning +1

Dual Memory Neural Computer for Asynchronous Two-view Sequential Learning

1 code implementation • 2 Feb 2018 • Hung Le, Truyen Tran, Svetha Venkatesh

One of the core tasks in multi-view learning is to capture relations among views.

MULTI-VIEW LEARNING Vocal Bursts Valence Prediction

DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue

1 code implementation • ACL 2021 • Hung Le, Chinnadhurai Sankar, Seungwhan Moon, Ahmad Beirami, Alborz Geramifard, Satwik Kottur

A video-grounded dialogue system is required to understand both dialogue, which contains semantic dependencies from turn to turn, and video, which contains visual cues of spatial and temporal scene variations.

Object Tracking Visual Reasoning

BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues

1 code implementation • EMNLP 2020 • Hung Le, Doyen Sahoo, Nancy F. Chen, Steven C. H. Hoi

Video-grounded dialogues are very challenging due to (i) the complexity of videos which contain both spatial and temporal variations, and (ii) the complexity of user utterances which query different segments and/or different objects in videos over multiple dialogue turns.

Multimodal Dialogue State Tracking

1 code implementation • NAACL 2022 • Hung Le, Nancy F. Chen, Steven C. H. Hoi

Specifically, we introduce a novel dialogue state tracking task to track the information of visual objects that are mentioned in video-grounded dialogues.

Dialogue State Tracking Video Understanding

UniConv: A Unified Conversational Neural Architecture for Multi-domain Task-oriented Dialogues

1 code implementation • EMNLP 2020 • Hung Le, Doyen Sahoo, Chenghao Liu, Nancy F. Chen, Steven C. H. Hoi

Building an end-to-end conversational agent for multi-domain task-oriented dialogues has been an open challenge for two main reasons.

Dialogue State Tracking

Episodic Policy Gradient Training

1 code implementation • 3 Dec 2021 • Hung Le, Majid Abdolshah, Thommen K. George, Kien Do, Dung Nguyen, Svetha Venkatesh

We introduce a novel training procedure for policy gradient methods wherein episodic memory is used to optimize the hyperparameters of reinforcement learning algorithms on-the-fly.

Policy Gradient Methods Scheduling

DeepProcess: Supporting business process execution using a MANN-based recommender system

1 code implementation • 3 Feb 2018 • Asjad Khan, Hung Le, Kien Do, Truyen Tran, Aditya Ghose, Hoa Dam, Renuka Sindhgatta

Process-aware Recommender systems can provide critical decision support functionality to aid business process execution by recommending what actions to take next.

Activity Prediction Recommendation Systems

Beyond Surprise: Improving Exploration Through Surprise Novelty

1 code implementation • 9 Aug 2023 • Hung Le, Kien Do, Dung Nguyen, Svetha Venkatesh

We present a new computing model for intrinsic rewards in reinforcement learning that addresses the limitations of existing surprise-driven explorations.

Atari Games Retrieval

LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying

1 code implementation • 21 Aug 2023 • Thommen George Karimpanal, Laknath Buddhika Semage, Santu Rana, Hung Le, Truyen Tran, Sunil Gupta, Svetha Venkatesh

To address this issue, we introduce SEQ (sample efficient querying), where we simultaneously train a secondary RL agent to decide when the LLM should be queried for solutions.

Decision Making reinforcement-learning +1

SurvTimeSurvival: Survival Analysis On The Patient With Multiple Visits/Records

1 code implementation • 16 Nov 2023 • Hung Le, Ong Eng-Jon, Bober Miroslaw

This study introduces "SurvTimeSurvival: Survival Analysis On Patients With Multiple Visits/Records", utilizing the Transformer model to not only handle the complexities of time-varying covariates but also covariates data.

Survival Analysis Synthetic Data Generation

What are the Receptive, Effective Receptive, and Projective Fields of Neurons in Convolutional Neural Networks?

no code implementations • 19 May 2017 • Hung Le, Ali Borji

In this work, we explain in detail how receptive fields, effective receptive fields, and projective fields of neurons in different layers, convolution or pooling, of a Convolutional Neural Network (CNN) are calculated.

Dual Control Memory Augmented Neural Networks for Treatment Recommendations

no code implementations • 11 Feb 2018 • Hung Le, Truyen Tran, Svetha Venkatesh

The decoding controller generates a treatment sequence, one treatment option at a time.

Meta-Learning with Domain Adaptation for Few-Shot Learning under Domain Shift

no code implementations • ICLR 2019 • Doyen Sahoo, Hung Le, Chenghao Liu, Steven C. H. Hoi

Most existing work assumes that both training and test tasks are drawn from the same distribution, and a large amount of labeled data is available in the training tasks.

Domain Adaptation Few-Shot Learning

FoodAI: Food Image Recognition via Deep Learning for Smart Food Logging

no code implementations • 26 Sep 2019 • Doyen Sahoo, Wang Hao, Shu Ke, Wu Xiongwei, Hung Le, Palakorn Achananuparp, Ee-Peng Lim, Steven C. H. Hoi

FoodAI has made food logging convenient, aiding smart consumption and a healthy lifestyle.

Management

Improving Long Handwritten Text Line Recognition with Convolutional Multi-way Associative Memory

no code implementations • 5 Nov 2019 • Duc Nguyen, Nhan Tran, Hung Le

Convolutional Recurrent Neural Networks (CRNNs) excel at scene text recognition.

Optical Character Recognition Optical Character Recognition (OCR) +1

Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge

no code implementations • 25 Feb 2020 • Hung Le, Nancy F. Chen

Audio-Visual Scene-Aware Dialog (AVSD) is an extension from Video Question Answering (QA) whereby the dialogue agent is required to generate natural language responses to address user queries and carry on conversations.

Question Answering Video Question Answering

Video-Grounded Dialogues with Pretrained Generation Language Models

no code implementations • ACL 2020 • Hung Le, Steven C. H. Hoi

Pre-trained language models have shown remarkable success in improving various downstream NLP tasks due to their ability to capture dependencies in textual data and generate natural responses.

Sentence

Neurocoder: Learning General-Purpose Computation Using Stored Neural Programs

no code implementations • NeurIPS 2021 • Hung Le, Svetha Venkatesh

For the first time, a Neural Program is treated as a datum in memory, paving the way for modular, recursive and procedural neural programming.

Continual Learning Object Recognition

VilNMN: A Neural Module Network approach to Video-Grounded Language Tasks

no code implementations • 1 Jan 2021 • Hung Le, Nancy F. Chen, Steven Hoi

Neural module networks (NMN) have achieved success in image-grounded tasks such as question answering (QA) on synthetic images.

Information Retrieval Question Answering +1

Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues

no code implementations • ICLR 2021 • Hung Le, Nancy F. Chen, Steven C. H. Hoi

PDC model then learns to predict reasoning paths over this semantic graph.

Question Answering Visual Question Answering

VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks

no code implementations • 16 Apr 2021 • Hung Le, Nancy F. Chen, Steven C. H. Hoi

Neural module networks (NMN) have achieved success in image-grounded tasks such as Visual Question Answering (VQA) on synthetic images.

Information Retrieval Question Answering +2

$C^3$: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues

no code implementations • 16 Jun 2021 • Hung Le, Nancy F. Chen, Steven C. H. Hoi

Video-grounded dialogue systems aim to integrate video understanding and dialogue understanding to generate responses that are relevant to both the dialogue and video context.

Contrastive Learning counterfactual +3

A New Representation of Successor Features for Transfer across Dissimilar Environments

no code implementations • 18 Jul 2021 • Majid Abdolshah, Hung Le, Thommen Karimpanal George, Sunil Gupta, Santu Rana, Svetha Venkatesh

Transfer in reinforcement learning is usually achieved through generalisation across tasks.

Gaussian Processes Reinforcement Learning (RL)

Plug and Play, Model-Based Reinforcement Learning

no code implementations • 20 Aug 2021 • Majid Abdolshah, Hung Le, Thommen Karimpanal George, Sunil Gupta, Santu Rana, Svetha Venkatesh

This is achieved by representing the global transition dynamics as a union of local transition functions, each with respect to one active object in the scene.

Model-based Reinforcement Learning Object +3

Reachability Traces for Curriculum Design in Reinforcement Learning

no code implementations • 29 Sep 2021 • Thommen Karimpanal George, Majid Abdolshah, Hung Le, Santu Rana, Sunil Gupta, Truyen Tran, Svetha Venkatesh

The objective in goal-based reinforcement learning is to learn a policy to reach a particular goal state within the environment.

reinforcement-learning Reinforcement Learning (RL)

Generative Pseudo-Inverse Memory

no code implementations • ICLR 2022 • Kha Pham, Hung Le, Man Ngo, Truyen Tran, Bao Ho, Svetha Venkatesh

We propose Generative Pseudo-Inverse Memory (GPM), a class of deep generative memory models that are fast to write in and read out.

Denoising

Neural Latent Traversal with Semantic Constraints

no code implementations • 29 Sep 2021 • Majid Abdolshah, Hung Le, Thommen Karimpanal George, Vuong Le, Sunil Gupta, Santu Rana, Svetha Venkatesh

Whilst Generative Adversarial Networks (GANs) generate visually appealing high resolution images, the latent representations (or codes) of these models do not allow controllable changes on the semantic attributes of the generated images.

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

no code implementations • NeurIPS 2021 • Hung Le, Thommen Karimpanal George, Majid Abdolshah, Truyen Tran, Svetha Venkatesh

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory.

reinforcement-learning Reinforcement Learning (RL)

Balanced Q-learning: Combining the Influence of Optimistic and Pessimistic Targets

no code implementations • 3 Nov 2021 • Thommen George Karimpanal, Hung Le, Majid Abdolshah, Santu Rana, Sunil Gupta, Truyen Tran, Svetha Venkatesh

The optimistic nature of the Q-learning target leads to an overestimation bias, which is an inherent problem associated with standard Q-learning.

Q-Learning

Robust Deep Reinforcement Learning for Extractive Legal Summarization

no code implementations • 13 Nov 2021 • Duy-Hung Nguyen, Bao-Sinh Nguyen, Nguyen Viet Dung Nghiem, Dung Tien Le, Mim Amina Khatun, Minh-Tien Nguyen, Hung Le

Automatic summarization of legal texts is an important and still a challenging task since legal documents are often long and complicated with unusual structures and styles.

reinforcement-learning Reinforcement Learning (RL)

Towards Effective and Robust Neural Trojan Defenses via Input Filtering

no code implementations • 24 Feb 2022 • Kien Do, Haripriya Harikumar, Hung Le, Dung Nguyen, Truyen Tran, Santu Rana, Dang Nguyen, Willy Susilo, Svetha Venkatesh

Trojan attacks on deep neural networks are both dangerous and surreptitious.

Data Compression Variational Inference

Make The Most of Prior Data: A Solution for Interactive Text Summarization with Preference Feedback

no code implementations • Findings (NAACL) 2022 • Duy-Hung Nguyen, Nguyen Viet Dung Nghiem, Bao-Sinh Nguyen, Dung Tien Le, Shahab Sabahi, Minh-Tien Nguyen, Hung Le

For summarization, human preference is critical to tame outputs of the summarizer in favor of human interests, as ground-truth summaries are scarce and ambiguous.

Text Summarization

Learning Theory of Mind via Dynamic Traits Attribution

no code implementations • 17 Apr 2022 • Dung Nguyen, Phuoc Nguyen, Hung Le, Kien Do, Svetha Venkatesh, Truyen Tran

Inspired by the observation that humans often infer the character traits of others, then use them to explain behaviour, we propose a new neural ToM architecture that learns to generate a latent trait vector of an actor from the past trajectories.

Future prediction Inductive Bias +1

Learning to Constrain Policy Optimization with Virtual Trust Region

no code implementations • 20 Apr 2022 • Hung Le, Thommen Karimpanal George, Majid Abdolshah, Dung Nguyen, Kien Do, Sunil Gupta, Svetha Venkatesh

We introduce a constrained optimization method for policy gradient reinforcement learning, which uses a virtual trust region to regulate each policy update.

Atari Games Policy Gradient Methods

HYCEDIS: HYbrid Confidence Engine for Deep Document Intelligence System

no code implementations • 1 Jun 2022 • Bao-Sinh Nguyen, Quang-Bach Tran, Tuan-Anh Nguyen Dang, Duc Nguyen, Hung Le

Measuring the confidence of AI models is critical for safely deploying AI in real-world industrial systems.

VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems

no code implementations • NAACL 2022 • Hung Le, Nancy Chen, Steven Hoi

Neural module networks (NMN) have achieved success in image-grounded tasks such as Visual Question Answering (VQA) on synthetic images.

Information Retrieval Question Answering +2

Momentum Adversarial Distillation: Handling Large Distribution Shifts in Data-Free Knowledge Distillation

no code implementations • 21 Sep 2022 • Kien Do, Hung Le, Dung Nguyen, Dang Nguyen, Haripriya Harikumar, Truyen Tran, Santu Rana, Svetha Venkatesh

Since the EMA generator can be considered as an ensemble of the generator's old versions and often undergoes a smaller change in updates compared to the generator, training on its synthetic samples can help the student recall the past knowledge and prevent the student from adapting too quickly to new updates of the generator.

Data-free Knowledge Distillation

Improving Document Image Understanding with Reinforcement Finetuning

no code implementations • 26 Sep 2022 • Bao-Sinh Nguyen, Dung Tien Le, Hieu M. Vu, Tuan Anh D. Nguyen, Minh-Tien Nguyen, Hung Le

In this paper, we investigate the problem of improving the performance of Artificial Intelligence systems in understanding document images, especially in cases where training data is limited.

Reinforcement Learning (RL)

Functional Indirection Neural Estimator for Better Out-of-distribution Generalization

no code implementations • 23 Oct 2022 • Kha Pham, Hung Le, Man Ngo, Truyen Tran

FINE consists of a backbone network and a trainable semantic memory of basis weight matrices.

Out-of-Distribution Generalization

Memory-Augmented Theory of Mind Network

no code implementations • 17 Jan 2023 • Dung Nguyen, Phuoc Nguyen, Hung Le, Kien Do, Svetha Venkatesh, Truyen Tran

Social reasoning necessitates the capacity of theory of mind (ToM), the ability to contextualise and attribute mental states to others without having access to their internal cognitive structure.

Attribute

BO-Muse: A human expert and AI teaming framework for accelerated experimental design

no code implementations • 3 Mar 2023 • Sunil Gupta, Alistair Shilton, Arun Kumar A V, Shannon Ryan, Majid Abdolshah, Hung Le, Santu Rana, Julian Berk, Mahad Rashid, Svetha Venkatesh

In this paper we introduce BO-Muse, a new approach to human-AI teaming for the optimization of expensive black-box functions.

Experimental Design

When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust

no code implementations • 12 May 2023 • Minh-Tien Nguyen, Duy-Hung Nguyen, Shahab Sabahi, Hung Le, Jeff Yang, Hajime Hotta

Based on the task, we design a new model that relies on LLMs empowered by additional knowledge extracted from insurance policy rulebooks and DBpedia.

Domain Adaptation Question Answering

Universal Graph Continual Learning

no code implementations • 27 Aug 2023 • Thanh Duc Hoang, Do Viet Tung, Duy-Hung Nguyen, Bao-Sinh Nguyen, Huy Hoang Nguyen, Hung Le

We address catastrophic forgetting issues in graph learning as incoming data transitions from one graph distribution to another.

Continual Learning Graph Classification +2

Variational Flow Models: Flowing in Your Style

no code implementations • 5 Feb 2024 • Kien Do, Duc Kieu, Toan Nguyen, Dang Nguyen, Hung Le, Dung Nguyen, Thin Nguyen

We introduce "posterior flows" - generalizations of "probability flows" to a broader class of stochastic processes not necessarily diffusion processes - and propose a systematic training-free method to transform the posterior flow of a "linear" stochastic process characterized by the equation $X_t = a_t X_0 + s_t X_1$ into a straight constant-speed (SC) flow, reminiscent of Rectified Flow.

Variational Inference

Revisiting the Dataset Bias Problem from a Statistical Perspective

no code implementations • 5 Feb 2024 • Kien Do, Dung Nguyen, Hung Le, Thao Le, Dang Nguyen, Haripriya Harikumar, Truyen Tran, Santu Rana, Svetha Venkatesh

To overcome this challenge, we propose to approximate $\frac{1}{p(u|b)}$ using a biased classifier trained with "bias amplification" losses.

Attribute

Automatic Prompt Selection for Large Language Models

no code implementations • 3 Apr 2024 • Viet-Tung Do, Van-Khanh Hoang, Duy-Hung Nguyen, Shahab Sabahi, Jeff Yang, Hajime Hotta, Minh-Tien Nguyen, Hung Le

Our approach consists of three steps: (1) clustering the training data and generating candidate prompts for each cluster using an LLM-based prompt generator; (2) synthesizing a dataset of input-prompt-output tuples for training a prompt evaluator to rank the prompts based on their relevance to the input; (3) using the prompt evaluator to select the best prompt for a new input at test time.

GSM8K Question Answering +1
