Search Results

Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models

1 code implementation18 Oct 2022

In this work, we take the first step to exploit the pre-trained (unfine-tuned) weights to mitigate backdoors in fine-tuned language models.

 Ranked #1 on Sentiment Analysis on SST-2 Binary classification (Attack Success Rate metric)

Language Modelling Sentence +4

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

8 code implementations20 Mar 2024

Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks.

Language Modeling Language Modelling +1

KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

2 code implementations17 Feb 2024

To address these problems, we propose a knowledge-aware fine-tuning (KnowTuning) method to improve fine-grained and coarse-grained knowledge awareness of LLMs.

Question Answering

E2ETune: End-to-End Knob Tuning via Fine-tuned Generative Language Model

1 code implementation17 Apr 2024

Database knob tuning is a significant challenge for database administrators, as it involves tuning a large number of configuration knobs with continuous or discrete values to achieve optimal database performance.

Language Modeling Language Modelling +1

RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting

1 code implementation25 May 2023

In this work, we develop new strategies for instruction tuning and reinforcement learning to better align LLMs for cross-sentence rewriting tasks using diverse wording and structures expressed through natural languages including 1) generating rewriting instruction data from Wiki edits and public corpus through instruction generation and chain-of-thought prompting; 2) collecting comparison data for reward model training through a new ranking function.

Language Modeling Language Modelling +4

Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

1 code implementation NeurIPS 2023

We focus on diffusion models, defining the fine-tuning task as an RL problem, and updating the pre-trained text-to-image diffusion models using policy gradient to maximize the feedback-trained reward.

Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain

1 code implementation25 Nov 2024

To address these challenges, we introduce \textbf{Med-PerSAM}, a novel and straightforward one-shot framework designed for the medical domain.

In-Context Learning Prompt Engineering +1

Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic

3 code implementations19 Feb 2024

We demonstrate the effectiveness of RESTA in both parameter-efficient and full fine-tuning, covering a wide range of downstream tasks, including instruction following in Chinese, English, and Hindi, as well as problem-solving capabilities in Code and Math.

Instruction Following Math +1

DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

2 code implementations25 May 2023

We focus on diffusion models, defining the fine-tuning task as an RL problem, and updating the pre-trained text-to-image diffusion models using policy gradient to maximize the feedback-trained reward.

reinforcement-learning Reinforcement Learning (RL)

Llama 2: Open Foundation and Fine-Tuned Chat Models

19 code implementations18 Jul 2023

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters.

Arithmetic Reasoning +5