Search Results for author: Xianjun Yang

Found 25 papers, 18 papers with code

Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning

no code implementations • 16 Apr 2024 • Xiao Wang, Tianze Chen, Xianjun Yang, Qi Zhang, Xun Zhao, Dahua Lin

The open-sourcing of large language models (LLMs) accelerates application development, innovation, and scientific progress.

In-Context Learning · Instruction Following

Test-Time Backdoor Attacks on Multimodal Large Language Models

1 code implementation • 13 Feb 2024 • Dong Lu, Tianyu Pang, Chao Du, Qian Liu, Xianjun Yang, Min Lin

Backdoor attacks are commonly executed by contaminating training data, such that a trigger can activate predetermined harmful effects during the test phase.

Backdoor Attack

TrustAgent: Towards Safe and Trustworthy LLM-based Agents through Agent Constitution

1 code implementation • 2 Feb 2024 • Wenyue Hua, Xianjun Yang, Zelong Li, Wei Cheng, Yongfeng Zhang

This paper presents an Agent-Constitution-based agent framework, TrustAgent, an initial investigation into improving the safety dimension of trustworthiness in LLM-based agents.

Weak-to-Strong Jailbreaking on Large Language Models

1 code implementation • 30 Jan 2024 • Xuandong Zhao, Xianjun Yang, Tianyu Pang, Chao Du, Lei Li, Yu-Xiang Wang, William Yang Wang

In this paper, we propose the weak-to-strong jailbreaking attack, an efficient method to attack aligned LLMs to produce harmful text.

PLLaMa: An Open-source Large Language Model for Plant Science

1 code implementation • 3 Jan 2024 • Xianjun Yang, Junfeng Gao, Wenxin Xue, Erik Alexandersson

Large Language Models (LLMs) have exhibited remarkable capabilities in understanding and interacting with natural language across various sectors.

Language Modelling · Large Language Model

Quokka: An Open-source Large Language Model ChatBot for Material Science

1 code implementation • 2 Jan 2024 • Xianjun Yang, Stephen D. Wilson, Linda Petzold

This paper presents the development of a specialized chatbot for materials science, leveraging the Llama-2 language model, and continuing pre-training on the expansive research articles in the materials science domain from the S2ORC dataset.

Chatbot · Language Modelling +1
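
As a rough illustration of the continued pre-training setup described above, the sketch below runs causal-LM training on a plain-text domain corpus with Hugging Face Transformers; the base checkpoint, corpus path, and hyperparameters are placeholders, not the configuration actually used for Quokka.

```python
# Minimal sketch of continued causal-LM pre-training on a domain corpus.
# Illustrative only: model name, data path, and hyperparameters are placeholders.
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments, DataCollatorForLanguageModeling)
from datasets import load_dataset

model_name = "meta-llama/Llama-2-7b-hf"  # assumed base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# One plain-text file of materials-science articles (placeholder path).
corpus = load_dataset("text", data_files={"train": "materials_corpus.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

tokenized = corpus["train"].map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="domain-cpt", per_device_train_batch_size=1,
                           gradient_accumulation_steps=16, num_train_epochs=1,
                           learning_rate=2e-5, bf16=True),
    train_dataset=tokenized,
    # mlm=False gives the standard next-token (causal LM) objective.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```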

A Survey on Detection of LLMs-Generated Content

1 code implementation • 24 Oct 2023 • Xianjun Yang, Liangming Pan, Xuandong Zhao, Haifeng Chen, Linda Petzold, William Yang Wang, Wei Cheng

The burgeoning capabilities of advanced large language models (LLMs) such as ChatGPT have led to an increase in synthetic content generation with implications across a variety of sectors, including media, cybersecurity, public discourse, and education.

AlpaCare: Instruction-tuned Large Language Models for Medical Application

1 code implementation • 23 Oct 2023 • Xinlu Zhang, Chenxin Tian, Xianjun Yang, Lichang Chen, Zekun Li, Linda Ruth Petzold

Instruction-finetuning (IFT) has become crucial in aligning Large Language Models (LLMs) with diverse human needs and has shown great potential in medical applications.

Instruction Following

Zero-Shot Detection of Machine-Generated Codes

1 code implementation • 8 Oct 2023 • Xianjun Yang, Kexun Zhang, Haifeng Chen, Linda Petzold, William Yang Wang, Wei Cheng

We then modify the previous zero-shot text detection method, DetectGPT (Mitchell et al., 2023) by utilizing a surrogate white-box model to estimate the probability of the rightmost tokens, allowing us to identify code snippets generated by language models.

Language Modelling · Text Detection
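
The surrogate white-box idea in the snippet above can be sketched as scoring the average log-probability of a code snippet's rightmost tokens under an open code model and thresholding it; the surrogate checkpoint, token fraction, and threshold below are assumptions for illustration, not the paper's exact procedure.

```python
# Sketch: score the rightmost tokens of a code snippet with a surrogate
# white-box model; an unusually high average log-probability suggests the
# snippet was machine-generated. Model, fraction, and threshold are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

surrogate = "Salesforce/codegen-350M-mono"  # assumed surrogate code model
tok = AutoTokenizer.from_pretrained(surrogate)
lm = AutoModelForCausalLM.from_pretrained(surrogate).eval()

def rightmost_logprob(code: str, fraction: float = 0.5) -> float:
    ids = tok(code, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = lm(ids).logits
    # log P(token_t | tokens_<t) for every position after the first
    logprobs = torch.log_softmax(logits[0, :-1], dim=-1)
    token_lp = logprobs.gather(1, ids[0, 1:].unsqueeze(1)).squeeze(1)
    k = max(1, int(fraction * token_lp.numel()))  # keep only the rightmost tokens
    return token_lp[-k:].mean().item()

score = rightmost_logprob("def add(a, b):\n    return a + b\n")
print("machine-generated?", score > -2.0)  # threshold is a placeholder
```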

Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models

no code implementations • 4 Oct 2023 • Xianjun Yang, Xiao Wang, Qi Zhang, Linda Petzold, William Yang Wang, Xun Zhao, Dahua Lin

This study serves as a clarion call for a collective effort to overhaul and fortify the safety of open-source LLMs against malicious attackers.

Large Language Models Can Be Good Privacy Protection Learners

no code implementations • 3 Oct 2023 • Yijia Xiao, Yiqiao Jin, Yushi Bai, Yue Wu, Xianjun Yang, Xiao Luo, Wenchao Yu, Xujiang Zhao, Yanchi Liu, Haifeng Chen, Wei Wang, Wei Cheng

To address this challenge, we introduce Privacy Protection Language Models (PPLM), a novel paradigm for fine-tuning LLMs that effectively injects domain-specific knowledge while safeguarding data privacy.

DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text

1 code implementation • 27 May 2023 • Xianjun Yang, Wei Cheng, Yue Wu, Linda Petzold, William Yang Wang, Haifeng Chen

However, this progress also presents a significant challenge in detecting the origin of a given text, and current research on detection methods lags behind the rapid evolution of LLMs.
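
As a loose illustration of the divergent n-gram analysis named in the title, the sketch below regenerates the second half of a text from its prefix and measures n-gram overlap with the original ending; the `regenerate` callable, truncation ratio, and scoring are simplified placeholders rather than the paper's formulation.

```python
# Loose sketch of the divergent n-gram idea: regenerate the second half of a
# text from its prefix and measure n-gram overlap with the original ending.
# `regenerate` is a hypothetical callable (e.g. wrapping an LLM API); the
# truncation ratio, n-gram range, and scoring are simplified placeholders.
from typing import Callable, List

def ngrams(tokens: List[str], n: int) -> set:
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def divergence_score(text: str, regenerate: Callable[[str, int], List[str]],
                     ratio: float = 0.5, k: int = 5, n_range=(3, 6)) -> float:
    words = text.split()
    cut = int(len(words) * ratio)
    prefix, original_suffix = " ".join(words[:cut]), words[cut:]
    overlaps = []
    for cont in regenerate(prefix, k):       # k model continuations of the prefix
        cont_words = cont.split()
        for n in range(*n_range):
            a, b = ngrams(original_suffix, n), ngrams(cont_words, n)
            overlaps.append(len(a & b) / max(1, len(a)))
    # Higher overlap means the model tends to rewrite the same ending,
    # i.e. the text is more likely machine-generated.
    return sum(overlaps) / max(1, len(overlaps))
```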

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

1 code implementation • 22 May 2023 • Xinlu Zhang, Shiyang Li, Xianjun Yang, Chenxin Tian, Yao Qin, Linda Ruth Petzold

Large language models (LLMs) demonstrate remarkable medical expertise, but data privacy concerns impede their direct use in healthcare environments.

Decision Making · Privacy Preserving

LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation

1 code implementation • NeurIPS 2023 • Yujie Lu, Xianjun Yang, Xiujun Li, Xin Eric Wang, William Yang Wang

Existing automatic evaluation on text-to-image synthesis can only provide an image-text matching score, without considering the object-level compositionality, which results in poor correlation with human judgments.

Attribute · Image Generation +2
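
For context, the image-text matching score that this snippet contrasts with is typically a single CLIP-style cosine similarity between image and caption embeddings; the sketch below shows that baseline (not LLMScore itself), with the checkpoint name and image path as placeholders.

```python
# Sketch of the plain image-text matching score the paper argues is insufficient:
# one cosine similarity between CLIP image and text embeddings (this is the
# baseline being criticized, not LLMScore). Checkpoint and image path are placeholders.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

ckpt = "openai/clip-vit-base-patch32"
model = CLIPModel.from_pretrained(ckpt).eval()
processor = CLIPProcessor.from_pretrained(ckpt)

def matching_score(image_path: str, caption: str) -> float:
    inputs = processor(text=[caption], images=Image.open(image_path),
                       return_tensors="pt", padding=True)
    with torch.no_grad():
        img = model.get_image_features(pixel_values=inputs["pixel_values"])
        txt = model.get_text_features(input_ids=inputs["input_ids"],
                                      attention_mask=inputs["attention_mask"])
    img = img / img.norm(dim=-1, keepdim=True)
    txt = txt / txt.norm(dim=-1, keepdim=True)
    return (img @ txt.T).item()   # single global score, no object-level detail

print(matching_score("generated.png", "a red cube on top of a blue sphere"))
```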

Dynamic Prompting: A Unified Framework for Prompt Tuning

1 code implementation • 6 Mar 2023 • Xianjun Yang, Wei Cheng, Xujiang Zhao, Wenchao Yu, Linda Petzold, Haifeng Chen

Experimental results underscore the significant performance improvement achieved by dynamic prompt tuning across a wide range of tasks, including NLP tasks, vision recognition tasks, and vision-language tasks.

Position

Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization

no code implementations • 16 Feb 2023 • Xianjun Yang, Yan Li, Xinlu Zhang, Haifeng Chen, Wei Cheng

Text summarization has been a crucial problem in natural language processing (NLP) for several decades.

Abstractive Text Summarization

MatKB: Semantic Search for Polycrystalline Materials Synthesis Procedures

1 code implementation • 11 Feb 2023 • Xianjun Yang, Stephen Wilson, Linda Petzold

In this paper, we present a novel approach to knowledge extraction and retrieval using Natural Language Processing (NLP) techniques for material science.

Document Classification · Retrieval
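
The semantic search named in the title can be illustrated with a generic embed-and-rank sketch over synthesis-procedure paragraphs; the sentence encoder, example texts, and query below are placeholders, not the MatKB pipeline.

```python
# Generic semantic-search sketch over synthesis-procedure paragraphs:
# embed documents and query with a sentence encoder, rank by cosine similarity.
# Encoder choice and example texts are placeholders, not the MatKB pipeline.
from sentence_transformers import SentenceTransformer, util

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed general-purpose encoder

procedures = [
    "BaTiO3 powders were calcined at 1100 C for 4 h after ball milling.",
    "The precursor solution was spin-coated and annealed under oxygen flow.",
    "Single crystals were grown by the floating-zone method at 10 mm/h.",
]
doc_emb = encoder.encode(procedures, convert_to_tensor=True, normalize_embeddings=True)

query = "solid-state calcination of barium titanate"
query_emb = encoder.encode(query, convert_to_tensor=True, normalize_embeddings=True)

scores = util.cos_sim(query_emb, doc_emb)[0]          # cosine similarity per document
for idx in scores.argsort(descending=True).tolist():  # best match first
    print(f"{scores[idx].item():.3f}  {procedures[idx]}")
```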

OASum: Large-Scale Open Domain Aspect-based Summarization

1 code implementation • 19 Dec 2022 • Xianjun Yang, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Xiaoman Pan, Linda Petzold, Dong Yu

Specifically, zero/few-shot and fine-tuning results show that the model pre-trained on our corpus demonstrates a strong aspect or query-focused generation ability compared with the backbone model.

PcMSP: A Dataset for Scientific Action Graphs Extraction from Polycrystalline Materials Synthesis Procedure Text

1 code implementation • 22 Oct 2022 • Xianjun Yang, Ya Zhuo, Julia Zuo, Xinlu Zhang, Stephen Wilson, Linda Petzold

Scientific action graphs extraction from materials synthesis procedures is important for reproducible research, machine automation, and material prediction.

Named Entity Recognition · Named Entity Recognition (NER) +3

Few-Shot Document-Level Event Argument Extraction

1 code implementation • 6 Sep 2022 • Xianjun Yang, Yujie Lu, Linda Petzold

To fill this gap, we present FewDocAE, a Few-Shot Document-Level Event Argument Extraction benchmark, based on the existing document-level event extraction dataset.

Document-level Event Extraction · Event Argument Extraction +2
