Search Results for author: Jiayang Song

Found 5 papers, 2 papers with code

Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward

no code implementations • 12 Apr 2024 • Xuan Xie, Jiayang Song, Zhehua Zhou, Yuheng Huang, Da Song, Lei Ma

To bridge this gap, we conduct in this work a comprehensive evaluation of the effectiveness of existing online safety analysis methods on LLMs.

Fairness

LUNA: A Model-Based Universal Analysis Framework for Large Language Models

no code implementations • 22 Oct 2023 • Da Song, Xuan Xie, Jiayang Song, Derui Zhu, Yuheng Huang, Felix Juefei-Xu, Lei Ma

the trustworthiness perspective, is bound to and enriches the abstract model with semantics, which enables more detailed analysis applications for diverse purposes.

Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics

1 code implementation • 13 Sep 2023 • Jiayang Song, Zhehua Zhou, Jiawei Liu, Chunrong Fang, Zhan Shu, Lei Ma

Then, the performance of the reward function is assessed, and the results are presented back to the LLM for guiding its self-refinement process.

Common Sense Reasoning, Language Modelling, +1

ISR-LLM: Iterative Self-Refined Large Language Model for Long-Horizon Sequential Task Planning

1 code implementation • 26 Aug 2023 • Zhehua Zhou, Jiayang Song, Kunpeng Yao, Zhan Shu, Lei Ma

Motivated by the substantial achievements observed in Large Language Models (LLMs) in the field of natural language processing, recent research has commenced investigations into the application of LLMs for complex, long-horizon sequential task planning challenges in robotics.

Language Modelling, Large Language Model

Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models

no code implementations • 16 Jul 2023 • Yuheng Huang, Jiayang Song, Zhijie Wang, Shengming Zhao, Huaming Chen, Felix Juefei-Xu, Lei Ma

In particular, we experiment with twelve uncertainty estimation methods and four LLMs on four prominent natural language processing (NLP) tasks to investigate to what extent uncertainty estimation techniques could help characterize the prediction risks of LLMs.

Code Generation, Hallucination, +1
