Search Results for author: Yuheng Huang

Found 11 papers, 4 papers with code

Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward

no code implementations12 Apr 2024 Xuan Xie, Jiayang Song, Zhehua Zhou, Yuheng Huang, Da Song, Lei Ma

To bridge this gap, we conduct in this work a comprehensive evaluation of the effectiveness of existing online safety analysis methods on LLMs.

Fairness

PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement

1 code implementation6 Mar 2024 Zhijie Wang, Yuheng Huang, Da Song, Lei Ma, Tianyi Zhang

However, prompting remains challenging for novice users due to the complexity of the stable diffusion model and the non-trivial efforts required for iteratively editing and refining the text prompts.

Image Inpainting Prompt Engineering +1

LUNA: A Model-Based Universal Analysis Framework for Large Language Models

no code implementations22 Oct 2023 Da Song, Xuan Xie, Jiayang Song, Derui Zhu, Yuheng Huang, Felix Juefei-Xu, Lei Ma

the trustworthiness perspective, is bound to and enriches the abstract model with semantics, which enables more detailed analysis applications for diverse purposes.

MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition

no code implementations8 Aug 2023 Yu Pan, Yuguang Yang, Yuheng Huang, Jixun Yao, JingJing Yin, Yanni Hu, Heng Lu, Lei Ma, Jianjun Zhao

Despite notable progress, speech emotion recognition (SER) remains challenging due to the intricate and ambiguous nature of speech emotion, particularly in wild world.

Attribute Cross-corpus +2

Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models

no code implementations16 Jul 2023 Yuheng Huang, Jiayang Song, Zhijie Wang, Shengming Zhao, Huaming Chen, Felix Juefei-Xu, Lei Ma

In particular, we experiment with twelve uncertainty estimation methods and four LLMs on four prominent natural language processing (NLP) tasks to investigate to what extent uncertainty estimation techniques could help characterize the prediction risks of LLMs.

Code Generation Hallucination +1

Rethinking the Role of Pre-ranking in Large-scale E-Commerce Searching System

no code implementations23 May 2023 Zhixuan Zhang, Yuheng Huang, Dan Ou, Sen Li, Longbin Li, Qingwen Liu, Xiaoyi Zeng

As such, the metric of a pre-ranking model follows the ranking model using Area Under ROC (AUC) for offline evaluation.

DeepLens: Interactive Out-of-distribution Data Detection in NLP Models

1 code implementation2 Mar 2023 Da Song, Zhijie Wang, Yuheng Huang, Lei Ma, Tianyi Zhang

In this work, we propose DeepLens, an interactive system that helps users detect and explore OOD issues in massive text corpora.

Text Clustering

DeepSeer: Interactive RNN Explanation and Debugging via State Abstraction

1 code implementation2 Mar 2023 Zhijie Wang, Yuheng Huang, Da Song, Lei Ma, Tianyi Zhang

The core of DeepSeer is a state abstraction method that bundles semantically similar hidden states in an RNN model and abstracts the model as a finite state machine.

Explainable Artificial Intelligence (XAI)

An Exploratory Study of AI System Risk Assessment from the Lens of Data Distribution and Uncertainty

no code implementations13 Dec 2022 Zhijie Wang, Yuheng Huang, Lei Ma, Haruki Yokoyama, Susumu Tokumoto, Kazuki Munakata

More importantly, it also lacks systematic investigation on how to perform the risk assessment for AI systems from unit level to system level.

PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing

no code implementations19 Nov 2021 Yuheng Huang, Lei Ma, Yuanchun Li

Vision Transformer (ViT) is known to be highly nonlinear like other classical neural networks and could be easily fooled by both natural and adversarial patch perturbations.

Neuron-level Structured Pruning using Polarization Regularizer

1 code implementation NeurIPS 2020 Tao Zhuang, Zhixuan Zhang, Yuheng Huang, Xiaoyi Zeng, Kai Shuang, Xiang Li

Experimentally, we show that structured pruning using polarization regularizer achieves much better results than using L1 regularizer.

Cannot find the paper you are looking for? You can Submit a new open access paper.