Search Results for author: Yucheng Li

Found 20 papers, 14 papers with code

CM-Gen: A Neural Framework for Chinese Metaphor Generation with Explicit Context Modelling

1 code implementation COLING 2022 Yucheng Li, Chenghua Lin, Frank Guerin

The metaphor identification module is able to perform a self-training procedure, which discovers novel metaphors from a large-scale unlabeled corpus for NM generation.

On the Rigour of Scientific Writing: Criteria, Analysis, and Insights

no code implementations7 Oct 2024 Joseph James, Chenghao Xiao, Yucheng Li, Chenghua Lin

Rigour is crucial for scientific research as it ensures the reproducibility and validity of results and findings.

With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models

no code implementations23 Sep 2024 Tyler Loakman, Yucheng Li, Chenghua Lin

To investigate this, we analyse the ability of VLMs and LLMs to demonstrate sound symbolism (i. e., to recognise a non-arbitrary link between sounds and concepts) as well as their ability to ``hear'' via the interplay of the language and vision modules of open and closed-source multimodal models.

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

2 code implementations2 Jul 2024 Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu

With the pattern and sparse indices, we perform efficient sparse attention calculations via our optimized GPU kernels to significantly reduce the latency in the pre-filling stage of long-context LLMs.

Language Modelling Large Language Model

Evaluating Large Language Models for Generalization and Robustness via Data Compression

1 code implementation1 Feb 2024 Yucheng Li, Yunhao Guo, Frank Guerin, Chenghua Lin

We measure: 1) the compression performance on the testing period as a measure of generalization on unseen data; and 2) the performance gap between the training and testing period as a measure of robustness.

Data Compression

Finding Challenging Metaphors that Confuse Pretrained Language Models

no code implementations29 Jan 2024 Yucheng Li, Frank Guerin, Chenghua Lin

In this paper, we test various NLP models on the VUA metaphor dataset and quantify to what extent metaphors affect models' performance on various downstream tasks.

Machine Translation

LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction

1 code implementation19 Dec 2023 Yucheng Li, Frank Guerin, Chenghua Lin

LatestEval avoids data contamination by only using texts published within a recent time window, ensuring no overlap with the training corpora of pre-trained language models.

Language Modelling Reading Comprehension

An Open Source Data Contamination Report for Large Language Models

1 code implementation26 Oct 2023 Yucheng Li, Frank Guerin, Chenghua Lin

We also introduce an open-source pipeline that enables the community to perform contamination analysis on customised data and models.

HellaSwag Language Modelling +3

Compressing Context to Enhance Inference Efficiency of Large Language Models

2 code implementations9 Oct 2023 Yucheng Li, Bo Dong, Chenghua Lin, Frank Guerin

This paper proposes a method called Selective Context that enhances the inference efficiency of LLMs by identifying and pruning redundancy in the input context to make the input more compact.

Question Answering Response Generation

Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation

1 code implementation19 Sep 2023 Yucheng Li

Data contamination in model evaluation is getting increasingly prevalent as the massive training corpora of large language models often unintentionally include benchmark samples.

Language Modelling Multiple-choice +1

Metaphor Detection via Explicit Basic Meanings Modelling

1 code implementation26 May 2023 Yucheng Li, Shun Wang, Chenghua Lin, Guerin Frank

One noticeable trend in metaphor detection is the embrace of linguistic theories such as the metaphor identification procedure (MIP) for model architecture design.

Sentence

Unlocking Context Constraints of LLMs: Enhancing Context Efficiency of LLMs with Self-Information-Based Content Filtering

1 code implementation24 Apr 2023 Yucheng Li

Large language models (LLMs) have received significant attention by achieving remarkable performance across various tasks.

Question Answering

Metaphor Detection with Effective Context Denoising

1 code implementation11 Feb 2023 Shun Wang, Yucheng Li, Chenghua Lin, Loïc Barrault, Frank Guerin

We propose a novel RoBERTa-based model, RoPPT, which introduces a target-oriented parse tree structure in metaphor detection.

Denoising

FrameBERT: Conceptual Metaphor Detection with Frame Embedding Learning

1 code implementation9 Feb 2023 Yucheng Li, Shun Wang, Chenghua Lin, Frank Guerin, Loïc Barrault

In this paper, we propose FrameBERT, a RoBERTa-based model that can explicitly learn and incorporate FrameNet Embeddings for concept-level metaphor detection.

The Secret of Metaphor on Expressing Stronger Emotion

1 code implementation30 Jan 2023 Yucheng Li, Frank Guerin, Chenghua Lin

Metaphors are proven to have stronger emotional impact than literal expressions.

Specificity

Nominal Metaphor Generation with Multitask Learning

1 code implementation10 Jun 2022 Yucheng Li, Chenghua Lin, Frank Geurin

Metaphor generation is a challenging task which can impact many downstream tasks such as improving user satisfaction with dialogue systems and story generation.

Story Generation

Cache-Augmented Inbatch Importance Resampling for Training Recommender Retriever

no code implementations30 May 2022 Jin Chen, Defu Lian, Yucheng Li, Baoyun Wang, Kai Zheng, Enhong Chen

Recommender retrievers aim to rapidly retrieve a fraction of items from the entire item corpus when a user query requests, with the representative two-tower model trained with the log softmax loss.

GPU-FV: Realtime Fisher Vector and Its Applications in Video Monitoring

1 code implementation12 Apr 2016 Wenying Ma, Liangliang Cao, Lei Yu, Guoping Long, Yucheng Li

We also applied GPU-FV for realtime video monitoring tasks and found that GPU-FV outperforms a number of previous works.

Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.