1 code implementation • COLING 2022 • Yucheng Li, Chenghua Lin, Frank Guerin
The metaphor identification module is able to perform a self-training procedure, which discovers novel metaphors (NMs) from a large-scale unlabeled corpus for NM generation.
no code implementations • LREC 2022 • Thomas Green, Diana Maynard, Chenghua Lin
We present the development of a benchmark suite consisting of an annotation schema, training corpus and baseline model for Entity Recognition (ER) in job descriptions, published under a Creative Commons license.
no code implementations • 9 Apr 2024 • Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, Stephen W. Huang, Wenhu Chen, Jie Fu, Ge Zhang
In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music.
no code implementations • 1 Apr 2024 • Bohao Yang, Kun Zhao, Chen Tang, Liang Zhan, Chenghua Lin
Trainable evaluation metrics are commonly trained with true positive and randomly selected negative responses, resulting in a tendency for them to assign a higher score to the responses that share higher content similarity with a given context.
no code implementations • 26 Mar 2024 • Yuelin Bai, Xinrun Du, Yiming Liang, Yonggang Jin, Ziqiang Liu, Junting Zhou, Tianyu Zheng, Xincheng Zhang, Nuo Ma, Zekun Wang, Ruibin Yuan, Haihong Wu, Hongquan Lin, Wenhao Huang, Jiajun Zhang, Wenhu Chen, Chenghua Lin, Jie Fu, Min Yang, Shiwen Ni, Ge Zhang
To bridge this gap, we introduce COIG-CQIA, a high-quality Chinese instruction tuning dataset.
no code implementations • 20 Mar 2024 • Tyler Loakman, Chen Tang, Chenghua Lin
Previous work in phonologically and phonetically grounded language generation has mainly focused on domains such as puns and poetry.
no code implementations • 7 Mar 2024 • Xingwei Qu, Yiming Liang, Yucheng Wang, Tianyu Zheng, Tommy Yue, Lei Ma, Stephen W. Huang, Jiajun Zhang, Wenhu Chen, Chenghua Lin, Jie Fu, Ge Zhang
It has long been assumed that the sheer number of parameters in large language models (LLMs) drives in-context learning (ICL) capabilities, enabling remarkable performance improvements by leveraging task-specific demonstrations.
1 code implementation • 25 Feb 2024 • Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, Jingcheng Wu, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Emmanouil Benetos, Jie Fu, Gus Xia, Roger Dannenberg, Wei Xue, Shiyin Kang, Yike Guo
It is based on continual pre-training and fine-tuning of LLaMA2 on a text-compatible music representation, ABC notation, treating music as a second language.
no code implementations • 20 Feb 2024 • Yizhi Li, Ge Zhang, Xingwei Qu, Jiali Li, Zhaoqun Li, Zekun Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Stephen W. Huang, Chenghua Lin, Wenhu Chen, Jie Fu
The advancement of large language models (LLMs) has enhanced the ability to generalize across a wide range of unseen natural language processing (NLP) tasks through instruction-following.
1 code implementation • 20 Feb 2024 • Yujie Shao, Xinrong Yao, Xingwei Qu, Chenghua Lin, Shi Wang, Stephen W. Huang, Ge Zhang, Jie Fu
These models generate creative and fluent metaphor sentences more frequently when induced by selected samples from our dataset, demonstrating the value of our corpus for Chinese metaphor research.
1 code implementation • 13 Feb 2024 • Chenghao Xiao, Zhuoxu Huang, Danlu Chen, G Thomas Hudson, Yizhi Li, Haoran Duan, Chenghua Lin, Jie Fu, Jungong Han, Noura Al Moubayed
To our knowledge, this is the first representation learning method devoid of traditional language models for understanding sentence and document semantics, marking a stride closer to human-like textual comprehension.
1 code implementation • 1 Feb 2024 • Yucheng Li, Yunhao Guo, Frank Guerin, Chenghua Lin
We measure: 1) the compression performance on the testing period as a measure of generalization on unseen data; and 2) the performance gap between the training and testing period as a measure of robustness.
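The two measurements above can be sketched with a generic compressor; zlib here is only a stand-in for the language-model compressor the paper evaluates, and the texts and function names are illustrative assumptions.

```python
import zlib

def compression_ratio(text: str) -> float:
    """Compressed size over raw size; lower means more compressible."""
    raw = text.encode("utf-8")
    return len(zlib.compress(raw)) / len(raw)

def generalization_gap(train_text: str, test_text: str) -> float:
    """Gap between testing- and training-period compression, used here as a
    rough robustness proxy (zlib stands in for an LM-based compressor)."""
    return compression_ratio(test_text) - compression_ratio(train_text)
```

A positive gap indicates the compressor handles the training period better than unseen test-period data.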
no code implementations • 29 Jan 2024 • Yucheng Li, Frank Guerin, Chenghua Lin
In this paper, we test various NLP models on the VUA metaphor dataset and quantify to what extent metaphors affect models' performance on various downstream tasks.
1 code implementation • 24 Jan 2024 • Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin
We further annotate the image-text pairs with two-level subset-subcategory hierarchy annotations to facilitate a more comprehensive evaluation of the baselines.
1 code implementation • 22 Jan 2024 • Ge Zhang, Xinrun Du, Bei Chen, Yiming Liang, Tongxu Luo, Tianyu Zheng, Kang Zhu, Yuyang Cheng, Chunpu Xu, Shuyue Guo, Haoran Zhang, Xingwei Qu, Junjie Wang, Ruibin Yuan, Yizhi Li, Zekun Wang, Yudong Liu, Yu-Hsuan Tsai, Fengji Zhang, Chenghua Lin, Wenhao Huang, Wenhu Chen, Jie Fu
We introduce CMMMU, a new Chinese Massive Multi-discipline Multimodal Understanding benchmark designed to evaluate LMMs on tasks demanding college-level subject knowledge and deliberate reasoning in a Chinese context.
1 code implementation • 12 Jan 2024 • Tianyu Zheng, Shuyue Guo, Xingwei Qu, Jiawei Guo, Weixu Zhang, Xinrun Du, Qi Jia, Chenghua Lin, Wenhao Huang, Wenhu Chen, Jie Fu, Ge Zhang
In this paper, we introduce Kun, a novel approach for creating high-quality instruction-tuning datasets for large language models (LLMs) without relying on manual annotations.
no code implementations • 28 Dec 2023 • Zhihao Zhang, Yuan Zuo, Chenghua Lin, Junjie Wu
Finally, we merge the quality phrases from both the Annotator and Generator as the final predictions, considering their complementary nature and distinct characteristics.
1 code implementation • 19 Dec 2023 • Yucheng Li, Frank Guerin, Chenghua Lin
LatestEval avoids data contamination by only using texts published within a recent time window, ensuring no overlap with the training corpora of pre-trained language models.
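The recency-window filtering idea can be sketched as follows; the `published` field and the 90-day default window are illustrative assumptions, not LatestEval's actual interface.

```python
from datetime import date, timedelta

def latest_window(docs, today: date, window_days: int = 90):
    """Keep only documents published within a recent time window, a simple way
    to avoid overlap with the training corpora of older pre-trained models."""
    cutoff = today - timedelta(days=window_days)
    return [d for d in docs if d["published"] >= cutoff]
```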
1 code implementation • 8 Dec 2023 • Jun Bai, Xiaofeng Zhang, Chen Li, Hanhua Hong, Xi Xu, Chenghua Lin, Wenge Rong
However, a comprehensive comparison of these estimation methods is still lacking.
1 code implementation • 19 Nov 2023 • Chen Tang, Tyler Loakman, Chenghua Lin
These results underscore the effectiveness of our model in leveraging context and event features to improve the quality of generated narratives.
no code implementations • 16 Nov 2023 • Yiqi Liu, Nafise Sadat Moosavi, Chenghua Lin
Automatic evaluation of generated textual content presents an ongoing challenge within the field of NLP.
no code implementations • 9 Nov 2023 • Tyler Loakman, Aaron Maladry, Chenghua Lin
Human evaluation is often considered to be the gold standard method of evaluating a Natural Language Generation system.
no code implementations • 31 Oct 2023 • Chen Tang, Frank Guerin, Chenghua Lin
This paper presents a tool called "ACL Anthology Helper".
1 code implementation • 26 Oct 2023 • Yucheng Li, Frank Guerin, Chenghua Lin
We also introduce an open-source pipeline that enables the community to perform contamination analysis on customised data and models.
1 code implementation • 24 Oct 2023 • Tomas Goldsack, Zhihao Zhang, Chen Tang, Carolina Scarton, Chenghua Lin
Previous approaches for automatic lay summarisation are exclusively reliant on the source article that, given it is written for a technical audience (e.g., researchers), is unlikely to explicitly define all technical concepts or state all of the background information that is relevant for a lay audience.
1 code implementation • 24 Oct 2023 • Chen Tang, Shun Wang, Tomas Goldsack, Chenghua Lin
Abstracts derived from biomedical literature possess distinct domain-specific characteristics, including specialised writing styles and biomedical terminologies, which necessitate a deep understanding of the related literature.
1 code implementation • 24 Oct 2023 • Chenghao Xiao, Yizhi Li, G Thomas Hudson, Chenghua Lin, Noura Al Moubayed
In recent years, contrastive learning (CL) has been extensively utilized to recover sentence and document-level encoding capability from pre-trained language models.
1 code implementation • 9 Oct 2023 • Yucheng Li, Bo Dong, Chenghua Lin, Frank Guerin
This paper proposes a method called Selective Context that enhances the inference efficiency of LLMs by identifying and pruning redundancy in the input context to make the input more compact.
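The pruning idea can be sketched with a toy unigram language model standing in for the causal LM used to score self-information; the whitespace tokenisation, add-one smoothing, and keep ratio here are all illustrative assumptions, not the paper's exact setup.

```python
import math
from collections import Counter

def self_information(tokens, counts, total):
    # I(t) = -log2 p(t) under a (toy, smoothed) unigram LM;
    # rarer tokens carry more information.
    return [-math.log2((counts[t] + 1) / (total + len(counts))) for t in tokens]

def selective_context(text: str, keep_ratio: float = 0.5) -> str:
    """Drop the lowest-information tokens, keeping `keep_ratio` of them,
    to make the input more compact before feeding it to an LLM."""
    tokens = text.split()
    counts = Counter(tokens)
    scores = self_information(tokens, counts, len(tokens))
    k = max(1, int(len(tokens) * keep_ratio))
    # Indices of the k most informative tokens, restored to original order.
    keep = sorted(sorted(range(len(tokens)), key=lambda i: -scores[i])[:k])
    return " ".join(tokens[i] for i in keep)
```

Frequent, low-information tokens are pruned first, so repeated filler words drop out while distinctive content words survive.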
no code implementations • 29 Sep 2023 • Tomas Goldsack, Zheheng Luo, Qianqian Xie, Carolina Scarton, Matthew Shardlow, Sophia Ananiadou, Chenghua Lin
This paper presents the results of the shared task on Lay Summarisation of Biomedical Research Articles (BioLaySumm), hosted at the BioNLP Workshop at ACL 2023.
no code implementations • 22 Sep 2023 • Bohao Yang, Chen Tang, Kun Zhao, Chenghao Xiao, Chenghua Lin
Large Language Models (LLMs) have demonstrated remarkable performance across a wide range of natural language processing tasks.
no code implementations • 21 Sep 2023 • Yang Wang, Qibin Liang, Chenghao Xiao, Yizhi Li, Noura Al Moubayed, Chenghua Lin
Audio classification plays a crucial role in speech and sound processing tasks with a wide range of applications.
1 code implementation • 19 Sep 2023 • Bohao Yang, Chen Tang, Chenghua Lin
In this paper, we propose a novel framework that models dialogues between patients and healthcare professionals using AMR graphs, where the neural networks incorporate textual and graphical knowledge with a dual attention mechanism.
no code implementations • 11 Jul 2023 • Yinghao Ma, Ruibin Yuan, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Chenghua Lin, Emmanouil Benetos, Anton Ragni, Norbert Gyenge, Ruibo Liu, Gus Xia, Roger Dannenberg, Yike Guo, Jie Fu
Our findings suggest that training with music data can generally improve performance on MIR tasks, even when models are trained using paradigms designed for speech.
1 code implementation • 29 Jun 2023 • Le Zhuo, Ruibin Yuan, Jiahao Pan, Yinghao Ma, Yizhi Li, Ge Zhang, Si Liu, Roger Dannenberg, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenhu Chen, Wei Xue, Yike Guo
We introduce LyricWhiz, a robust, multilingual, and zero-shot automatic lyrics transcription method achieving state-of-the-art performance on various lyrics transcription datasets, even in challenging genres such as rock and metal.
1 code implementation • 28 Jun 2023 • Chen Tang, Hongbo Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin
Further analysis also shows that our representation learning framework can fill the semantic gap by coagulating representations of both text and graph knowledge.
1 code implementation • NeurIPS 2023 • Ruibin Yuan, Yinghao Ma, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Le Zhuo, Yiqi Liu, Jiawen Huang, Zeyue Tian, Binyue Deng, Ningzhi Wang, Chenghua Lin, Emmanouil Benetos, Anton Ragni, Norbert Gyenge, Roger Dannenberg, Wenhu Chen, Gus Xia, Wei Xue, Si Liu, Shi Wang, Ruibo Liu, Yike Guo, Jie Fu
This is evident in the limited work on deep music representations, the scarcity of large-scale datasets, and the absence of a universal and community-driven benchmark.
1 code implementation • 6 Jun 2023 • Tyler Loakman, Chen Tang, Chenghua Lin
Previous work in phonetically-grounded language generation has mainly focused on domains such as lyrics and poetry.
1 code implementation • 31 May 2023 • Yizhi Li, Ruibin Yuan, Ge Zhang, Yinghao Ma, Xingran Chen, Hanzhi Yin, Chenghao Xiao, Chenghua Lin, Anton Ragni, Emmanouil Benetos, Norbert Gyenge, Roger Dannenberg, Ruibo Liu, Wenhu Chen, Gus Xia, Yemin Shi, Wenhao Huang, Zili Wang, Yike Guo, Jie Fu
Although SSL has been proven effective in speech and audio, its application to music audio has yet to be thoroughly explored.
1 code implementation • 26 May 2023 • Kun Zhao, Bohao Yang, Chenghua Lin, Wenge Rong, Aline Villavicencio, Xiaohui Cui
The long-standing one-to-many issue in open-domain dialogues poses significant challenges for automatic evaluation methods, i.e., there may be multiple suitable responses that differ in semantics for a given conversational context.
1 code implementation • 26 May 2023 • Yucheng Li, Shun Wang, Chenghua Lin, Frank Guerin
One noticeable trend in metaphor detection is the embrace of linguistic theories such as the metaphor identification procedure (MIP) for model architecture design.
no code implementations • 22 May 2023 • Zekun Wang, Ge Zhang, Kexin Yang, Ning Shi, Wangchunshu Zhou, Shaochun Hao, Guangzheng Xiong, Yizhi Li, Mong Yuan Sim, Xiuying Chen, Qingqing Zhu, Zhenzhu Yang, Adam Nik, Qi Liu, Chenghua Lin, Shi Wang, Ruibo Liu, Wenhu Chen, Ke Xu, Dayiheng Liu, Yike Guo, Jie Fu
Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within the field of NLP, aimed at addressing limitations in existing frameworks while aligning with the ultimate goals of artificial intelligence.
2 code implementations • 20 May 2023 • Weifeng Jiang, Qianren Mao, Chenghua Lin, JianXin Li, Ting Deng, Weiyi Yang, Zheng Wang
Many text mining models are constructed by fine-tuning a large deep pre-trained language model (PLM) in downstream tasks.
1 code implementation • 10 May 2023 • Hongbo Zhang, Chen Tang, Tyler Loakman, Chenghua Lin, Stefan Goetze
In this paper, we propose a novel context-aware graph-attention model (Context-aware GAT), which can effectively incorporate global features of relevant knowledge graphs based on a context-enhanced knowledge aggregation process.
2 code implementations • 17 Apr 2023 • Ge Zhang, Yemin Shi, Ruibo Liu, Ruibin Yuan, Yizhi Li, Siwei Dong, Yu Shu, Zhaoqun Li, Zekun Wang, Chenghua Lin, Wenhao Huang, Jie Fu
Instruction tuning is widely recognized as a key technique for building generalist language models, which has attracted the attention of researchers and the public with the release of InstructGPT (Ouyang et al., 2022) and ChatGPT (https://chat.openai.com/).
no code implementations • 18 Mar 2023 • Shekoufeh Kolahdouz-Rahimi, Kevin Lano, Chenghua Lin
We found that heuristic NLP approaches are the most common NLP techniques used for automatic RF, primarily operating on structured and semi-structured data.
1 code implementation • 11 Feb 2023 • Shun Wang, Yucheng Li, Chenghua Lin, Loïc Barrault, Frank Guerin
We propose a novel RoBERTa-based model, RoPPT, which introduces a target-oriented parse tree structure in metaphor detection.
1 code implementation • 9 Feb 2023 • Yucheng Li, Shun Wang, Chenghua Lin, Frank Guerin, Loïc Barrault
In this paper, we propose FrameBERT, a RoBERTa-based model that can explicitly learn and incorporate FrameNet Embeddings for concept-level metaphor detection.
1 code implementation • 30 Jan 2023 • Yucheng Li, Frank Guerin, Chenghua Lin
Metaphors are proven to have a stronger emotional impact than literal expressions.
1 code implementation • 1 Jan 2023 • Ge Zhang, Yizhi Li, Yaoyao Wu, Linyuan Zhang, Chenghua Lin, Jiayi Geng, Shi Wang, Jie Fu
As natural language processing (NLP) for gender bias becomes a significant interdisciplinary topic, the prevalent data-driven techniques such as large-scale language models suffer from data inadequacy and biased corpus, especially for languages with insufficient resources such as Chinese.
no code implementations • 8 Dec 2022 • Noor Fazilla Abd Yusof, Chenghua Lin
While outcome monitoring tends to improve therapy outcomes, the current method faces many challenges, e.g., the time and financial burden of administering questionnaires, and of scoring and analysing the results.
no code implementations • 5 Dec 2022 • Yizhi Li, Ruibin Yuan, Ge Zhang, Yinghao Ma, Chenghua Lin, Xingran Chen, Anton Ragni, Hanzhi Yin, Zhijie Hu, Haoyu He, Emmanouil Benetos, Norbert Gyenge, Ruibo Liu, Jie Fu
The deep learning community has witnessed an exponentially growing interest in self-supervised learning (SSL).
1 code implementation • 5 Nov 2022 • Yizhi Li, Ge Zhang, Bohao Yang, Chenghua Lin, Shi Wang, Anton Ragni, Jie Fu
In addition to verifying the existence of regional bias in LMs, we find that the biases on regional groups can be strongly influenced by the geographical clustering of the groups.
1 code implementation • 1 Nov 2022 • Jianfei Zhang, Jun Bai, Chenghua Lin, Yanmeng Wang, Wenge Rong
Effective ways have been proposed to prevent posterior collapse in VAEs, but we observe that they in essence trade off posterior collapse against the hole problem, i.e., a mismatch between the aggregated posterior distribution and the prior distribution.
1 code implementation • 27 Oct 2022 • Chen Tang, Hongbo Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin
In this paper, we propose a novel framework to improve medical dialogue generation by considering features centered on domain-specific terminology.
1 code implementation • 22 Oct 2022 • Chen Tang, Chenghua Lin, Henglin Huang, Frank Guerin, Zhihao Zhang
One of the key challenges of automatic story generation is how to generate a long narrative that can maintain fluency, relevance, and coherence.
1 code implementation • 19 Oct 2022 • Chen Tang, Zhihao Zhang, Tyler Loakman, Chenghua Lin, Frank Guerin
To improve the performance of long text generation, recent studies have leveraged automatically planned event structures (i.e., storylines) to guide story generation.
1 code implementation • 19 Oct 2022 • Henglin Huang, Chen Tang, Tyler Loakman, Frank Guerin, Chenghua Lin
Despite the success of prior work applying pre-trained models, current neural models for Chinese stories still struggle to generate high-quality long text narratives.
1 code implementation • 18 Oct 2022 • Tomas Goldsack, Zhihao Zhang, Chenghua Lin, Carolina Scarton
Lay summarisation aims to jointly summarise and simplify a given text, thus making its content more comprehensible to non-experts.
Ranked #1 on Lay Summarization on PLOS
no code implementations • 11 Jul 2022 • Owen Millwood, Jack Miskelly, Bohao Yang, Prosanta Gope, Elif Kavun, Chenghua Lin
As the demand for highly secure and dependable lightweight systems increases in the modern world, Physically Unclonable Functions (PUFs) continue to promise a lightweight alternative to high-cost encryption techniques and secure key storage.
1 code implementation • 10 Jun 2022 • Yucheng Li, Chenghua Lin, Frank Guerin
Metaphor generation is a challenging task which can impact many downstream tasks such as improving user satisfaction with dialogue systems and story generation.
1 code implementation • 27 Apr 2022 • Yizhi Li, Wei Fan, Chao Liu, Chenghua Lin, Jiang Qian
However, such a method strictly restricts entities to the hyper-ellipsoid surfaces, which limits the optimization of entity distribution and leads to suboptimal performance in knowledge graph completion.
1 code implementation • 6 Mar 2022 • Chen Tang, Frank Guerin, Chenghua Lin
In recent years, considerable research has been dedicated to the application of neural models in the field of natural language generation (NLG).
1 code implementation • 12 Oct 2021 • Jiayuan Ding, Tong Xiang, Zijing Ou, Wangyang Zuo, Ruihui Zhao, Chenghua Lin, Yefeng Zheng, Bang Liu
In this paper, we introduce a new task named Reading Path Generation (RPG) which aims at automatically producing a path of papers to read for a given query.
no code implementations • 7 Oct 2021 • Ruizhe Li, Xutan Peng, Chenghua Lin
In this paper, we provide the first focused study on the discontinuities (aka. holes) in the latent space of Variational Auto-Encoders (VAEs).
no code implementations • 29 Sep 2021 • Ruizhe Li, Xutan Peng, Chenghua Lin
In this paper, we provide the first focused study on the discontinuities (aka. holes) in the latent space of Variational Auto-Encoders (VAEs).
no code implementations • 29 Aug 2021 • Mohamad Hardyman Barawi, Chenghua Lin, Advaith Siddharthan, Yinbin Liu
Our experimental results on three real-world datasets show that both the extractive and abstractive approaches outperform four strong baselines in terms of facilitating topic understanding and interpretation.
1 code implementation • INLG (ACL) 2021 • Chengkun Zeng, Guanyi Chen, Chenghua Lin, Ruizhe Li, Zhigang Chen
Understanding a speaker's feelings and producing appropriate responses with emotional connection is a key communicative skill for empathetic dialogue systems.
1 code implementation • NAACL 2021 • Xutan Peng, Chenghua Lin, Mark Stevenson
It is therefore recommended that this strategy be adopted as a standard for CLWE methods.
no code implementations • ACL 2021 • Yi Cheng, SiYao Li, Bang Liu, Ruihui Zhao, Sujian Li, Chenghua Lin, Yefeng Zheng
This paper explores the task of Difficulty-Controllable Question Generation (DCQG), which aims at generating questions with required difficulty levels.
1 code implementation • 11 Apr 2021 • Xutan Peng, Chenghua Lin, Mark Stevenson
It is therefore recommended that this strategy be adopted as a standard for CLWE methods.
1 code implementation • NAACL 2021 • Xutan Peng, Guanyi Chen, Chenghua Lin, Mark Stevenson
Knowledge Graph Embeddings (KGEs) have been intensively explored in recent years due to their promise for a wide range of applications.
no code implementations • 7 Apr 2021 • Rui Mao, Chenghua Lin, Frank Guerin
Metaphorical expressions are difficult linguistic phenomena, challenging diverse Natural Language Processing tasks.
no code implementations • 7 Apr 2021 • Rui Mao, Chenghua Lin, Frank Guerin
The pre-trained word embeddings GloVe, ELMo and BERT have individually shown good performance on sequential metaphor identification.
1 code implementation • EACL 2021 • Xutan Peng, Yi Zheng, Chenghua Lin, Advaith Siddharthan
We introduce the task of historical text summarisation, where documents in historical forms of a language are summarised in the corresponding modern language.
no code implementations • 2 Dec 2020 • Jing Su, Chenghua Lin, Mian Zhou, Qingyun Dai, Haoyu Lv
In this paper, we propose an end-to-end CNN-LSTM model for generating descriptions for sequential images with a local-object attention mechanism.
1 code implementation • COLING 2020 • Ruizhe Li, Xiao Li, Guanyi Chen, Chenghua Lin
The Variational Autoencoder (VAE) is a popular and powerful model applied to text modelling to generate diverse sentences.
no code implementations • EMNLP 2020 • Xiao Li, Guanyi Chen, Chenghua Lin, Ruizhe Li
We propose DGST, a novel and simple Dual-Generator network architecture for text Style Transfer.
no code implementations • 16 May 2020 • Xiao Li, Kees Van Deemter, Chenghua Lin
Recent years have seen a number of proposals for performing Natural Language Generation (NLG) based in large part on statistical techniques.
no code implementations • NAACL 2021 • Dingmin Wang, Chenghua Lin, Qi Liu, Kam-Fai Wong
We present a fast and scalable architecture called Explicit Modular Decomposition (EMD), in which we incorporate both classification-based and extraction-based methods and design four modules (for classification and sequence labelling) to jointly extract dialogue states.
1 code implementation • 2 Apr 2020 • Xutan Peng, Mark Stevenson, Chenghua Lin, Chen Li
The technique of Cross-Lingual Word Embedding (CLWE) plays a fundamental role in tackling Natural Language Processing challenges for low-resource languages.
1 code implementation • WS 2019 • Ruizhe Li, Xiao Li, Chenghua Lin, Matthew Collinson, Rui Mao
Variational Autoencoder (VAE) is a powerful method for learning representations of high-dimensional data.
1 code implementation • WS 2019 • Guanyi Chen, Kees Van Deemter, Silvia Pagliaro, Louk Smalbil, Chenghua Lin
To inform these algorithms, we conducted a series of elicitation experiments in which human speakers were asked to perform a linguistic task that invites the use of quantified expressions.
no code implementations • WS 2019 • Guanyi Chen, Kees Van Deemter, Chenghua Lin
Quantified expressions have always taken up a central position in formal theories of meaning and language use.
no code implementations • 13 Sep 2019 • Wenjun Liao, Chenghua Lin
The second approach is based on word embeddings, where the word2vec model is introduced and two document-similarity algorithms are implemented: word2vec cosine similarity and WMD distance.
2 code implementations • ICML 2020 • Xiao Li, Chenghua Lin, Ruizhe Li, Chaozheng Wang, Frank Guerin
We demonstrate the utility of our method for attribute manipulation in autoencoders trained across varied domains, using both human evaluation and automated methods.
Ranked #7 on Image Generation on CelebA 256x256 (FID metric)
1 code implementation • ACL 2019 • Rui Mao, Chenghua Lin, Frank Guerin
End-to-end training with Deep Neural Networks (DNN) is a currently popular method for metaphor identification.
no code implementations • WS 2018 • Guanyi Chen, Kees Van Deemter, Chenghua Lin
We extend the classic Referring Expressions Generation task by considering zero pronouns in "pro-drop" languages such as Chinese, modelling their use by means of the Bayesian Rational Speech Acts model (Frank and Goodman, 2012).
no code implementations • WS 2018 • Xiao Li, Kees Van Deemter, Chenghua Lin
This paper argues that a new generic approach to statistical NLG can be made to perform Referring Expression Generation (REG) successfully.
1 code implementation • WS 2018 • Guanyi Chen, Kees Van Deemter, Chenghua Lin
We introduce SimpleNLG-ZH, a realisation engine for Mandarin that follows the software design paradigm of SimpleNLG (Gatt and Reiter, 2009).
no code implementations • CONLL 2019 • Ruizhe Li, Chenghua Lin, Matthew Collinson, Xiao Li, Guanyi Chen
Recognising dialogue acts (DA) is important for many natural language processing tasks such as dialogue generation and intention recognition.
Ranked #4 on Dialogue Act Classification on Switchboard corpus
no code implementations • ACL 2018 • Rui Mao, Chenghua Lin, Frank Guerin
Metaphoric expressions are widespread in natural language, posing a significant challenge for various natural language processing tasks such as Machine Translation.
no code implementations • SEMEVAL 2018 • Rui Mao, Guanyi Chen, Ruizhe Li, Chenghua Lin
This paper describes the system that we submitted for SemEval-2018 task 10: capturing discriminative attributes.
no code implementations • WS 2017 • Noor Fazilla Abd Yusof, Chenghua Lin, Frank Guerin
We develop a computational model to discover the potential causes of depression by analysing the topics in user-generated text.
no code implementations • IJCNLP 2017 • Ebuka Ibeke, Chenghua Lin, Adam Wyner, Mohamad Hardyman Barawi
Contrastive opinion mining is essential in identifying, extracting and organising opinions from user-generated texts.