A Watermark for Low-entropy and Unbiased Generation in Large Language Models

1 code implementation • 23 May 2024 • Minjia Mao, Dongjun Wei, Zeyu Chen, Xiao Fang, Michael Chau

This study proposes the Sampling One Then Accepting (STA-1) method, an unbiased watermark that requires access to neither the LLM nor the prompt during detection and has statistical guarantees for the type II error.

Qualité vocale dans l'acquisition d'une langue étrangère : le cas des apprenants sinophones en FLE (Voice quality in the second language acquisition: The case of Chinese learners of French as Foreign Language)

no code implementations • JEPTALNRECITAL 2020 • Dongjun Wei, Mohamed Embarki

The acoustic measures, mean F0 of the read text and mean F0 of the vowel [a], show ordered intra- and inter-individual variation in both languages, between reading in L1 Chinese and reading in L2 French, and between L1 French speakers and L2 French learners.

Language Acquisition

AutoSUM: Automating Feature Extraction and Multi-user Preference Simulation for Entity Summarization

1 code implementation • 25 May 2020 • Dongjun Wei, Yaxin Liu, Fuqing Zhu, Liangjun Zang, Wei Zhou, Yijun Lu, Songlin Hu

In this paper, a novel integration method called AutoSUM is proposed for automatic feature extraction and multi-user preference simulation to overcome the drawbacks of previous methods.

feature selection, Word Embeddings

MPSUM: Entity Summarization with Predicate-based Matching

1 code implementation • 25 May 2020 • Dongjun Wei, Shiyuan Gao, Yaxin Liu, Zhibing Liu, Longtao Hang

With the development of the Semantic Web, entity summarization has emerged as a task for generating concrete summaries of real-world entities.

TransSent: Towards Generation of Structured Sentences with Discourse Marker

no code implementations • 5 Sep 2019 • Xing Wu, Dongjun Wei, Liangjun Zang, Jizhong Han, Songlin Hu

Automatic and human evaluation results show that TransSent can generate high-quality structured sentences and shows a degree of scalability across different tasks.

Dialogue Generation, Sentence

ESA: Entity Summarization with Attention

2 code implementations • 25 May 2019 • Dongjun Wei, Yaxin Liu, Fuqing Zhu, Liangjun Zang, Wei Zhou, Jizhong Han, Songlin Hu

Entity summarization aims at creating brief but informative descriptions of entities from knowledge graphs.

Clustering, Knowledge Graphs

