Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication

12 Sep 2020 · Haihua Chen, Huyen Nguyen ·

Citation function and citation sentiment are two essential aspects of citation content analysis (CCA), which are useful for influence analysis, the recommendation of scientific publications. However, existing studies are mostly traditional machine learning methods, although deep learning techniques have also been explored, the improvement of the performance seems not significant due to insufficient training data, which brings difficulties to applications. In this paper, we propose to fine-tune pre-trained contextual embeddings ULMFiT, BERT, and XLNet for the task. Experiments on three public datasets show that our strategy outperforms all the baselines in terms of the F1 score. For citation function identification, the XLNet model achieves 87.2%, 86.90%, and 81.6% on DFKI, UMICH, and TKDE2019 datasets respectively, while it achieves 91.72% and 91.56% on DFKI and UMICH in term of citation sentiment identification. Our method can be used to enhance the influence analysis of scholars and scholarly publications.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Datasets

Add Datasets introduced or used in this paper

Results from the Paper

Edit

Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods

Add Remove

Activation Regularization • Adam • Attention Dropout • AWD-LSTM • BERT • BPE • Dense Connections • Discriminative Fine-Tuning • DropConnect • Dropout • Embedding Dropout • GELU • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • LSTM • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • SentencePiece • Sigmoid Activation • Slanted Triangular Learning Rates • Softmax • Tanh Activation • Temporal Activation Regularization • ULMFiT • Variational Dropout • Weight Decay • Weight Tying • WordPiece • XLNet

Edit Social Preview

Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove