Citation Prediction

8 papers with code • 5 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

SPECTER: Document-level Representation Learning using Citation-informed Transformers

allenai/specter ACL 2020

We propose SPECTER, a new method to generate document-level embedding of scientific documents based on pretraining a Transformer language model on a powerful signal of document-level relatedness: the citation graph.

BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models

UKPLab/beir 17 Apr 2021

To address this, and to facilitate researchers to broadly evaluate the effectiveness of their models, we introduce Benchmarking-IR (BEIR), a robust and heterogeneous evaluation benchmark for information retrieval.

Patent Citation Dynamics Modeling via Multi-Attention Recurrent Networks

TaoranJ/PC-RNN 22 May 2019

For the problem of patent citations, we observe that forecasting a patent's chain of citations benefits from not only the patent's history itself but also from the historical citations of assignees and inventors associated with that patent.

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

malteos/scincl 14 Feb 2022

Learning scientific document representations can be substantially improved through contrastive learning objectives, where the challenge lies in creating positive and negative training samples that encode the desired similarity semantics.

SGPT: GPT Sentence Embeddings for Semantic Search

muennighoff/sgpt 17 Feb 2022

To this end, we propose SGPT to use decoders for sentence embeddings and semantic search via prompting or fine-tuning.

No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval

guilhermemr04/scaling-zero-shot-retrieval 6 Jun 2022

This has made distilled and dense models, due to latency constraints, the go-to choice for deployment in real-world retrieval applications.

Galactica: A Large Language Model for Science

paperswithcode/galai 16 Nov 2022

We believe these results demonstrate the potential for language models as a new interface for science.

Encoding Multi-Domain Scientific Papers by Ensembling Multiple CLS Tokens

ronaldseoh/multi2spe 8 Sep 2023

Many useful tasks on scientific documents, such as topic classification and citation prediction, involve corpora that span multiple scientific domains.