Citation Prediction

8 papers with code • 5 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Citation Prediction

Dataset	Best Model	Compare
PWC Citations	GAL 120B	See all
Extended Citations	GAL 120B	See all
Contextual Citations	GAL 120B	See all
SciDocs (BEIR)	monoT5-3B	See all
SciDocs (Citation Prediction)	SciNCL	See all

Datasets

BEIR
REFCAT

Most implemented papers

Most implemented Social Latest No code

SPECTER: Document-level Representation Learning using Citation-informed Transformers

allenai/specter • • ACL 2020

We propose SPECTER, a new method to generate document-level embedding of scientific documents based on pretraining a Transformer language model on a powerful signal of document-level relatedness: the citation graph.

Paper
Code

BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models

UKPLab/beir • • 17 Apr 2021

To address this, and to facilitate researchers to broadly evaluate the effectiveness of their models, we introduce Benchmarking-IR (BEIR), a robust and heterogeneous evaluation benchmark for information retrieval.

Paper
Code

Patent Citation Dynamics Modeling via Multi-Attention Recurrent Networks

TaoranJ/PC-RNN • • 22 May 2019

For the problem of patent citations, we observe that forecasting a patent's chain of citations benefits from not only the patent's history itself but also from the historical citations of assignees and inventors associated with that patent.

Paper
Code

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

malteos/scincl • • 14 Feb 2022

Learning scientific document representations can be substantially improved through contrastive learning objectives, where the challenge lies in creating positive and negative training samples that encode the desired similarity semantics.

Paper
Code

SGPT: GPT Sentence Embeddings for Semantic Search

muennighoff/sgpt • • 17 Feb 2022

To this end, we propose SGPT to use decoders for sentence embeddings and semantic search via prompting or fine-tuning.

Paper
Code

No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval

guilhermemr04/scaling-zero-shot-retrieval • 6 Jun 2022

This has made distilled and dense models, due to latency constraints, the go-to choice for deployment in real-world retrieval applications.

Paper
Code

Galactica: A Large Language Model for Science

paperswithcode/galai • • 16 Nov 2022

We believe these results demonstrate the potential for language models as a new interface for science.

Paper
Code

Encoding Multi-Domain Scientific Papers by Ensembling Multiple CLS Tokens

ronaldseoh/multi2spe • • 8 Sep 2023

Many useful tasks on scientific documents, such as topic classification and citation prediction, involve corpora that span multiple scientific domains.

Paper
Code

Citation Prediction

Benchmarks Add a Result

Datasets

Most implemented papers

Content

Benchmarks

Add a Result