Clone Detection
25 papers with code • 2 benchmarks • 1 dataset
Most implemented papers
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation
We present CodeT5, a unified pre-trained encoder-decoder Transformer model that better leverages the code semantics conveyed by developer-assigned identifiers.
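Part of CodeT5's identifier-aware pre-training is predicting masked identifier names from the surrounding code. Below is a minimal sketch of what such a masked-identifier example could look like for a Python snippet using the standard ast module; the T5-style sentinel tokens and the restriction to ast.Name identifiers are simplifying assumptions, not the paper's exact data pipeline.

```python
# Minimal sketch of a masked-identifier-prediction example (not the official
# CodeT5 pipeline). Sentinel naming and the ast.Name-only identifier set are
# illustrative assumptions.
import ast
import re

def mask_identifiers(code: str):
    """Replace each distinct identifier with a sentinel; return (input, target)."""
    tree = ast.parse(code)
    names = sorted({n.id for n in ast.walk(tree) if isinstance(n, ast.Name)})
    masked, target = code, []
    for i, name in enumerate(names):
        sentinel = f"<extra_id_{i}>"                              # T5-style sentinel (assumed)
        masked = re.sub(rf"\b{re.escape(name)}\b", sentinel, masked)
        target.append(f"{sentinel} {name}")
    return masked, " ".join(target)

print(mask_identifiers("def add(a, b):\n    return a + b"))
# ('def add(<extra_id_0>, <extra_id_1>):\n    return <extra_id_0> + <extra_id_1>',
#  '<extra_id_0> a <extra_id_1> b')
```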
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Benchmark datasets have a significant impact on accelerating research in programming language tasks.
Unified Pre-training for Program Understanding and Generation
Experiments on code summarization in the English language, code generation, and code translation in seven programming languages show that PLBART outperforms or rivals state-of-the-art models.
Trained Without My Consent: Detecting Code Inclusion In Language Models Trained on Code
Auditing code developed using LLMs is therefore challenging: it is difficult to reliably determine whether an LLM used during development was trained on specific copyrighted code, since we do not have access to these models' training datasets.
Detecting Code Clones with Graph Neural Network and Flow-Augmented Abstract Syntax Tree
To the best of our knowledge, we are the first to apply graph neural networks to code clone detection.
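The approach embeds each flow-augmented AST with a graph neural network and scores a candidate pair by the similarity of the two graph embeddings. The snippet below is a toy message-passing sketch of that idea in plain PyTorch, not the paper's FA-AST construction or GGNN/GMN models; node features, adjacency, and dimensions are illustrative assumptions.

```python
# Toy message-passing sketch (not the paper's FA-AST/GGNN): embed two small code
# graphs with one shared GNN and score the pair by cosine similarity.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyGNN(nn.Module):
    def __init__(self, dim: int = 32, layers: int = 2):
        super().__init__()
        self.layers = nn.ModuleList(nn.Linear(dim, dim) for _ in range(layers))

    def forward(self, x: torch.Tensor, adj: torch.Tensor) -> torch.Tensor:
        # x: (n, dim) node features; adj: (n, n) adjacency with self-loops
        deg = adj.sum(dim=1, keepdim=True).clamp(min=1)
        for lin in self.layers:
            x = F.relu(lin(adj @ x / deg))   # mean-aggregate neighbours, then transform
        return x.mean(dim=0)                 # mean-pool nodes into one graph embedding

def clone_score(gnn, g1, g2):
    return F.cosine_similarity(gnn(*g1), gnn(*g2), dim=0)

gnn = TinyGNN()
g = (torch.randn(5, 32), torch.eye(5))       # stand-in for a flow-augmented AST
print(clone_score(gnn, g, g).item())         # identical graphs -> similarity ~1.0
```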
Contrastive Code Representation Learning
Recent work learns contextual representations of source code by reconstructing tokens from their context.
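In contrast, contrastive code representation learning pulls together embeddings of semantically equivalent views of a program and pushes apart embeddings of different programs. Below is a minimal InfoNCE sketch of that training signal; the encoder, augmentations, and temperature are assumptions for illustration, not the ContraCode implementation.

```python
# Minimal InfoNCE sketch (not the ContraCode implementation): two augmented views
# of the same program are positives (the diagonal); other programs in the batch
# are negatives.
import torch
import torch.nn.functional as F

def info_nce(z1: torch.Tensor, z2: torch.Tensor, temperature: float = 0.07):
    """z1, z2: (batch, dim) embeddings of two views of the same programs."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature        # (batch, batch) similarity matrix
    labels = torch.arange(z1.size(0))         # positives sit on the diagonal
    return F.cross_entropy(logits, labels)

batch, dim = 8, 128
loss = info_nce(torch.randn(batch, dim), torch.randn(batch, dim))
print(loss.item())
```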
GraphCodeBERT: Pre-training Code Representations with Data Flow
Instead of taking the syntactic-level structure of code, such as the abstract syntax tree (AST), we use data flow in the pre-training stage, a semantic-level structure of code that encodes the "where-the-value-comes-from" relation between variables.
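Such a data-flow graph links each variable use back to the assignment its value comes from. The snippet below is a simplified, Python-only illustration of extracting those edges with the standard ast module; GraphCodeBERT itself builds data flow over multiple languages with a different toolchain, so this is an assumption-laden sketch rather than its actual preprocessing.

```python
# Simplified illustration (not GraphCodeBERT's preprocessing): collect
# "where-the-value-comes-from" edges by linking each variable read to its most
# recent assignment, walking Name nodes in source order.
import ast

def data_flow_edges(code: str):
    nodes = sorted(
        (n for n in ast.walk(ast.parse(code)) if isinstance(n, ast.Name)),
        key=lambda n: (n.lineno, n.col_offset),
    )
    last_def = {}   # variable name -> line of its latest assignment
    edges = []      # (variable, use_line, def_line)
    for node in nodes:
        if isinstance(node.ctx, ast.Store):
            last_def[node.id] = node.lineno
        elif isinstance(node.ctx, ast.Load) and node.id in last_def:
            edges.append((node.id, node.lineno, last_def[node.id]))
    return edges

print(data_flow_edges("x = 1\ny = x + 2\nz = y * x"))
# [('x', 2, 1), ('y', 3, 2), ('x', 3, 1)]
```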
Bridging Pre-trained Models and Downstream Tasks for Source Code Understanding
In this paper, we propose an approach to bridge pre-trained models and code-related tasks.
Learning Program Semantics with Code Representations: An Empirical Study
However, a comprehensive and systematic study evaluating different program representation techniques across diverse tasks is still missing.
On The Cross-Modal Transfer from Natural Language to Code through Adapter Modules
Although adapters are known to ease adaptation to many downstream tasks compared to full fine-tuning, which requires retraining all of a model's parameters -- owing to their plug-and-play nature and parameter efficiency -- their use in software engineering has not been explored.
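For reference, a typical bottleneck adapter is a small down-project/up-project block with a residual connection, inserted into an otherwise frozen pre-trained transformer. The sketch below follows that Houlsby-style design in general terms; the hidden and bottleneck sizes are illustrative assumptions, not the paper's configuration.

```python
# Minimal bottleneck-adapter sketch (Houlsby-style, not the paper's exact setup):
# only the adapter parameters (plus, typically, layer norms and the task head)
# are trained while the pre-trained backbone stays frozen.
import torch
import torch.nn as nn

class Adapter(nn.Module):
    def __init__(self, hidden: int = 768, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden, bottleneck)
        self.up = nn.Linear(bottleneck, hidden)
        self.act = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.up(self.act(self.down(x)))   # residual keeps the original signal

adapter = Adapter()
hidden_states = torch.randn(2, 16, 768)               # (batch, seq_len, hidden)
print(adapter(hidden_states).shape)                    # torch.Size([2, 16, 768])
```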