Code Completion
104 papers with code • 6 benchmarks • 12 datasets
Most implemented papers
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Benchmark datasets have a significant impact on accelerating research in programming language tasks.
StarCoder 2 and The Stack v2: The Next Generation
Our large model, StarCoder2-15B, significantly outperforms other models of comparable size.
Open Vocabulary Learning on Source Code with a Graph-Structured Cache
Machine learning models that take computer program source code as input typically use Natural Language Processing (NLP) techniques.
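A core obstacle for NLP-style models of source code is the open vocabulary of identifiers. A common preprocessing step in this line of work is splitting identifiers into subtokens at snake_case and camelCase boundaries; the sketch below is an illustrative implementation of that idea, not code from the paper.

```python
import re

def split_identifier(name):
    """Split a source-code identifier into subtokens at
    snake_case and camelCase/PascalCase boundaries (illustrative only)."""
    parts = []
    for chunk in name.split("_"):
        # Break camelCase runs, keeping acronyms intact:
        # "parseHTTPResponse" -> parse, HTTP, Response
        parts.extend(re.findall(r"[A-Z]+(?![a-z])|[A-Z][a-z]*|[a-z]+|\d+", chunk))
    return [p.lower() for p in parts if p]

print(split_identifier("parseHTTPResponse_v2"))
# → ['parse', 'http', 'response', 'v', '2']
```

Subtokenization shrinks the effective vocabulary, but rare subtokens remain; approaches like the graph-structured cache go further by letting the model attend to vocabulary observed in the current context.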
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
In this paper, we introduce LongBench, the first bilingual, multi-task benchmark for long context understanding, enabling a more rigorous evaluation of long-context capabilities.
DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective Partitioning
Through extensive experimental analysis, we identified three key challenges in designing effective data management strategies that enable the model to achieve long-context capability without sacrificing performance in other tasks: (1) a shortage of long documents across multiple domains, (2) effective construction of context windows, and (3) efficient organization of large-scale datasets.
Structural Language Models of Code
We introduce a new approach to any-code completion that leverages the strict syntax of programming languages to model a code snippet as a tree: structural language modeling (SLM).
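Modeling a snippet as a tree means the model conditions on syntactic structure rather than a flat token sequence. As a rough illustration (using Python's standard `ast` module, not the paper's implementation), the same snippet a token-level model would see as a string decomposes into nested syntax nodes:

```python
import ast

code = "def add(a, b):\n    return a + b"
tree = ast.parse(code)

# Walk the AST and print each node type with indentation proportional
# to its depth, showing the tree a structural model conditions on.
def show(node, depth=0):
    print("  " * depth + type(node).__name__)
    for child in ast.iter_child_nodes(node):
        show(child, depth + 1)

show(tree)
```

A structural language model can then predict a missing subtree from the paths leading to it, rather than predicting left-to-right over tokens.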
Neural Software Analysis
The resulting tools complement and outperform traditional program analyses, and are used in industrial practice.
UniXcoder: Unified Cross-Modal Pre-training for Code Representation
Furthermore, we propose to utilize multi-modal contents to learn representation of code fragment with contrastive learning, and then align representations among programming languages using a cross-modal generation task.
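Contrastive learning of this kind pulls the representations of matched pairs (e.g., a code fragment and its description) together while pushing apart mismatched pairs, typically via an InfoNCE-style loss. The toy sketch below assumes a precomputed similarity matrix with positives on the diagonal; it is a generic illustration, not UniXcoder's training code.

```python
import math

def info_nce(sim_matrix, temperature=0.07):
    """Toy InfoNCE loss: sim_matrix[i][j] is the similarity between
    anchor i and candidate j; the diagonal holds the positive pairs."""
    n = len(sim_matrix)
    loss = 0.0
    for i in range(n):
        logits = [s / temperature for s in sim_matrix[i]]
        m = max(logits)  # subtract max for numerical stability
        denom = sum(math.exp(l - m) for l in logits)
        # Negative log-probability assigned to the positive pair (i, i)
        loss += -(logits[i] - m - math.log(denom))
    return loss / n

# Well-separated positives on the diagonal yield a low loss
print(info_nce([[1.0, 0.1], [0.1, 1.0]]))
```

The loss is minimized when each anchor is most similar to its own positive, which is what aligns representations across modalities and languages.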
Multi-lingual Evaluation of Code Generation Models
Using these benchmarks, we assess the performance of code generation models in a multi-lingual fashion, and observe the generalization ability of language models to out-of-domain languages, the advantages of multi-lingual models over mono-lingual ones, the ability of few-shot prompting to teach the model new languages, and zero-shot translation abilities even in mono-lingual settings.
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
Large Language Models (LLMs) are increasingly being integrated into various applications.