Code Search

49 papers with code • 5 benchmarks • 10 datasets

The goal of Code Search is to retrieve code fragments from a large code corpus that most closely match a developer’s intent, which is expressed in natural language.

Source: When Deep Learning Met Code Search

Benchmarks

Add a Result

These leaderboards are used to track progress in Code Search

Dataset	Best Model	Compare
CodeSearchNet	cpt-code M	See all
CoDesc	Self-attention	See all
CodeXGLUE - AdvTest	CodeT5+ 770M	See all
CodeSearchNet - Ruby	Uni-SBT	See all
CodeXGLUE - WebQueryTest	CodeBERT	See all

Libraries

Use these libraries to find Code Search models and implementations

microsoft/CodeBERT

5 papers

1,976

facebookresearch/CodeGen

2 papers

673

Datasets

Subtasks

Annotated Code Search

Latest papers with no code

Most implemented Social Latest No code

Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search

no code yet • 9 Jan 2024

In code search, the Generation-Augmented Retrieval (GAR) framework, which generates exemplar code snippets to augment queries, has emerged as a promising strategy to address the principal challenge of modality misalignment between code snippets and natural language queries, particularly with the demonstrated code generation capabilities of Large Language Models (LLMs).

Paper
Add Code

Code Search Debiasing:Improve Search Results beyond Overall Ranking Performance

no code yet • 25 Nov 2023

To mitigate biases, we develop a general debiasing framework that employs reranking to calibrate search results.

Paper
Add Code

GenCodeSearchNet: A Benchmark Test Suite for Evaluating Generalization in Programming Language Understanding

no code yet • 16 Nov 2023

Language models can serve as a valuable tool for software developers to increase productivity.

Paper
Add Code

Noisy Pair Corrector for Dense Retrieval

no code yet • 7 Nov 2023

Most dense retrieval models contain an implicit assumption: the training query-document pairs are exactly matched.

Paper
Add Code

Contrastive Prompt Learning-based Code Search based on Interaction Matrix

no code yet • 10 Oct 2023

However, existing code search methods still suffer from two performance constraints: inadequate semantic representation and the semantic gap between natural language (NL) and programming language (PL).

Paper
Add Code

Code Representation Pre-training with Complements from Program Executions

no code yet • 4 Sep 2023

The test cases are obtained with the assistance of a customized fuzzer and are only required during pre-training.

Paper
Add Code

Laminar: A New Serverless Stream-based Framework with Semantic Code Search and Code Completion

no code yet • 1 Sep 2023

This paper introduces Laminar, a novel serverless framework based on dispel4py, a parallel stream-based dataflow library.

Paper
Add Code

Evaluating and Optimizing the Effectiveness of Neural Machine Translation in Supporting Code Retrieval Models: A Study on the CAT Benchmark

no code yet • 9 Aug 2023

Our NMT models of learning ASTTrans Representation can boost the Mean Reciprocal Rank of these state-of-the-art code search processes by up to 3. 08% and improve 23. 08% of queries' results over the CAT benchmark.

Paper
Add Code

CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search

no code yet • 19 May 2023

We consider the clone detection and information retrieval problems for source code, well-known tasks important for any programming language.

Paper
Add Code

Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets

no code yet • 19 May 2023

Code search is an important task that has seen many developments in recent years.

Paper
Add Code

Code Search

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result