ETH Py150 Open

Introduced by Kanade et al. in Learning and Evaluating Contextual Embedding of Source Code

A massive, deduplicated corpus of 7.4M Python files from GitHub.

Source: Learning and Evaluating Contextual Embedding of Source Code

Papers


Paper Code Results Date Stars

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages