Dataset Notes - January 2021 - ML-Verse

Released: January 2021

Data source: GitHub

Compiler: compiler-2020-Python

Projects included

The dataset contains GitHub projects that list Python as their primary language.

Programming languages processed (stored as ASTs)

  • Python (any file with a .py extension)
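
As an illustration of the file selection rule above, the following is a minimal sketch of how .py files in a project could be collected and turned into ASTs. It uses Python's standard ast module as a stand-in; the parser actually used to build this dataset (compiler-2020-Python) may behave differently. The function name parse_python_files and the choice to skip unparsable files are assumptions made for this example, not part of the dataset tooling.

```python
import ast
from pathlib import Path


def parse_python_files(project_root):
    """Parse every .py file under project_root into an AST.

    Hypothetical helper: returns (trees, skipped), where trees maps each
    relative file path to an ast.Module and skipped lists the files that
    could not be read or parsed.
    """
    root = Path(project_root)
    trees = {}
    skipped = []
    # Select files purely by the .py extension, as described in the notes.
    for path in sorted(root.rglob("*.py")):
        if not path.is_file():
            continue
        rel = path.relative_to(root).as_posix()
        try:
            source = path.read_text(encoding="utf-8")
            trees[rel] = ast.parse(source, filename=str(path))
        except (SyntaxError, UnicodeDecodeError, ValueError):
            # Assumption: files that fail to decode or parse are skipped.
            skipped.append(rel)
    return trees, skipped
```

Calling parse_python_files("path/to/project") returns the parsed trees keyed by relative path, together with the list of files that were skipped. Note that ast.parse follows the grammar of the Python version running the script, so older Python 2 sources would likely end up in the skipped list.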

Known Bugs/Limitations

None known.