Supplementary Material - Mining Billions of AST Nodes to Study Actual and Potential Usage of Java Language Features

by Robert Dyer, Hridesh Rajan, Hoan Anh Nguyen, and Tien N. Nguyen

Table of Contents


In this work, we used the Boa language and infrastructure to answer our research questions. Boa's website provides an interface for submitting queries in the form of Boa programs and selecting a dataset to query.

In this document we provide all Boa source code used to generate the data for this study. We also provide the output of running these Boa programs, which is the raw results for this study. These results were further processed to present them in the paper in a form easier to digest. The Java programs used to process the raw data and generate the tables and charts in the paper are also provided.

We provide these programs and data as an archival supplement to the paper, to allow researchers to understand our methodology and, if they wish, use the data to investigate their own research questions.

To generate the LaTeX tables used in the paper, you can run the following command:


These commands assume the bash, java, and javac commands are in your path.

If you wish to cite this work, please use the following:

  author = {Dyer, Robert and Rajan, Hridesh and Nguyen, Hoan Anh and Nguyen, Tien N.},
  title = {Mining Billions of {AST} Nodes to Study Actual and Potential Usage of {Java} Language Features},
  booktitle = {Proceedings of the 36th International Conference on Software Engineering},
  year = {2014},
  pages = {779--790},
  series = {{ICSE}'14},
  location = {Hyderabad, India},


Download this supplemental material (74MB)

Read the study's conference paper

License for source code and data files

Last modified: $Id: index.html,v 1.9 2014/05/09 00:44:10 rdyer Exp $