Data

In the spirit of collaboration, I am happy to host datasets related to my prior research.

RE 2013 Policy Document Corpus

The RE 2013 Policy Document Corpus consists of 2,061 Privacy Policies, Terms of Use, Terms and Conditions, and Terms of Service documents, and other policy documents. The policy documents in this repository were originally examined in the following research paper, which details the collection process:

Aaron K. Massey, Jacob Eisenstein, Annie I. Antón, and Peter P. Swire. “Automated Text Mining for Requirements Analysis of Policy Documents” 21st IEEE International Requirements Engineering Conference. Rio de Janeiro, Brazil, July 2013. DOI: 10.1109/RE.2013.6636700

This dataset is made available for research, teaching, and scholarship purposes only, with further parameters in the spirit of a Creative Commons Attribution-NonCommercial License. Please cite the above research paper for attribution purposes. For your convenience, you may simply download the BibTeX for the citation information.

Please contact me if you have any questions or need additional information.

Published Research

If you have published research using any of these datasets, please contact me. I would like to link to any research using any of these datasets.