GooDiff datasets





GooDiff is a consumer-oriented service that tracks changes to important documents published by selected Internet service providers and, indirectly, changes to the services those documents describe. GooDiff is free software, and the datasets of collected legal documents are provided in git bundle format. There are two kinds of dataset:

    The converted and cleaned legal documents (ToS, EULA, ...) from the monitored service providers, in text format

    The raw HTML pages of the legal documents, mirrored from the monitored service providers
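
Because the datasets are distributed as git bundles, a downloaded bundle can be cloned like an ordinary repository and then inspected with the usual git commands. The following is a minimal Python sketch of that workflow, not an official GooDiff tool. It assumes git is installed and on the PATH; the function names and the file names used in the usage note are hypothetical and do not come from the GooDiff site.

```python
import subprocess


def bundle_commands(bundle_path, dest_dir, history_limit=5):
    """Build the git commands (as argv lists) used to unpack a bundle.

    The first command clones the bundle file into dest_dir; the second
    lists the most recent commits of the resulting repository.
    """
    return [
        ["git", "clone", str(bundle_path), str(dest_dir)],
        ["git", "-C", str(dest_dir), "log", "--oneline", "-n", str(history_limit)],
    ]


def unpack_bundle(bundle_path, dest_dir, history_limit=5):
    """Run the commands in order, stopping at the first failure."""
    for argv in bundle_commands(bundle_path, dest_dir, history_limit):
        # check=True raises CalledProcessError if git exits with an error.
        subprocess.run(argv, check=True)
```

For example, `unpack_bundle("goodiff-text.bundle", "goodiff-text")` would clone a bundle saved under that (hypothetical) name into a `goodiff-text` directory and print its latest commits. From there, `git log -p` on a single document shows how its wording changed over time.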

GooDiff is free software. The content itself is another story, but assuming these are legal documents that the public should know about, verbatim copying should be allowed (IANAL).