Datalinks Wiki
Advertisement
Read the Web

Type

Dataset

Link

http://rtw.ml.cmu.edu/rtw/resources

Source

Ckan.net

This data includes facts extracted from 500 million web pages.

From the href="http://rtw.ml.cmu.edu/rtw/overview">project's website:

To build a never-ending machine learning system that acquires the ability to extract structured information from unstructured web pages. If successful, this will result in a knowledge base (i.e., a relational database) of structured information that mirrors the content of the Web. We call this system NELL (Never-Ending Language Learner).

Advertisement