Posted on 2009-10-01. Authored by Tom Mitchell, Justin Betteridge, Andrew Carlson, Estevan Hruschka, Richard Wang
A key question regarding the future of the semantic web is “how will we acquire structured information to populate the semantic web on a vast scale?” One approach is to enter this information manually. A second approach is to take advantage of pre-existing databases, and to develop common ontologies, publishing standards, and reward systems to make this data widely accessible. We consider here a third approach: developing software that automatically extracts structured information from unstructured text present on the web. We also describe preliminary results demonstrating that machine learning algorithms can learn to extract tens of thousands of facts to populate a diverse ontology, with imperfect but reasonably good accuracy.
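To make the third approach concrete, the following is a minimal, hypothetical sketch of pattern-based fact extraction — one simple way software can pull structured (category, instance) facts from free text. The `such as` pattern and the sample sentences are illustrative assumptions, not the paper's actual method or data.

```python
import re

# Hypothetical illustration: a toy "X such as Y" pattern extractor.
# Real systems learn many such patterns and score candidate facts;
# this sketch shows only the basic extraction step.
PATTERN = re.compile(r"(\w+) such as ([A-Z]\w+)")

def extract_facts(sentences):
    """Return (category, instance) pairs matched by the 'X such as Y' pattern."""
    facts = []
    for sentence in sentences:
        for category, instance in PATTERN.findall(sentence):
            facts.append((category.lower(), instance))
    return facts

sentences = [
    "He has climbed mountains such as Everest.",
    "Investors favor companies such as Google.",
]
print(extract_facts(sentences))  # [('mountains', 'Everest'), ('companies', 'Google')]
```

A learning-based system would start from a few seed facts, find new textual patterns that mention them, and use those patterns to extract further facts — iterating to grow both the pattern set and the fact base.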
Publisher Statement
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-04930-9_66