Community Research and Development Information Service - CORDIS

Scraping ontologies from file systems

Because SWAP (Semantic Web and Peer-to-Peer) is a system without any central repository, every participant must be able to contribute their own knowledge easily. Combining the structures extracted from a participant's personal computer with other background knowledge allows ontologies to be created, viz. “Emergent Semantics”.

Ontoscrape can currently extract structures from folders and files on MS Windows systems, emails and their structures from MS Outlook, addresses from the address book, and bookmark structures from MS Internet Explorer. Through an intuitive user interface, users choose which structures they would like to share. The selected knowledge sources are then integrated automatically into the knowledge repository of the local peer.

Although Ontoscrape is simple to use, several processes run in the background. It connects to the different systems, such as MS Outlook and Internet Explorer. Files are extracted together with attributes such as author, creation date, and format. For emails, topics and email addresses are extracted as well. Address book entries are saved with their most important attributes, such as name, email, and address, and favourite links are saved with their URL and title.
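The file-system part of this extraction can be illustrated with a minimal sketch. It is not Ontoscrape's actual implementation: the function name `scrape_folder`, the predicate names (`subFolderOf`, `locatedIn`, `format`, `modified`), and the use of plain tuples as statements are all assumptions made for illustration. The sketch records the modification time rather than the creation date, and omits the author, because neither of those is available portably from the Python standard library.

```python
import os
from datetime import datetime, timezone
from pathlib import Path


def scrape_folder(root):
    """Walk `root` and return (subject, predicate, object) statements.

    Hypothetical helper: the predicate names are illustrative only and
    do not come from Ontoscrape or the SWAP project.
    """
    statements = []
    for dirpath, dirnames, filenames in os.walk(root):
        folder = Path(dirpath)
        # The folder hierarchy itself becomes part of the structure.
        for name in sorted(dirnames):
            child = folder / name
            statements.append((child.as_posix(), "subFolderOf", folder.as_posix()))
        # Each file is described by its location and a few attributes.
        for name in sorted(filenames):
            path = folder / name
            subject = path.as_posix()
            file_format = path.suffix.lstrip(".").lower() or "unknown"
            # Modification time stands in for the creation date here.
            modified = datetime.fromtimestamp(path.stat().st_mtime, tz=timezone.utc)
            statements.append((subject, "locatedIn", folder.as_posix()))
            statements.append((subject, "format", file_format))
            statements.append((subject, "modified", modified.isoformat()))
    return statements
```

Calling `scrape_folder` on a shared folder yields a flat list of statements that a peer could then load into its local knowledge repository. How Ontoscrape actually represents and stores this knowledge is not described here.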

More information on the SWAP project can be found at:

Reported by

University of Karlsruhe
Institute AIFB
76128 Karlsruhe