The RDF Topicmaps project offers software that extracts noun phrases and simple thesaurus-like relationships from web pages and packages them in RDF. The distribution is comprised of the following components:
- web page harvester, which also extracts metadata from pages and normalizes their content
- topicmap generator, using the normalized content as input
- web based interface to the topicmaps
Please see the RDF Topicmaps Project Page for more information.
As of 2006 we are issuing software under the Apache License, Version 2.0.
If you would like to use this software under the Apache license, please contact us and we may be able to update the software to use the Apache license.