Topic: Open Source Java
Phoenix IE is an information extraction toolkit to parse information from any XML documents to arbitrary java objects.
The main purpose of Phoenix IE is to parse information from MS Word / OpenOffice documents based upon the layout (e.g. bold, paragraph breaks, ...). It is in production use as authoring environment for a medical training system d3web.Train (german).
Phoenix was developed to easily enrich existing content for use in the Semantic Web.Programming Language: Java