Davisor Offisor is a Pure Java software component for transforming the content of popular but hard-to-read office documents into standard XML. The first two supported input formats are Microsoft Word™ and real-world HTML.
Davisor Offisor can be embedded into any Pure Java application. In particular, it does not require any proprietary external support software or services. When applied to HTML files, Davisor Offisor makes sense of even the worst real-life, unvalidated HTML documents, including those with common and serious syntax errors.