Docvert is a web application which takes word processor files (typically .doc) and converts them to OpenDocument and clean HTML.
The resulting OpenDocument is then optionally converted to HTML or any XML. This is done with XML Pipelines, an approach that supports XSLT, breaking up content over headings or sections, and saving those results to multiple files (e.g., chapter1.html, chapter2.html, ...). The result is returned in a .zip file.
Docvert has a user-friendly interface, and it's easy to integrate with other software as it uses a simple REST-style interface. The XML produced is easier to understand and more structured than the WordML or .DOC formats.