Word2cleanhtml cleans up HTML pasted from Word documents. It applies a series of filters to fix the Office-specific markup that Microsoft Office puts in its HTML and gives you a well-formatted result that you can paste directly into a web page or content management system.
Is it private?
The conversion process is completely automated. I don’t see your document, and no copy of it is kept.
The only exception is if you file a bug report and choose to include a copy of your document. In that case, your document will be emailed to me along with the bug report.
How does it work?
It uses the Python programming language to manipulate the HTML produced by Microsoft Word. The lxml library does most of the work.
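To give a rough idea of what such a filter pass can look like, here is a minimal sketch using lxml. It is an illustration only, not the code this site runs: the sample markup, the function name `clean_word_html`, and the choice of which tags and attributes to strip are all assumptions made for the example.

```python
from lxml import etree, html

# Hypothetical sample of the kind of markup Word produces.
WORD_HTML = (
    '<div>'
    '<p class="MsoNormal" style="margin:0cm">'
    '<span lang="EN-GB">Hello <b>world</b></span>'
    '</p>'
    '<!--[if gte mso 9]><xml>junk</xml><![endif]-->'
    '</div>'
)


def clean_word_html(raw):
    """Apply a few simple filters to Word-style HTML (illustrative only)."""
    root = html.fromstring(raw)

    # Filter 1: drop comments, including Office conditional comments.
    # with_tail=False keeps any text that follows a removed comment.
    etree.strip_elements(root, etree.Comment, with_tail=False)

    # Filter 2: unwrap <span> elements, keeping their text and children.
    etree.strip_tags(root, 'span')

    # Filter 3: remove presentational attributes from every element.
    etree.strip_attributes(root, 'style', 'class', 'lang')

    return html.tostring(root, encoding='unicode')


if __name__ == '__main__':
    print(clean_word_html(WORD_HTML))
```

Each filter is a separate pass over the parsed tree, so more rules can be added without touching the existing ones. A real cleaner would need many more filters than these three.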