Convert Word documents to Clean HTML

Word2cleanhtml cleans up HTML pasted from Word documents. It applies a series of filters that clean up the extra markup Microsoft Office puts in its HTML, and gives you a well-formatted result that you can paste directly into a web page or content editing system.

Is it private?

The conversion process is completely automated. I never see your document, and no copy of it is kept.

The only exception is when you file a bug report and choose to include a copy of your document. In that case, the document is emailed to me along with the bug report.

How does it work?

It uses the Python programming language to manipulate the HTML produced by Microsoft Word. The lxml library does most of the work.
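
To give a rough idea of what such a filter might look like, here is a minimal sketch. It is not the code behind Word2cleanhtml: the real tool relies on lxml, while this example deliberately uses only Python's standard-library html.parser so that it runs without installing anything. The names WordHTMLCleaner and clean_word_html, and the particular lists of tags and attributes to drop or keep, are assumptions made purely for illustration.

```python
from html import escape
from html.parser import HTMLParser

# Illustrative choices only, not the tool's real rule set.
UNWRAP = {"span", "font"}            # drop the tag, keep the text inside
SKIP_CONTENT = {"style", "script"}   # drop the tag and everything inside it
KEEP_ATTRS = {"href", "src", "alt", "title", "colspan", "rowspan"}
VOID = {"br", "hr", "img"}           # elements that never get a closing tag


class WordHTMLCleaner(HTMLParser):
    """Hypothetical filter that re-emits HTML without Word-specific markup."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0

    def _dropped(self, tag):
        # Office namespaced tags look like <o:p> or <v:shape>.
        return ":" in tag or tag in UNWRAP

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_CONTENT:
            self.skip_depth += 1
            return
        if self.skip_depth or self._dropped(tag):
            return
        # Keep only allowlisted attributes; class, style, lang etc. are dropped.
        kept = "".join(
            f' {name}="{escape(value or "", quote=True)}"'
            for name, value in attrs
            if name in KEEP_ATTRS
        )
        self.out.append(f"<{tag}{kept}>")

    def handle_endtag(self, tag):
        if tag in SKIP_CONTENT:
            self.skip_depth = max(0, self.skip_depth - 1)
            return
        if self.skip_depth or self._dropped(tag) or tag in VOID:
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            # Text arrives unescaped, so escape it again on the way out.
            self.out.append(escape(data, quote=False))

    # Comments, including Word's <!--[if gte mso 9]> blocks, are discarded
    # because HTMLParser.handle_comment does nothing by default.


def clean_word_html(fragment):
    """Return `fragment` with Word-specific markup removed."""
    parser = WordHTMLCleaner()
    parser.feed(fragment)
    parser.close()
    return "".join(parser.out)


sample = (
    '<p class="MsoNormal" style="margin:0cm">'
    '<span lang="EN-GB">Hello <b>world</b></span><o:p></o:p></p>'
    "<!--[if gte mso 9]><xml></xml><![endif]-->"
)
print(clean_word_html(sample))  # <p>Hello <b>world</b></p>
```

An allowlist of attributes is used here rather than a blocklist because Word produces a large and varied set of mso-* styles and classes; keeping only what is known to be wanted is the simpler rule to state. A tree-based library such as lxml makes more structural fixes possible than this streaming approach.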

