Cleans up the otherwise-ugly HTML that Microsoft Word generates when you use the File -> Save As Web Page option, and optionally includes Twitter Bootstrap. It relies on WordPress's texturize engine and internal encoding functions to do the heavy lifting, and converts footnotes into a format compatible with Bootstrap's tooltips.
- Normalizes encoding and line endings
- Removes unnecessary tags and attributes, such as inline CSS and extraneous IDs
- Cleans up encoding problems and HTML entities
- Balances tags to ensure valid HTML
- Typesets everything via the Texturize engine
- Converts Word's footnotes into a format compatible with Bootstrap's tooltips
- Strips HTML comments
- Removes consecutive spaces
- Removes empty paragraph tags
- Removes the otherwise-hard word wrap
- Converts `<b>`s to `<strong>`s and `<i>`s to `<em>`s
- Optionally includes a stand-alone version of Twitter Bootstrap
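
To make a few of the cleanup steps above concrete, here is a rough, illustrative sketch in Python. It is not the plugin's code: the plugin itself is PHP and delegates most of this work to WordPress's texturize and kses engines. The function name `clean_word_html` and the regular expressions are assumptions made purely for illustration, and regex-based HTML handling like this is only suitable for simple input.

```python
import re

# Hypothetical pattern: matches style/class/id/lang attributes inside a tag.
_ATTR = re.compile(
    r"""\s+(?:style|class|id|lang)\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>]+)""",
    re.IGNORECASE,
)


def _strip_attrs(match):
    """Remove the unwanted attributes from a single opening tag."""
    return _ATTR.sub("", match.group(0))


def clean_word_html(html):
    """Apply a handful of the listed cleanup steps to an HTML fragment."""
    # Normalize line endings.
    html = html.replace("\r\n", "\n").replace("\r", "\n")
    # Strip HTML comments.
    html = re.sub(r"<!--.*?-->", "", html, flags=re.DOTALL)
    # Remove inline CSS and extraneous IDs/classes from opening tags.
    html = re.sub(r"<[a-zA-Z][^>]*>", _strip_attrs, html)
    # Convert <b> to <strong> and <i> to <em>.
    html = re.sub(r"<(/?)b\s*>", r"<\1strong>", html, flags=re.IGNORECASE)
    html = re.sub(r"<(/?)i\s*>", r"<\1em>", html, flags=re.IGNORECASE)
    # Undo the hard word wrap and collapse consecutive whitespace.
    html = re.sub(r"\s+", " ", html)
    # Remove empty paragraph tags.
    html = re.sub(r"<p>(?:\s|&nbsp;)*</p>", "", html, flags=re.IGNORECASE)
    return html.strip()


sample = (
    '<p class=MsoNormal style="margin:0">Hello <b>bold</b>\r\n'
    "and <i>italic</i>  text</p><!-- comment --><p>&nbsp;</p>"
)
print(clean_word_html(sample))
# prints: <p>Hello <strong>bold</strong> and <em>italic</em> text</p>
```

Tag balancing, typesetting, and footnote conversion are left out of this sketch, since they depend on WordPress and Bootstrap behavior that a few regular expressions cannot reproduce faithfully.
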
- LAMP Server
- WordPress (uses texturize and kses engines)
- Place the plugin within the server's web root, and adjust the path to `wp-load.php` in the project's `index.php`
- Open the target file in Microsoft Word
- Go to "File" -> "Save As Web Page"
- Save the file someplace convenient
- Open `index.php` in your favorite web browser and select the file
- The script will return the cleaned-up file as a download
Licensed under the GNU General Public License Version 3 or later.