CW3 (CleanWWWW) v2.51


Es gibt noch keine deutsche version verfügbar.

Contents of this document


1 - What is CW3

CW3 (the former WW2HTM program) is a converter. It convertes the (so-called) HTML output of various Windows programs like Winword, Frontpage and Office 97 into true, resolution, hard- & sofware independed HTML. The output of CW3 is in most cases smaller then the input - good for loading times and server performance.

This is an BETA version and not quite ready. It is not a port, but a complete rewrite from scratch.

Please read the Licence before using this program. This BETA is full functional, except that it doesnt do much cleaning/checking yet. I concentrated on the core engine of processing a tree of files and to split HTML files into TAG's and TEXT's. This is ready, and plugging in some checking rules is now easy, but had to wait for the final version.

There is not much documentation yet. Here is a list of all Warnings and Errors encountered by CW3.

In the subdirectoy TESTCASE are some files that try to stress CW3 and to show what it can and what not yet.

The files in the CFG directory are not read or used yet.

The latest version can be found on hobbes or simtel-net.


2 - Input format

CW3 takes the HTML files generated by the various programs or existing HTML code. It works great on entire sites or homepages.

Have a look at the testcase file and see what CW3 outputs.


21 - Changes made by CW3 to the HTML source code

CW3 changes the HTML source code of the input file, but tries to preserve the final look of the document in the browser.
You can refer to the TESTCASE documents to seewhat CW3 realy does.
Here is a general description of the working manner of CW3:

3 - Known limitations and bugs

4 - ToDo list

So much needs to be done :-)

5 - History Revision

View here the History Revision


6 - Contact the author

You can either visit my web page at www.pobox.com/~tels or send me a (PGP 2.6.3 encrypted prefered) mail.


[06/21/97] Tels. Last update: End Of Document.