CW3 (CleanWWW) - Documentation
Es gibt noch keine deutsche version!
Contents of this document
1 - Changes made by CW3 to the HTML source code
CW3 changes the HTML source code of the input file, but
tries to preserve the final look of the document in the browser.
You can refer to the TESTCASE documents to seewhat CW3 really does.
Here is a general description of the working manner of CW3:
- CW3 reads the file in and splits it into TAGS and TEXTS.
- The TAGS are separeted into NAME and PARAMETERS
- CW3 combines TAGS that are the same and have different PARAMETERS
if the are not seperated by TEXT. (like <FONT COLOR=RED><FONT SIZE=1>
</FONT</FONT> will become <FONT COLOR=RED SIZE=1></FONT> )
- CW3 combines TAG's,statements like in <FONT COLOR=RED>This is red</FONT><BR><FONT COLOR=RED>
this is red, too</FONT> will become
<FONT COLOR=RED>This is red<BR>this is red, too</FONT>
- CW3 removes then senseless TAG statements (like <B></B>)
- CW3 replaces/removes then TAGS and TEXT as specified by a list of RULES (rules.txt).
- CW3 checks for the proper structure of the document and the places of some
important TAGS (like HTML, HEAD, BODY etc)
- CW3 extracts all LINKS from the file.
- CW3 rebuilds the TAGS from their name and list of parameters, and writes
the TAGS and the TEXT back to the output file.
2 - Known limitations and bugs
Please refer to the Troubleshooting section.
3 - ToDo list
- Showing an index of the HTML tree, letting the user changing files in it.
- Checking structure of HTML documents (HTML, HEAD, BODY etc)
- Checking length of title, size of images
- Converting all comments to inline text so that you can see what is hidden
in your documents.
- Removing all comments for keeping your privacy on the net.
- Implementing the other rules
- Implementing index generation
- Implementing the FONT FACE checks
So much needs to be done :-)
4 - History Revision
View here the History Revision
[06/21/97] Tels. Last update:
End Of Document.