WebArchiveX Converter



How to use WebArchiveX Converter

A web archive is a single file (extension MHT) that, unlike a regular HTML file, embeds all the resources the page needs, such as frames, images, style sheets and scripts. The format (MHTML) is an Internet standard for sending HTML documents.
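
To make the single-file idea concrete, here is a minimal Python sketch of how a page and its resources can be packed into one multipart/related MIME message, which is the general shape of an MHT file. It illustrates the format only and is not how WebArchiveX itself is implemented. The names build_archive and list_parts, and the root name "index.html", are assumptions made for this example.

```python
from email import encoders, message_from_bytes
from email.mime.base import MIMEBase
from email.mime.multipart import MIMEMultipart
from email.mime.text import MIMEText


def build_archive(html: str, resources: dict[str, tuple[str, bytes]]) -> bytes:
    """Pack an HTML page and its resources into one multipart/related message.

    `resources` maps the location used in the page to (MIME type, raw bytes).
    """
    archive = MIMEMultipart("related", type="text/html")

    # The first part is the page itself.
    root = MIMEText(html, "html", "utf-8")
    root["Content-Location"] = "index.html"  # assumed name for the root page
    archive.attach(root)

    # Every resource becomes its own base64-encoded part.
    for location, (mime_type, data) in resources.items():
        maintype, subtype = mime_type.split("/", 1)
        part = MIMEBase(maintype, subtype)
        part.set_payload(data)
        encoders.encode_base64(part)
        part["Content-Location"] = location
        archive.attach(part)

    return archive.as_bytes()


def list_parts(raw: bytes) -> list[tuple[str, str]]:
    """Return (Content-Location, content type) for each part of an archive."""
    message = message_from_bytes(raw)
    return [(p["Content-Location"], p.get_content_type())
            for p in message.get_payload()]
```

Calling build_archive with the page text and a dictionary such as {"logo.png": ("image/png", data)} returns bytes that could be written to a file with the MHT extension. list_parts then shows what the archive contains.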

Basic workflow:

  1. Choose the input file (extension HTML) that you want to convert to a web archive file.
  2. Choose where to store the output web archive (file with extension MHT).
  3. Click the "Create Archive" button, and that's it!

Advanced workflow:

Open the "Options" dialog by clicking the "Set Options" button.
  1. You can enable or disable logging to a file of your choice and set the logging level (Only errors, Only errors and warnings, or Everything).
  2. You can add, change or delete resource tags. WebArchiveX scans the HTML for these tags and includes the content they reference in the output web archive file.
  3. You can add, change or delete MIME types. WebArchiveX scans files of these types and includes their content in the output web archive file.
  4. You can add, change or delete script types. WebArchiveX scans files of these types for dynamically loaded images and includes the detected images in the output web archive file.
Now you can create web archives as described in the basic workflow.
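
As an illustration of what options 2 and 4 control, the following Python sketch shows one way that configurable resource tags and script scanning could work. It is a simplified sketch under stated assumptions, not the actual WebArchiveX algorithm. The tag table RESOURCE_TAGS, the function names and the regular expression are all hypothetical.

```python
import re
from html.parser import HTMLParser

# Hypothetical resource tag table: tag name -> attribute that holds the reference.
RESOURCE_TAGS = {"img": "src", "script": "src", "link": "href", "frame": "src"}

# Heuristic for dynamically loaded images: an assignment such as  pic.src = "a.gif"
IMAGE_SRC = re.compile(r"""\.src\s*=\s*(["'])(.+?)\1""")


class ResourceScanner(HTMLParser):
    """Collect the references found in the configured resource tags."""

    def __init__(self, resource_tags):
        super().__init__()
        self.resource_tags = resource_tags
        self.found = []

    def handle_starttag(self, tag, attrs):
        wanted = self.resource_tags.get(tag)
        if wanted is None:
            return  # not a resource tag, ignore it
        for name, value in attrs:
            if name == wanted and value:
                self.found.append(value)


def find_resources(html, resource_tags=RESOURCE_TAGS):
    """Return the references in `html`, in document order."""
    scanner = ResourceScanner(resource_tags)
    scanner.feed(html)
    scanner.close()
    return scanner.found


def find_dynamic_images(script):
    """Return the image locations assigned to a .src property in `script`."""
    return [match.group(2) for match in IMAGE_SRC.finditer(script)]
```

Adding an entry to the table, for example {"a": "href"}, makes the scanner follow that tag as well, which mirrors adding a custom resource tag in the dialog.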



About Web Archives and WebArchiveX

What it does

The WebArchiveX Converter is based on WebArchiveX technology, which allows fast and precise conversion of one or more HTML files into a single web archive file (similar to the "Save as Web Archive" feature of MS Internet Explorer 5 or later).

Why do I need it

Packing Web pages with the WebArchiveX Converter avoids errors such as missing images, style sheets or scripts when you redistribute your Web pages. With WebArchiveX, you can easily send a Web page as a single file via e-mail or ASP scripts, or save it for offline viewing. You can also unpack web archives using Internet Explorer 5.0 or later.

Reasons to prefer WebArchiveX over MS Internet Explorer

  • WebArchiveX detects and processes dynamically loaded images (Image objects).
  • WebArchiveX supports custom resource tags, which allow it to detect references to other documents.
  • WebArchiveX allows the use of unknown or custom MIME types.
  • WebArchiveX is completely standalone and needs no third-party software.
  • By contrast, IE requires Outlook Express to enable its "Save as Web Archive" feature.
  • The next version (v3.0) will support downloading complete web sites into a single web archive file (so-called "spidering").
  • WebArchiveX is also available as an ActiveX (COM) DLL for integration into other platforms.

WebArchiveX ActiveX fully supports:

  • DHTML
  • Different MIME types
  • External and internal frames
  • External and internal scripts
  • External and internal style sheets
  • Multi-threaded environments
  • All programming languages that support COM
  • All character sets

Acknowledgments

We would like to thank the following people for both their technical review and the time they spent on testing and benchmarking:
  • Meiron Cohen - our first beta site
  • Jim Ryan
  • Jean Lucovsky
  • Mark Brian

C Systems - Creative software solutions since 1996. All rights reserved.