And this is a second.
HTML uses markup tags to tell the Web browser how to display the text. The above example uses: The
paragraph-separator tag. HTML tags consist of a left angle bracket (<), (a ``less than'' symbol to mathematicians), followed by name of the tag and closed by a right angular bracket (>). Tags are usually paired, e.g.
tag. There is no such thing as
. NOTE: HTML is not case sensitive.And this is a second.
However, to preserve readability in HTML files, headings should be on separate lines, and paragraphs should be separated by blank lines (in addition to the
tags). NCSA Mosaic handles
by ending the current paragraph and inserting a blank line. In HTML+, a successor to HTML currently in development,
becomes a ``container'' of text, just as the text of a level-one heading is ``contained'' within
This is a paragraph in HTML+.
The difference is that the closing tag can always be omitted. (That is, if a browser sees a, it knows that there must be an implied
to end the previous paragraph.) In other words, in HTML+,is a beginning-of- paragraph marker. The advantage of this change is that you will be able to specify formatting options for a paragraph. For example, in HTML+, you will be able to center a paragraph by coding
This is a centered paragraph. This is HTML+, so you can't do it yet.
This change won't effect any documents you write now, and they will
continue to look just the same with HTML+ browsers.
1.3.3 - Linking to Other Documents
The chief power of HTML comes from its ability to link regions of text
(and also images) to another document. The browser highlights these
regions (usually with color and/or underlines) to indicate that they
are hypertext links (often shortened to hyperlinks or simply links).
HTML's single hypertext-related tag is , which stands for anchor. To
include an anchor in your document:
Start the anchor with . (There's a space after the .) Specify the
document that's being pointed to by entering the parameter
paragraph tags. 1.4.1.2 - Numbered Lists A numbered list (also called an ordered list, from which the tag name derives) is identical to an unnumbered list, except it uses
paragraph tags), lists, or other definition information. 1.4.1.4 - Nested Lists Lists can be arbitrarily nested, although in practice you probably should limit the nesting to three levels. You can also have a number of paragraphs, each containing a nested list, in a single list item. An example nested list:
tag (which stands for ``preformatted'') to generate text in a fixed-width font and cause spaces, new lines, and tabs to be significant. (That is, multiple spaces are displayed as multiple spaces, and lines break in the same locations as in the source HTML file.) This is useful for program listings. For example, the following lines#!/bin/csh cd $SCR cfs get mysrc.f:mycfsdir/mysrc.f cfs get myinfile:mycfsdir/myinfile fc -02 -o mya.out mysrc.f mya.out cfs save myoutfile:mycfsdir/myoutfile rm *display as #!/bin/csh cd $SCR cfs get mysrc.f:mycfsdir/mysrc.f cfs get myinfile:mycfsdir/myinfile fc -02 -o mya.out mysrc.f mya.out cfs save myoutfile:mycfsdir/myoutfile rm * Hyperlinks can be used withinsections. You should avoid using other HTML tags withinsections, however. Note that because <, >, and &; have special meaning in HTML, you have to use their escape sequences (&;lt;, &;gt;, and &;amp;, respectively) to enter these characters. See the section 1.5.3 - Special Characters for more information. 1.4.3 - Extended Quotations Use thetag to include quotations in a separate block on the screen. Most browsers generally indent to separate it from surrounding text. An example:I still have a dream. It is a dream deeply rooted in the American dream.The result is: I still have a dream. It is a dream deeply rooted in the American dream. I have a dream that one day this nation will rise up and live out the true meaning of its creed. We hold these truths to be self-evident that all men are created equal. 1.4.4 - Addresses The tag is generally used to specify the author of a document and a means of contacting the author (e.g., an email address). This is usually the last item in a file. For example, the last line of the online version of this guide is A Beginner's Guide to HTML / NCSA / pubs@ncsa.uiuc.edu The result is A Beginner's Guide to HTML / NCSA / pubs@ncsa.uiuc.edu NOTE: is not used for postal addresses. See ``Forced Line Breaks'' on page 10 to see how to format postal addresses. 1.5 - Character Formatting You can code individual words or sentences with special styles. There are two types of styles: logical and physical. Logical styles tag text according to its meaning, while physical styles specify the specific appearance of a section. For example, in the preceding sentence, the words ``logical styles'' was tagged as a ``definition.'' The same effect (formatting those words in italics), could have been achieved via a different tag that specifies merely ``put these words in italics.'' 1.5.1 - Physical Versus Logical: Use Logical Styles When Possible If physical and logical styles produce the same result on the screen, why are there both? We devolve, for a couple of paragraphs, into the philosophy of SGML, which can be summed in a Zen-like mantra: ``Trust your browser.'' In the ideal SGML universe, content is divorced from presentation. Thus, SGML tags a level-one heading as a level-one heading, but does not specify that the level-one heading should be, for instance, 24-point bold Times centered on the top of a page. The advantage of this approach (it's similar in concept to style sheets in many word processors) is that if you decide to change level-one headings to be 20-point left-justified Helvetica, all you have to do is change the definition of the level-one heading in the presentation device (i.e., your World Wide Web browser). The other advantage of logical tags is that they help enforce consistency in your documents. It's easier to tag something asI have a dream that one day this nation will rise up and live out the true meaning of its creed. We hold these truths to be self-evident that all men are created equal.
than to remember that level-one headings are 24-point bold Times or whatever. The same is true for character styles. For example, consider the tag. Most browsers render it in bold text. However, it is possible that a reader would prefer that these sections be displayed in red instead. Logical styles offer this flexibility. 1.5.1.1 - Logical Styles for a word being defined. Typically displayed in italics. (NCSA Mosaic is a World Wide Web browser.) for emphasis. Typically displayed in italics. (Watch out for pickpockets.) for titles of books, films, etc. Typically displayed in italics. (A Beginner's Guide to HTML)
for snippets of computer code. Displayed in a fixed-width font. (The
header file) for user keyboard entry. Should be displayed in a bold fixed-width font, but many browsers render it in the plain fixed-width font. (Enter passwd to change your password.) for computer status messages. Displayed in a fixed-width font. (Segmentation fault: Core dumped.) for strong emphasis. Typically displayed in bold. (Important) for a ``metasyntactic'' variable, where the user is to replace the variable with a specific instance. Typically displayed in italics. (rm filename deletes the file.) 1.5.1.2 - Physical Styles bold text italic text typewriter text, e.g. fixed-width font. 1.5.2 - Using Character Tags To apply a character style, Start with , where >is the desired character formatting tag, to indicate the beginning of the tagged text. Enter the tagged text. End the passage with . 1.5.3 - Special Characters 1.5.3.1 - Escape Sequences Four characters of the ASCII character set -- the left angle bracket (<), the right angle bracket (>), the ampersand (&) and the double quote (") -- have special meaning within HTML and therefore cannot be used ``as is'' in text. (The angle brackets are used to indicate the beginning and end of HTML tags, and the ampersand is used to indicate the beginning of an escape sequence.) To use one of these characters in an HTML document, you must enter its escape sequence instead: &;lt; the escape sequence for < &;gt; the escape sequence for > &;amp; the escape sequence for ; &;quot; the escape sequence for " Additional escape sequences support accented characters. For example: &;ouml; the escape sequence for a lowercase o with an umlaut. &;ntilde; the escape sequence for a lowercase n with an tilde. &;Egrave; the escape sequence for an uppercase E with a grave accent. http://info.cern.ch/hypertext/WWW/MarkUp/ISOlat1.html A full list of supported characters can be found at CERN. NOTE: Unlike the rest of HTML, the escape sequences are case sensitive. You cannot, for instance, use &;LT; instead of &;lt;. 1.5.3.2 - Forced Line Breaks The
tag forces a line break with no extra space between lines. (By contrast, most browsers format theparagraph tag with an additional blank line to more clearly indicate the beginning the new paragraph.) One use of
is in formatting addresses: National Center for Supercomputing Applications
605 East Springfield Avenue
Champaign, Illinois 61820-5518
1.5.3.3 - Horizontal Rules The
tag produces a horizontal line the width of the browser window. 1.6 - In-line Images Most Web browsers can display in-line images (that is, images next to text) that are in X Bitmap (XBM) or GIF format. Each image takes time to process and slows down the initial display of the document, so generally you should not include too many or overly large images. To include an in-line image, usewhere image_URL is the URL of the image file. The syntax for IMG SRC URLs is identical to that used in an anchor HREF. If the image file is a GIF file, then the filename part of image_URL must end with .gif. Filenames of X Bitmap images must end with .xbm.
By default the bottom of an image is aligned with the text.
Add the ALIGN=TOP option if you want the browser to align adjacent text with the top of the image as shown in this paragraph. The full in-line image tag with the top alignment is:
image_URL
ALIGN=MIDDLE aligns the text with the center of the image. 1.6.1 - Alternate Text for Browsers That Can't Display Images Some World Wide Web browsers, primarily those that run on VT100 terminals, cannot display images. The ALT option allows you to specify text to be displayed when an image cannot be. For example:
where UpArrow.gif is the picture of an upward pointing arrow. With NCSA Mosaic and other graphics-capable viewers, the user sees the up arrow graphic. With a VT100 browser, such as lynx, the user sees the word ``Up.'' 1.7 - External Images, Sounds, and Animations You may want to have an image open as a separate document when a user activates a link on either a word or a smaller, in-line version of the image included in your document. This is considered an external image and is useful if you do not wish to slow down the loading of the main document with large in-line images. To include a reference to an external image, use link anchor Use the same syntax is for links to external animations and sounds. The only difference is the file extension of the linked file. For example, link anchor specifies a link to a QuickTime movie. Some common file types and their extensions are: File Type Extension Plain text .txt HTML document .html GIF image .gif TIFF image .tiff XBM bitmap image .xbm JPEG image .jpg or .jpeg PostScript file .ps AIFF sound .aiff AU sound .au QuickTime movie .mov MPEG movie .mpeg or .mpg Make sure your intended audience has the necessary viewers. Most UNIX workstations, for instance, cannot view QuickTime movies. 1.8 - Troubleshooting 1.8.1 - Avoid Overlapping Tags Consider this snippet of HTML: This is an example of overlapping HTML tags.
The word ``overlapping'' is contained within both the and tags. How does the browser format it? You won't know until you look, and different browsers will likely react differently. In general, avoid overlapping tags. 1.8.2 - Embed Anchors and Character Tags, But Nothing Else It is acceptable to embed anchors within another HTML element:
My heading
Do not embed a heading or another HTML element within an anchor:My heading
Although most browsers currently handle this example, it is forbidden by the official HTML and HTML+ specifications, and will not work with future browsers. Character tags modify the appearance of other tags:
- A bold list item
However, avoid embedding other types of HTML element tags. For example, it is tempting to embed a heading within a list, in order to make the font size larger:
- An italic list item
A large heading
Although some browsers, such as NCSA Mosaic for the X Window System, format this construct quite nicely, it is unpredictable (because it is undefined) what other browsers will do. For compatibility with all browsers, avoid these kinds of constructs. What's the difference between embedding a within a
Something slightly smaller
- tag as opposed to embedding a
within a
- ? This is again a question of SGML. The semantic meaning of
is that it's the main heading of a document and that it should be followed by the content of the document.Thus it doesn't make sense to find a
within a list. Character formatting tags also are generally not additive. You might expect that some text would produce bold-italic text. On some browsers it does; other browsers interpret only the innermost tag (here, the italics). 1.8.3 - Check Your Links When an
tag points at an image that does not exist, a dummy image is substituted. When this happens, make sure that the referenced image does in fact exist, that the hyperlink has the correct information in the URL, and that the file permission is set appropriately (world-readable). 1.9 - A Longer Example Here is a longer example of an HTML document:
A Longer Example A Longer Example
This is a simple HTML document. This is the first paragraph.This is the second paragraph, which shows special effects. This is a word in italics. This is a word in bold. Here is an in-lined GIF image:
.
This is the third paragraph, which demonstrates links. Here is a hypertext link from the word foo to a document called "subdir/myfile.html". (If you try to follow this link, you will get an error screen.)
A second-level header
Here is a section of text that should display as a fixed-width font:
On the stiff twig up there Hunches a wet black rook Arranging and rearranging its feathers in the rain ...This is a unordered list with two items:
This is the end of my example document.
- cranberries
- blueberries
Me (me@mycomputer.univ.edu) In addition to tags already discussed, this example also uses the ... and ... tags, which separate the document into introductory information about the document and the main text of the document. These tags don't change the appearance of the formatted document at all, but are useful for several purposes (for example, NCSA Mosaic for Macintosh 2.0, for example, allows you to browse just the header portion of document before deciding whether to download the rest), and it is recommended that you use these tags. 1.10 - For More Information This guide is only an introduction to HTML and not a comprehensive reference. Below are additional sources of information. 1.10.1 - Fill-out Forms One major feature not discussed here is fill-out forms, which allows users to return information to the World Wide Web server. For information on fill-out forms, look at this /SDG/Software/Mosaic/Docs/fill-out-forms/overview.html Fill-out Forms Overview 1.10.2 - Style Guides The following offer advice on how to write ``good'' HTML: http://www.willamette.edu/html-composition/strict-html.html Composing Good HTML http://info.cern.ch/hypertext/WWW/Provider/Style/Introduction.html CERN's style guide for online hypert 1.10.3 - Other Introductory Documents These cover similar information as this guide: http://www.ucc.ie/info/net/htmldoc.html How to Write HTML Files http://melmac.harris-atd.com/about_html.html Introduction to HTML 1.10.4 - Additional References http://kuhttp.cc.ukans.edu/lynx_help/HTML_quick.html The HTML Quick Reference Guide which provides a comprehensive listing of HTML codes http://info.cern.ch/hypertext/WWW/MarkUp/MarkUp.html The official HTML specification http://info.cern.ch/hypertext/WWW/MarkUp/SGML.html A description of SGML, the Standard Generalized Markup Language http://www.hal.com/~connolly/drafts/html-design.html Dan Connolly's HTML Design Notebook - Dan Connolly is one of the originators of HTML. National Center for Supercomputing Applications / pubs@ncsa.uiuc.edu