![]() |
![]() |
![]() |
![]() |
The following information is described in this section:
Charset Recognition in IE3
This charset recognition specification defines which charset identifiers Internet Explorer recognizes in the HTTP header of HTTP replies, and which charset IDs it recognizes in the <META ... CHARSET=charsetID> tag. It also specifies which built-in charset translation the charset ID maps to. This does not specify what IE should send out as the ACCEPT-CHARSET parameter in the HTTP request.
Table of Base Charsets, Display Names, and Aliases
In the following table, the base charset is the basic translation built into IE3. Aliases lists all other charset IDs that are recognized and can be represented without translation, using the "base charset" translation method. This does not, in all cases, mean that alias and base charset represent the same charset; the alias charset can be a subset of the base charset. Base charset is not a recognized name unless repeated in the "aliases" column.
Base Character | Display Name | Aliases |
1252 | Western | us-ascii, iso8859-1, ascii, iso_8859-1, iso-8859-1, ANSI_X3.4-1968, iso-ir-6, ANSI_X3.4-1986, ISO_646.irv:1991, ISO646-US, us, IBM367, cp367, csASCII, latin1, iso_8859-1:1987, iso-ir-100, ibm819, cp819 |
28592 | Central European (ISO) | iso8859-2, iso-8859-2, iso_8859-2, latin2, iso_8859-2:1987, iso-ir-101, l2, csISOLatin2 |
1250 | Central European (Windows) | windows-1250, x-cp1250 |
1251 | Cyrillic (Windows) | windows-1251, x-cp1251 |
1253 | Greek (Windows) | windows-1253 |
1254 | Turkish (Windows) | windows-1254 |
932 | Shift-JIS | shift_jis, x-sjis, ms_Kanji, csShiftJIS |
EUC-JP | EUC | Extended_UNIX_Code_Packed_Format_for_Japanese, csEUCPkdFmtJapanese, x-euc-jp |
JIS | JIS | csISO2022JP, iso-2022-jp |
1257 | windows-1257 | |
950 | Traditional Chinese (BIG5) | big5, csbig5, x-x-big5 |
936 | Simplified Chinese | GB_2312-80, iso-ir-58, chinese, csISO58GB231280, csGB2312, gb2312 |
20866 | Cyrillic (KOI8-R) | csKOI8R, koi8-r |
949 | Korean | ks_c_5601, ks_c_5601-1987, korean, csKSC56011987 |
Correct Usage
The correct usage is as specified in RFC 1341. For example:
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=Windows-1251">
This should be in or before HEAD but certainly before BODY.
Priority
The following list shows the priorities of charset declarations that IE will use.
A frameset can have differing charsets per frame.
Position of <META .. CHARSET=..> in the Document
The <META .. CHARSET=..> sequence can appear anywhere in the document BEFORE the BODY tag. In any case, it affects the whole document, including TITLEs, appearing before the <META CHARSET> tag.
ISO Latin-1 Character Set
The following table contains the ISO Latin-1 character set. The table describes each character, its decimal code, and its special entity reference for HTML, as well as providing a brief description.
Character | Decimal Code | HTML | Description |
À | À | À | Capital A, grave accent |
à | à | à | Small a, grave accent |
Á | Á | Á | Capital A, acute accent |
á | á | á | Small a, acute accent |
 |  |  | Capital A, circumflex |
â | â | â | Small a, circumflex |
à | à | à | Capital A, tilde |
ã | ã | ã | Small a, tilde |
Ä | Ä | Ä | Capital A, diæresis / umlaut |
ä | ä | ä | Small a, diæresis / umlaut |
Å | Å | Å | Capital A, ring |
å | å | å | Small a, ring |
Æ | Æ | Æ | Capital AE ligature |
æ | æ | æ | Small ae ligature |
Ç | Ç | Ç | Capital C, cedilla |
ç | ç | ç | Small c, cedilla |
È | È | È | Capital E, grave accent |
è | è | è | Small e, grave accent |
É | É | É | Capital E, acute accent |
é | é | é | Small e, acute accent |
Ê | Ê | Ê | Capital E, circumflex |
ê | ê | ê | Small e, circumflex |
Ë | Ë | Ë | Capital E, diæresis / umlaut |
ë | ë | ë | Small e, diæresis / umlaut |
Ì | Ì | Ì | Capital I, grave accent |
ì | ì | ì | Small i, grave accent |
Í | Í | Í | Capital I, acute accent |
í | í | í | Small i, acute accent |
Î | Î | Î | Capital I, circumflex |
î | î | î | Small i, circumflex |
Ï | Ï | Ï | Capital I, diæresis / umlaut |
ï | ï | ï | Small i, diæresis / umlaut |
Ð | Ð | Ð | Capital Eth, Icelandic |
ð | ð | ð | Small eth, Icelandic |
Ñ | Ñ | Ñ | Capital N, tilde |
ñ | ñ | ñ | Small n, tilde |
Ò | Ò | Ò | Capital O, grave accent |
ò | ò | ò | Small o, grave accent |
Ó | Ó | Ó | Capital O, acute accent |
ó | ó | ó | Small o, acute accent |
Ô | Ô | Ô | Capital O, circumflex |
ô | ô | ô | Small o, circumflex |
Õ | Õ | Õ | Capital O, tilde |
õ | õ | õ | Small o, tilde |
Ö | Ö | Ö | Capital O, diæresis / umlaut |
ö | ö | ö | Small o, diæresis / umlaut |
Ø | Ø | Ø | Capital O, slash |
ø | ø | ø | Small o, slash |
Ù | Ù | Ù | Capital U, grave accent |
ù | ù | ù | Small u, grave accent |
Ú | Ú | Ú | Capital U, acute accent |
ú | ú | ú | Small u, acute accent |
Û | Û | Û | Capital U, circumflex |
û | û | û | Small u, circumflex |
Ü | Ü | Ü | Capital U, diæresis / umlaut |
ü | ü | ü | Small u, diæresis / umlaut |
Ý | Ý | Ý | Capital Y, acute accent |
ý | ý | ý | Small y, acute accent |
Þ | Þ | Þ | Capital Thorn, Icelandic |
þ | þ | þ | Small thorn, Icelandic |
ß | ß | ß | Small sharp s, German sz |
ÿ | ÿ | ÿ | Small y, diæresis / umlaut |
Character Set
The following table describes the complete character set for Internet Explorer 3.0 English (U.S.). The first column shows the character as it appears in Internet Explorer 3.0. The second column shows the decimal number as it is written in an HTML document to produce the characters. Occasionally, special characters have mnemonic names. For example, the registered trademark character can be written in HTML as ®. The third column lists these HTML characters. The last column gives a description of each character where appropriate.
Character | Decimal Code | HTML | Description |
� | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
	 | Horizontal tab | ||
| Line feed | ||
 | Unused | ||
 | Unused | ||
| Carriage Return | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
 | Unused | ||
  | Space | ||
! | ! | Exclamation mark | |
" | " | " | Quotation mark |
# | # | Number sign | |
$ | $ | Dollar sign | |
% | % | Percent sign | |
& | & | & | Ampersand |
' | ' | Apostrophe | |
( | ( | Left parenthesis | |
) | ) | Right parenthesis | |
* | * | Asterisk | |
+ | + | Plus sign | |
, | , | Comma | |
- | - | Hyphen | |
. | . | Period (fullstop) | |
/ | / | Solidus (slash) | |
0 | 0 | Digit 0 | |
1 | 1 | Digit 1 | |
2 | 2 | Digit 2 | |
3 | 3 | Digit 3 | |
4 | 4 | Digit 4 | |
5 | 5 | Digit 5 | |
6 | 6 | Digit 6 | |
7 | 7 | Digit 7 | |
8 | 8 | Digit 8 | |
9 | 9 | Digit 9 | |
: | : | Colon | |
; | ; | Semicolon | |
< | < | < | Less than |
= | = | Equals sign | |
> | > | > | Greater than |
? | ? | Question mark | |
@ | @ | Commercial at | |
A | A | Capital A | |
B | B | Capital B | |
C | C | Capital C | |
D | D | Capital D | |
E | E | Capital E | |
F | F | Capital F | |
G | G | Capital G | |
H | H | Capital H | |
I | I | Capital I | |
J | J | Capital J | |
K | K | Capital K | |
L | L | Capital L | |
M | M | Capital M | |
N | N | Capital N | |
O | O | Capital O | |
P | P | Capital P | |
Q | Q | Capital Q | |
R | R | Capital R | |
S | S | Capital S | |
T | T | Capital T | |
U | U | Capital U | |
V | V | Capital V | |
W | W | Capital W | |
X | X | Capital X | |
Y | Y | Capital Y | |
Z | Z | Capital Z | |
[ | [ | Left square bracket | |
\ | \ | Reverse solidus (backslash) | |
] | ] | Right square bracket | |
^ | ^ | Caret | |
_ | _ | Horizontal bar (underscore) | |
` | ` | Acute accent | |
a | a | Small a | |
b | b | Small b | |
c | c | Small c | |
d | d | Small d | |
e | e | Small e | |
f | f | Small f | |
g | g | Small g | |
h | h | Small h | |
i | i | Small i | |
j | j | Small j | |
k | k | Small k | |
l | l | Small l | |
m | m | Small m | |
n | n | Small n | |
o | o | Small o | |
p | p | Small p | |
q | q | Small q | |
r | r | Small r | |
s | s | Small s | |
t | t | Small t | |
u | u | Small u | |
v | v | Small v | |
w | w | Small w | |
x | x | Small x | |
y | y | Small y | |
z | z | Small z | |
{ | { | Left curly brace | |
| | | | Vertical bar | |
} | } | Right curly brace | |
~ | ~ | Tilde | |
|  | Unused | |
| € | Unused | |
  | | Non-breaking Space | |
¡ | ¡ | ¡ | Inverted exclamation |
¢ | ¢ | ¢ | Cent sign |
£ | £ | £ | Pound sterling |
¤ | ¤ | ¤ | General currency sign |
¥ | ¥ | ¥ | Yen sign |
¦ | ¦ | ¦ or &brkbar; | Broken vertical bar |
§ | § | &§ | Section sign |
¨ | ¨ | &&um; or &¨ | Diæresis / Umlaut |
© | © | &© | Copyright |
ª | ª | &ª | Feminine ordinal |
« | « | &« | Left angle quote, guillemot left |
¬ | ¬ | &¬ | Not sign |
| ­ | ­ | Soft hyphen |
® | ® | ® | Registered trademark |
¯ | ¯ | ¯ or &hibar; | Macron accent |
° | ° | ° | Degree sign |
± | ± | ± | Plus or minus |
² | ² | ² | Superscript two |
³ | ³ | ³ | Superscript three |
´ | ´ | ´ | Acute accent |
µ | µ | µ | Micro sign |
¶ | ¶ | ¶ | Paragraph sign |
· | · | · | Middle dot |
¸ | ¸ | ¸ | Cedilla |
¹ | ¹ | ¹ | Superscript one |
º | º | º | Masculine ordinal |
» | » | » | Right angle quote, guillemot right |
¼ | ¼ | ¼ | Fraction one-fourth |
½ | ½ | ½ | Fraction one-half |
¾ | ¾ | ¾ | Fraction three-fourths |
¿ | ¿ | ¿ | Inverted question mark |
À | À | À | Capital A, grave accent |
Á | Á | Á | Capital A, acute accent |
 |  |  | Capital A, circumflex |
à | à | à | Capital A, tilde |
Ä | Ä | Ä | Capital A, diæresis / umlaut |
Å | Å | Å | Capital A, ring |
Æ | Æ | Æ | Capital AE ligature |
Ç | Ç | Ç | Capital C, cedilla |
È | È | È | Capital E, grave accent |
É | É | É | Capital E, acute accent |
Ê | Ê | Ê | Capital E, circumflex |
Ë | Ë | Ë | Capital E, diæresis / umlaut |
Ì | Ì | Ì | Capital I, grave accent |
Í | Í | Í | Capital I, acute accent |
Î | Î | Î | Capital I, circumflex |
Ï | Ï | Ï | Capital I, diæresis / umlaut |
Ð | Ð | Ð | Capital Eth, Icelandic |
Ñ | Ñ | Ñ | Capital N, tilde |
Ò | Ò | Ò | Capital O, grave accent |
Ó | Ó | Ó | Capital O, acute accent |
Ô | Ô | Ô | Capital O, circumflex |
Õ | Õ | Õ | Capital O, tilde |
Ö | Ö | Ö | Capital O, diæresis / umlaut |
× | × | × | Multiply sign |
Ø | Ø | Ø | Capital O, slash |
Ù | Ù | Ù | Capital U, grave accent |
Ú | Ú | Ú | Capital U, acute accent |
Û | Û | Û | Capital U, circumflex |
Ü | Ü | Ü | Capital U, diæresis / umlaut |
Ý | Ý | Ý | Capital Y, acute accent |
Þ | Þ | Þ | Capital Thorn, Icelandic |
ß | ß | ß | Small sharp s, German sz |
à | à | à | Small a, grave accent |
á | á | á | Small a, acute accent |
â | â | â | Small a, circumflex |
ã | ã | ã | Small a, tilde |
ä | ä | ä | Small a, diæresis / umlaut |
å | å | å | Small a, ring |
æ | æ | æ | Small ae ligature |
ç | ç | ç | Small c, cedilla |
è | è | è | Small e, grave accent |
é | é | é | Small e, acute accent |
ê | ê | ê | Small e, circumflex |
ë | ë | ë | Small e, diæresis / umlaut |
ì | ì | ì | Small i, grave accent |
í | í | í | Small i, acute accent |
î | î | î | Small i, circumflex |
ï | ï | ï | Small i, diæresis / umlaut |
ð | ð | ð | Small eth, Icelandic |
ñ | ñ | ñ | Small n, tilde |
ò | ò | ò | Small o, grave accent |
ó | ó | ó | Small o, acute accent |
ô | ô | ô | Small o, circumflex |
õ | õ | õ | Small o, tilde |
ö | ö | ö | Small o, diæresis / umlaut |
÷ | ÷ | ÷ | Division sign |
ø | ø | ø | Small o, slash |
ù | ù | ù | Small u, grave accent |
ú | ú | ú | Small u, acute accent |
û | û | û | Small u, circumflex |
ü | ü | ü | Small u, diæresis / umlaut |
ý | ý | ý | Small y, acute accent |
þ | þ | þ | Small thorn, Icelandic |
ÿ | ÿ | ÿ | Small y, diæresis / umlaut |
![]() |
![]() |
![]() |
![]() |