HTML Character Sets

Microsoft FrontPage Visual Basic

The following table contains the complete ISO Latin-1 character set, corresponding to the first 256 entries of the Unicode character repertoire. The table provides each character, its decimal code, its named entity reference for HTML, and also a brief description.

Character Decimal code Named entity Description
--- � --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
--- 	 --- Horizontal tab
--- 
 --- Line feed
---  --- Unused
---  --- Unused
--- 
 --- Carriage Return
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
---  --- Unused
  --- Space
! ! --- Exclamation mark
" " " Quotation mark
# # --- Number sign
$ $ --- Dollar sign
% % --- Percent sign
& & & Ampersand
' ' --- Apostrophe
( ( --- Left parenthesis
) ) --- Right parenthesis
* * --- Asterisk
+ + --- Plus sign
, , --- Comma
- - --- Hyphen
. . --- Period (fullstop)
/ / --- Solidus (slash)
0 0 --- Digit 0
1 1 --- Digit 1
2 2 --- Digit 2
3 3 --- Digit 3
4 4 --- Digit 4
5 5 --- Digit 5
6 6 --- Digit 6
7 7 --- Digit 7
8 8 --- Digit 8
9 9 --- Digit 9
: : --- Colon
; &#59; --- Semicolon
< &#60; &lt; Less than
= &#61; --- Equals sign
> &#62; &gt; Greater than
? &#63; --- Question mark
@ &#64; --- Commercial at
A &#65; --- Capital A
B &#66; --- Capital B
C &#67; --- Capital C
D &#68; --- Capital D
E &#69; --- Capital E
F &#70; --- Capital F
G &#71; --- Capital G
H &#72; --- Capital H
I &#73; --- Capital I
J &#74; --- Capital J
K &#75; --- Capital K
L &#76; --- Capital L
M &#77; --- Capital M
N &#78; --- Capital N
O &#79; --- Capital O
P &#80; --- Capital P
Q &#81; --- Capital Q
R &#82; --- Capital R
S &#83; --- Capital S
T &#84; --- Capital T
U &#85; --- Capital U
V &#86; --- Capital V
W &#87; --- Capital W
X &#88; --- Capital X
Y &#89; --- Capital Y
Z &#90; --- Capital Z
[ &#91; --- Left square bracket
\ &#92; --- Reverse solidus (backslash)
] &#93; --- Right square bracket
^ &#94; --- Caret
_ &#95; --- Horizontal bar (underscore)
` &#96; --- Grave accent
a &#97; --- Small a
b &#98; --- Small b
c &#99; --- Small c
d &#100; --- Small d
e &#101; --- Small e
f &#102; --- Small f
g &#103; --- Small g
h &#104; --- Small h
i &#105; --- Small i
j &#106; --- Small j
k &#107; --- Small k
l &#108; --- Small l
m &#109; --- Small m
n &#110; --- Small n
o &#111; --- Small o
p &#112; --- Small p
q &#113; --- Small q
r &#114; --- Small r
s &#115; --- Small s
t &#116; --- Small t
u &#117; --- Small u
v &#118; --- Small v
w &#119; --- Small w
x &#120; --- Small x
y &#121; --- Small y
z &#122; --- Small z
{ &#123; --- Left curly brace
| &#124; --- Vertical bar
} &#125; --- Right curly brace
~ &#126; --- Tilde
--- &#127; --- Unused
&#160; &nbsp; Nonbreaking space
¡ &#161; &iexcl; Inverted exclamation
¢ &#162; &cent; Cent sign
£ &#163; &pound; Pound sterling
¤ &#164; &curren; General currency sign
¥ &#165; &yen; Yen sign
¦ &#166; &brvbar; or &brkbar; Broken vertical bar
§ &#167; &sect; Section sign
¨ &#168; &uml; or &die; Diæresis / Umlaut
© &#169; &copy; Copyright
ª &#170; &ordf; Feminine ordinal
« &#171; &laquo; Left angle quote, guillemot left
&#172; &not; Not sign
- &#173; &shy; Soft hyphen
® &#174; &reg; Registered trademark
¯ &#175; &macr; or &hibar; Macron accent
° &#176; &deg; Degree sign
± &#177; &plusmn; Plus or minus
² &#178; &sup2; Superscript two
³ &#179; &sup3; Superscript three
´ &#180; &acute; Acute accent
µ &#181; &micro; Micro sign
&#182; &para; Paragraph sign
&#183; &middot; Middle dot
¸ &#184; &cedil; Cedilla
¹ &#185; &sup1; Superscript one
º &#186; &ordm; Masculine ordinal
» &#187; &raquo; Right angle quote, guillemot right
¼ &#188; &frac14; Fraction one-fourth
½ &#189; &frac12; Fraction one-half
¾ &#190; &frac34; Fraction three-fourths
¿ &#191; &iquest; Inverted question mark
À &#192; &Agrave; Capital A, grave accent
Á &#193; &Aacute; Capital A, acute accent
 &#194; &Acirc; Capital A, circumflex
à &#195; &Atilde; Capital A, tilde
Ä &#196; &Auml; Capital A, diæresis / umlaut
Å &#197; &Aring; Capital A, ring
Æ &#198; &AElig; Capital AE ligature
Ç &#199; &Ccedil; Capital C, cedilla
È &#200; &Egrave; Capital E, grave accent
É &#201; &Eacute; Capital E, acute accent
Ê &#202; &Ecirc; Capital E, circumflex
Ë &#203; &Euml; Capital E, diæresis / umlaut
Ì &#204; &Igrave; Capital I, grave accent
Í &#205; &Iacute; Capital I, acute accent
Î &#206; &Icirc; Capital I, circumflex
Ï &#207; &Iuml; Capital I, diæresis / umlaut
Ð &#208; &ETH; Capital Eth, Icelandic
Ñ &#209; &Ntilde; Capital N, tilde
Ò &#210; &Ograve; Capital O, grave accent
Ó &#211; &Oacute; Capital O, acute accent
Ô &#212; &Ocirc; Capital O, circumflex
Õ &#213; &Otilde; Capital O, tilde
Ö &#214; &Ouml; Capital O, diæresis / umlaut
× &#215; &times; Multiply sign
Ø &#216; &Oslash; Capital O, slash
Ù &#217; &Ugrave; Capital U, grave accent
Ú &#218; &Uacute; Capital U, acute accent
Û &#219; &Ucirc; Capital U, circumflex
Ü &#220; &Uuml; Capital U, diæresis / umlaut
Ý &#221; &Yacute; Capital Y, acute accent
Þ &#222; &THORN; Capital Thorn, Icelandic
ß &#223; &szlig; Small sharp s, German sz
à &#224; &agrave; Small a, grave accent
á &#225; &aacute; Small a, acute accent
â &#226; &acirc; Small a, circumflex
ã &#227; &atilde; Small a, tilde
ä &#228; &auml; Small a, diæresis / umlaut
å &#229; &aring; Small a, ring
æ &#230; &aelig; Small ae ligature
ç &#231; &ccedil; Small c, cedilla
è &#232; &egrave; Small e, grave accent
é &#233; &eacute; Small e, acute accent
ê &#234; &ecirc; Small e, circumflex
ë &#235; &euml; Small e, diæresis / umlaut
ì &#236; &igrave; Small i, grave accent
í &#237; &iacute; Small i, acute accent
î &#238; &icirc; Small i, circumflex
ï &#239; &iuml; Small i, diæresis / umlaut
ð &#240; &eth; Small eth, Icelandic
ñ &#241; &ntilde; Small n, tilde
ò &#242; &ograve; Small o, grave accent
ó &#243; &oacute; Small o, acute accent
ô &#244; &ocirc; Small o, circumflex
õ &#245; &otilde; Small o, tilde
ö &#246; &ouml; Small o, diæresis / umlaut
÷ &#247; &divide; Division sign
ø &#248; &oslash; Small o, slash
ù &#249; &ugrave; Small u, grave accent
ú &#250; &uacute; Small u, acute accent
û &#251; &ucirc; Small u, circumflex
ü &#252; &uuml; Small u, diæresis / umlaut
ý &#253; &yacute; Small y, acute accent
þ &#254; &thorn; Small thorn, Icelandic
ÿ &#255; &yuml; Small y, diæresis / umlaut

ShowAdditional Named Entities for HTML

The following table contains additional named entities, their numeric character references, and a description of each. With the exception of the left and right-pointing brackets (&#9001; and &#9002;), these entities are rendered using Lucida sans Unicode.

Character Named entity Numeric character reference Description
Latin Extended-B
ƒ &fnof; &#402; Latin small f with hook, =function, =florin, U0192 ISOtech
Greek
Α &Alpha; &#913; Greek capital letter alpha, U0391
Β &Beta; &#914; Greek capital letter beta, U0392
Γ &Gamma; &#915; Greek capital letter gamma, U0393 ISOgrk3
Δ &Delta; &#916; Greek capital letter delta, U0394 ISOgrk3
Ε &Epsilon; &#917; Greek capital letter epsilon, U0395
Ζ &Zeta; &#918; Greek capital letter zeta, U0396
Η &Eta; &#919; Greek capital letter eta, U0397
Θ &Theta; &#920; Greek capital letter theta, U0398 ISOgrk3
Ι &Iota; &#921; Greek capital letter iota, U0399
Κ &Kappa; &#922; Greek capital letter kappa, U039A
Λ &Lambda; &#923; Greek capital letter lambda, U039B ISOgrk3
Μ &Mu; &#924; Greek capital letter mu, U039C
Ν &Nu; &#925; Greek capital letter nu, U039D
Ξ &Xi; &#926; Greek capital letter xi, U039E ISOgrk3
Ο &Omicron; &#927; Greek capital letter omicron, U039F
Π &Pi; &#928; Greek capital letter pi, U03A0 ISOgrk3
Ρ &Rho; &#929; Greek capital letter rho, U03A1
Σ &Sigma; &#931; Greek capital letter sigma, U03A3 ISOgrk3
Τ &Tau; &#932; Greek capital letter tau, U03A4
Υ &Upsilon; &#933; Greek capital letter upsilon, U03A5 ISOgrk3
Φ &Phi; &#934; Greek capital letter phi, U03A6 ISOgrk3
Χ &Chi; &#935; Greek capital letter chi, U03A7
Ψ &Psi; &#936; Greek capital letter psi, U03A8 ISOgrk3
Ω &Omega; &#937; Greek capital letter omega, U03A9 ISOgrk3
α &alpha; &#945; Greek small letter alpha, U03B1 ISOgrk3
β &beta; &#946; Greek small letter beta, U03B2 ISOgrk3
γ &gamma; &#947; Greek small letter gamma, U03B3 ISOgrk3
δ &delta; &#948; Greek small letter delta, U03B4 ISOgrk3
ε &epsilon; &#949; Greek small letter epsilon, U03B5 ISOgrk3
ζ &zeta; &#950; Greek small letter zeta, U03B6 ISOgrk3
η &eta; &#951; Greek small letter eta, U03B7 ISOgrk3
θ &theta; &#952; Greek small letter theta, U03B8 ISOgrk3
ι &iota; &#953; Greek small letter iota, U03B9 ISOgrk3
κ &kappa; &#954; Greek small letter kappa, U03BA ISOgrk3
λ &lambda; &#955; Greek small letter lambda, U03BB ISOgrk3
μ &mu; &#956; Greek small letter mu, U03BC ISOgrk3
ν &nu; &#957; Greek small letter nu, U03BD ISOgrk3
ξ &xi; &#958; Greek small letter xi, U03BE ISOgrk3
ο &omicron; &#959; Greek small letter omicron, U03BF NEW
π &pi; &#960; Greek small letter pi, U03C0 ISOgrk3
ρ &rho; &#961; Greek small letter rho, U03C1 ISOgrk3
ς &sigmaf; &#962; Greek small letter final sigma, U03C2 ISOgrk3
σ &sigma; &#963; Greek small letter sigma, U03C3 ISOgrk3
τ &tau; &#964; Greek small letter tau, U03C4 ISOgrk3
υ &upsilon; &#965; Greek small letter upsilon, U03C5 ISOgrk3
φ &phi; &#966; Greek small letter phi, U03C6 ISOgrk3
χ &chi; &#967; Greek small letter chi, U03C7 ISOgrk3
ψ &psi; &#968; Greek small letter psi, U03C8 ISOgrk3
ω &omega; &#969; Greek small letter omega, U03C9 ISOgrk3
θ &thetasym; &#977; Greek small letter theta symbol, U03D1 NEW
Υ &upsih; &#978; Greek upsilon with hook symbol, U03D2 NEW
π &piv; &#982; Greek pi symbol, U03D6 ISOgrk3
General Punctuation
&bull; &#8226; bullet, =black small circle, U2022 ISOpub
&hellip; &#8230; horizontal ellipsis, =three dot leader, U2026 ISOpub
&prime; &#8242; prime, =minutes, =feet, U2032 ISOtech
&Prime; &#8243; double prime, =seconds, =inches, U2033 ISOtech
&oline; &#8254; overline, =spacing overscore, U203E NEW
&frasl; &#8260; fraction slash, U2044 NEW
Letterlike Symbols
Script capital P &weierp; &#8472; script capital P, =power set, =Weierstrass p, U2118 ISOamso
Black letter capital L &image; &#8465; blackletter capital I, =imaginary part, U2111 ISOamso
Black letter capital R &real; &#8476; blackletter capital R, =real part symbol, U211C ISOamso
&trade; &#8482; trade mark sign, U2122 ISOnum
Alef symbol &alefsym; &#8501; alef symbol, =first transfinite cardinal, U2135 NEW
Arrows
&larr; &#8592; leftward arrow, U2190 ISOnum
&uarr; &#8593; upward arrow, U2191 ISOnum
&rarr; &#8594; rightward arrow, U2192 ISOnum
&darr; &#8595; downward arrow, U2193 ISOnum
&harr; &#8596; left right arrow, U2194 ISOamsa
Downward arrow with corner left. &crarr; &#8629; downward arrow with corner leftward, =carriage return, U21B5 NEW
Leftward double arrow. &lArr; &#8656; leftward double arrow, U21D0 ISOtech
Upward double arrow. &uArr; &#8657; upward double arrow, U21D1 ISOamsa
Rightward double arrow. &rArr; &#8658; rightward double arrow, U21D2 ISOtech
Downward double arrow. &dArr; &#8659; downward double arrow, U21D3 ISOamsa
Left right double down arrow. &hArr; &#8660; left right double arrow, U21D4 ISOamsa
Mathematical Operators
&forall; &#8704; for all, U2200 ISOtech
&part; &#8706; partial differential, U2202 ISOtech
&exist; &#8707; there exists, U2203 ISOtech
Empty set. &empty; &#8709; empty set, =null set, =diameter, U2205 ISOamso
&nabla; &#8711; nabla, =backward difference, U2207 ISOtech
&isin; &#8712; element of, U2208 ISOtech
Not an element of. &notin; &#8713; not an element of, U2209 ISOtech
&ni; &#8715; contains as member, U220B ISOtech
&prod; &#8719; n-ary product, =product sign, U220F ISOamsb
&sum; &#8722; n-ary sumation, U2211 ISOamsb
&minus; &#8722; minus sign, U2212 ISOtech
Asterisk operator. &lowast; &#8727; asterisk operator, U2217 ISOtech
&radic; &#8730; square root, =radical sign, U221A ISOtech
&prop; &#8733; proportional to, U221D ISOtech
&infin; &#8734; infinity, U221E ISOtech
&ang; &#8736; angle, U2220 ISOamso
&and; &#8869; logical and, =wedge, U2227 ISOtech
Logical or. &or; &#8870; logical or, =vee, U2228 ISOtech
&cap; &#8745; intersection, =cap, U2229 ISOtech
&cup; &#8746; union, =cup, U222A ISOtech
&int; &#8747; integral, U222B ISOtech
&there4; &#8756; therefore, U2234 ISOtech
&sim; &#8764; tilde operator, =varies with, =similar to, U223C ISOtech
Approximately equal to. &cong; &#8773; approximately equal to, U2245 ISOtech
Almost equal to. &asymp; &#8773; almost equal to, =asymptotic to, U2248 ISOamsr
&ne; &#8800; not equal to, U2260 ISOtech
&equiv; &#8801; identical to, U2261 ISOtech
&le; &#8804; less-than or equal to, U2264 ISOtech
&ge; &#8805; greater-than or equal to, U2265 ISOtech
&sub; &#8834; subset of, U2282 ISOtech
&sup; &#8835; superset of, U2283 ISOtech
Not a subset of. &nsub; &#8836; not a subset of, U2284 ISOamsn
&sube; &#8838; subset of or equal to, U2286 ISOtech
&supe; &#8839; superset of or equal to, U2287 ISOtech
&oplus; &#8853; circled plus, =direct sum, U2295 ISOamsb
Circled times. &otimes; &#8855; circled times, =vector product, U2297 ISOamsb
&perp; &#8869; up tack, =orthogonal to, =perpendicular, U22A5 ISOtech
Dot operator. &sdot; &#8901; dot operator, U22C5 ISOamsb
Miscellaneous Technical
Left ceiling. &lceil; &#8968; left ceiling, =apl upstile, U2308, ISOamsc
Right ceiling. &rceil; &#8969; right ceiling, U2309, ISOamsc
Left floor. &lfloor; &#8970; left floor, =apl downstile, U230A, ISOamsc
Right floor. &rfloor; &#8971; right floor, U230B, ISOamsc
< &lang; &#9001; left-pointing angle bracket, =bra, U2329 ISOtech
> &rang; &#9002; right-pointing angle bracket, =ket, U232A ISOtech
Geometric Shapes
&loz; &#9674; lozenge, U25CA ISOpub
Miscellaneous Symbols
&spades; &#9824; black spade suit, U2660 ISOpub
&clubs; &#9827; black club suit, =shamrock, U2663 ISOpub
&hearts; &#9829; black heart suit, =valentine, U2665 ISOpub
Black diamond suit. &diams; &#9830; black diamond suit, U2666 ISOpub

ShowCharacter Entities for Special Symbols and BIDI Text

Using NE NE NCR Using NCR
C0 Controls and Basic Latin
" &quot &#34; quotation mark, =apl quote, U0022 ISOnum
& &amp &#38; ampersand, U0026 ISOnum
< &lt &#60; less-than sign, U003C ISOnum
> &gt &#62; greater-than sign, U003E ISOnum
Latin Extended-A
Π&OElig &#338; Latin capital ligature oe, U0152 ISOlat2
œ &oelig &#339; Latin small ligature oe, U0153 ISOlat2
Š &Scaron &#352; Latin capital letter s with caron, U0160 ISOlat2
š &scaron &#353; Latin small letter s with caron, U0161 ISOlat2
Ÿ &Yuml &#376; Latin capital letter y with diaeresis, U0178 ISOlat2
Spacing Modifier Letters
ˆ &circ &#710; modifier letter circumflex accent, U02C6 ISOpub
˜ &tilde &#732; small tilde, U02DC ISOdia
General Punctuation
  &ensp &#8194; en space, U2002 ISOpub
  &emsp &#8195; em space, U2003 ISOpub
  &thinsp &#8201; thin space, U2009 ISOpub
&zwnj &#8204; zero width non-joiner, U200C NEW RFC 2070
&zwj &#8205; zero width joiner, U200D NEW RFC 2070
&lrm &#8206; left-to-right mark, U200E NEW RFC 2070
&rlm &#8207; right-to-left mark, U200F NEW RFC 2070
&ndash &#8211; en dash, U2013 ISOpub
—  &mdash &#151; em dash, U2014 ISOpub
&lsquo &#8216; left single quotation mark, U2018 ISOnum
&rsquo &#8217; right single quotation mark, U2019 ISOnum
&sbquo &#8218; single low-9 quotation mark, U201A NEW
&ldquo &#8220; left double quotation mark, U201C ISOnum
&rdquo &#8221; right double quotation mark, U201D ISOnum
&bdquo &#8222; double low-9 quotation mark, U201E NEW
&dagger &#8224; dagger, U2020 ISOpub
&Dagger &#8225; double dagger, U2021 ISOpub
&permil &#8240; per mille sign, U2030 ISOtech
< &lsaquo &#8249; single left-pointing angle quotation mark, U2039 ISO proposed
> &rsaquo &#8250; single right-pointing angle quotation mark, U203A ISO proposed

ShowCharacter Set Recognition

Microsoft Internet Explorer uses the character set specified for a document to determine how to translate the bytes in the document into characters on the screen or on paper. By default, Internet Explorer uses the character set specified in the HTTP content type returned by the server to determine this translation. If this parameter is not given, Internet Explorer uses the character set specified by the META element in the document. It uses the user's preferences if no META element is given.

You can use the META element to explicitly set the character set for a document. In this case, you set the HTTP-EQUIV= attribute to "Content-Type" and specify a character set identifier in the CONTENT= attribute. For example, the following META element identifies windows-1251 as the character set for the document.

<META HTTP-EQUIV="Content-Type"
  CONTENT="text/html; CHARSET=windows-1251">

As long as you place the META element before the BODY element, it affects the whole document, including the TITLE element. For clarity, it should appear as the first element after HEAD so that all readers know the encoding before the first element that can be displayed is parsed. Note that the META element applies to the document containing it. This means, for example, that a compound document (a document consisting of two or more documents in a set of frames) can use different character sets in different frames.

The following table contains information concerning the character sets supported by Internet Explorer 5. The information provided is:

  1. Display Name— the name used to refer to the character set.
  2. Preferred Charset ID— the most common identifier used to set character sets in Internet Explorer. For example, in the previous code sample windows-1251 is the Charset ID.
  3. Additional Aliases— other identifiers that may be used to set character sets.
  4. MLang Code Pages— numeric value of the code pages used by the Internet Explorer MLang API.
  5. Supported by Version— the versions of Internet Explorer that support the listed character sets.
  6. Note   CS indicates that the version of Internet Explorer must support complex scripts such as Arabic, Hebrew, or Thai.

Charsets in Microsoft Internet Explorer 5

Display Name Preferred Charset ID Additional Aliases MLang Code Page Supported by Versions
Arabic ASMO-708 ASMO-708 708 4CS, 5
Arabic (DOS) DOS-720 720 4CS, 5
Arabic (ISO) iso-8859-6 ISO_8859-6:1987, iso-ir-127, ISO_8859-6, ECMA-114, arabic, csISOLatinArabic 28596 4CS, 5
Arabic (Windows) windows-1256 1256 4CS, 5
Baltic (ISO) iso-8859-4 csISOLatin4, iso-ir-110, ISO_8859-4, ISO_8859-4:1988, l4, latin4 28594 4, 5
Baltic (Windows) Windows-1257 1257 4, 5
Central European (DOS) ibm852 cp852 852 4, 5
Central European (ISO) iso-8859-2 csISOLatin2, iso-ir-101, iso8859-2, iso_8859-2, iso_8859-2:1987, l2, latin2 28592 3, 4, 5
Central European (Windows) windows-1250 x-cp1250 1250 3, 4, 5
Chinese Simplified (GB2312) gb2312 chinese, csGB2312, csISO58GB23128, GB2312, GBK, GB_2312-80, iso-ir-58 936 3, 4, 5
Chinese Simplified (HZ) hz-gb-2312 52936 4, 5
Chinese Traditional big5 csbig5, x-x-big5 950 3, 4, 5
Cyrillic (DOS) cp866 ibm866 866 4, 5
Cyrillic (ISO) iso-8859-5 csISOLatinCyrillic, cyrillic, iso-ir-144, ISO_8859-5, ISO_8859-5:1988 28595 4, 5
Cyrillic (KOI8-R) koi8-r csKOI8R, koi 20866 3, 4, 5
Cyrillic (Windows) windows-1251 x-cp1251 1251 3, 4, 5
Greek (ISO) iso-8859-7 csISOLatinGreek, ECMA-118, ELOT_928, greek, greek8, iso-ir-126, ISO_8859-7, ISO_8859-7:1987 28597 3, 4, 5
Greek (Windows) Windows-1253 windows-1253 1253 5
Hebrew (DOS) DOS-862 862 4CS, 5
Hebrew (ISO) iso-8859-8 csISOLatinHebrew, hebrew, iso-ir-138, ISO_8859-8, visual, ISO-8859-8 Visual 28598 4CS, 5
Hebrew (Windows) windows-1255 logical, ISO_8859-8:1988, iso-ir-138 1255 3CS, 4CS, 5
Japanese (JIS) iso-2022-jp csISO2022JP 50220 4, 5
Japanese (JIS-Allow 1-byte Kana) csISO2022JP iso-2022-jp 50221 4, 5
Japanese (JIS-Allow 1-byte Kana - SO/SI) iso-2022-jp csISO2022JP 50222 3, 4, 5
Japanese (EUC) euc-jp csEUCPkdFmtJapanese, Extended_UNIX_Code_Packed_
Format_for_Japanese, x-euc, x-euc-jp
51932 3, 4, 5
Japanese (Shift-JIS) shift_jis csShiftJIS, csWindows31J, ms_Kanji, shift-jis, x-ms-cp932, x-sjis 932 3, 4, 5
Korean ks_c_5601-1987 csKSC56011987, euc-kr, korean, ks_c_5601 949 3, 4, 5
Korean (ISO) iso-2022-kr csISO2022KR 50225 3, 4, 5
Latin 3 (ISO) iso-8859-3 28593 4, 5
Thai (Windows) iso-8859-11 windows-874 874 3, 4, 5
Turkish (Windows) Windows-1254 windows-1254 1254 3, 4, 5
Turkish (ISO) iso-8859-9 csISOLatin5, ISO_8859-9, ISO_8859-9:1989, iso-ir-148, l5, latin5 28599 3, 4, 5
Ukrainian (KOI8-U) koi8-u 21866 4, 5
Unicode (UTF-7) utf-7 csUnicode11UTF7, unicode-1-1-utf-7, x-unicode-2-0-utf-7 65000 4, 5
Unicode (UFT-8) utf-8 unicode-1-1-utf-8, unicode-2-0-utf-8, x-unicode-2-0-utf-8 65001 4, 5
Vietnamese (Windows) windows-1258 1258 3, 4, 5
Western European (Windows) Windows-1252 1252 5
Western European (ISO) iso-8859-1 ANSI_X3.4-1968, ANSI_X3.4-1986, ascii, cp367, cp819, csASCII, IBM367, ibm819, iso-ir-100, iso-ir-6, ISO646-US, iso8859-1, ISO_646.irv:1991, iso_8859-1, iso_8859-1:1987, latin1, us, us-ascii, x-ansi 1252 3, 4, 5

Nonstandard Charsets with Special Meaning Inside Internet Explorer and MLang

These character sets are not to be used for labeling documents.

Display Name Preferred Charset ID Additional Aliases MLang Code Page Supported by Versions
Japanese (Auto Select) _autodetect 50932 3, 4, 5
Korean (Auto Select) _autodetect_kr 50949 4, 5
Unicode unicode 1200 4, 5
Unicode (BigEndian) unicodeFEFF 1201 4, 5
User Defined x-user-defined 50000 4, 5