Published on May 25, 2016
C0 Controls and Basic Latin
Unicode range: 0000–007F
At the time of publication, the current Unicode standard was Unicode 8.0.
Condensed Character Code Chart
0000 | 0001 | 0002 | 0003 | 0004 | 0005 | 0006 | 0007 | 0008 | 0009 | 000A | 000B | 000C | 000D | 000E | 000F |
0010 | 0011 | 0012 | 0013 | 0014 | 0015 | 0016 | 0017 | 0018 | 0019 | 001A | 001B | 001C | 001D | 001E | 001F |
0020 | ! 0021 | " 0022 | # 0023 | $ 0024 | % 0025 | & 0026 | ' 0027 | ( 0028 | ) 0029 | * 002A | + 002B | , 002C | - 002D | . 002E | / 002F |
0 0030 | 1 0031 | 2 0032 | 3 0033 | 4 0034 | 5 0035 | 6 0036 | 7 0037 | 8 0038 | 9 0039 | : 003A | ; 003B | < 003C | = 003D | > 003E | ? 003F |
@ 0040 | A 0041 | B 0042 | C 0043 | D 0044 | E 0045 | F 0046 | G 0047 | H 0048 | I 0049 | J 004A | K 004B | L 004C | M 004D | N 004E | O 004F |
P 0050 | Q 0051 | R 0052 | S 0053 | T 0054 | U 0055 | V 0056 | W 0057 | X 0058 | Y 0059 | Z 005A | [ 005B | \ 005C | ] 005D | ^ 005E | _ 005F |
` 0060 | a 0061 | b 0062 | c 0063 | d 0064 | e 0065 | f 0066 | g 0067 | h 0068 | i 0069 | j 006A | k 006B | l 006C | m 006D | n 006E | o 006F |
p 0070 | q 0071 | r 0072 | s 0073 | t 0074 | u 0075 | v 0076 | w 0077 | x 0078 | y 0079 | z 007A | { 007B | | 007C | } 007D | ~ 007E | 007F |
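The chart above lists every code point in the block, with a glyph where one exists. As a rough sketch, a chart like this can be regenerated programmatically. The snippet assumes Python 3; `condensed_chart` is a hypothetical helper name and the 16-cells-per-row layout is an assumption, not something taken from the original page.

```python
def condensed_chart(start=0x00, stop=0x7F, per_row=16):
    """Return chart rows for the code points start..stop (inclusive)."""
    rows = []
    cells = []
    for cp in range(start, stop + 1):
        ch = chr(cp)
        code = f"{cp:04X}"
        # Control codes (0000-001F, 007F) and SPACE have no visible glyph,
        # so only the code is shown for them.
        if ch.isprintable() and ch != " ":
            cells.append(f"{ch} {code}")
        else:
            cells.append(code)
        if len(cells) == per_row:
            rows.append(" | ".join(cells) + " |")
            cells = []
    if cells:
        # Flush a final, partially filled row.
        rows.append(" | ".join(cells) + " |")
    return rows


for row in condensed_chart():
    print(row)
```

The four-digit uppercase hexadecimal form (`{cp:04X}`) matches the way code points are written throughout this page.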
Unicode Character Code Chart
C0 Controls | |||
---|---|---|---|
Alias names are those for ISO/IEC 6429:1992. Commonly used alternative aliases are also shown. | |||
Glyph | Dec | Hex | Unicode Name |
| 0 | 0000 | <control> = NULL Ⓣ C0 or C1 control code |
| 1 | 0001 | <control> = START OF HEADING Ⓣ C0 or C1 control code |
| 2 | 0002 | <control> = START OF TEXT Ⓣ C0 or C1 control code |
| 3 | 0003 | <control> = END OF TEXT Ⓣ C0 or C1 control code |
| 4 | 0004 | <control> = END OF TRANSMISSION Ⓣ C0 or C1 control code |
| 5 | 0005 | <control> = ENQUIRY Ⓣ C0 or C1 control code |
| 6 | 0006 | <control> = ACKNOWLEDGE Ⓣ C0 or C1 control code |
| 7 | 0007 | <control> = BELL Ⓣ C0 or C1 control code |
| 8 | 0008 | <control> = BACKSPACE Ⓣ C0 or C1 control code |
| 9 | 0009 | <control> = CHARACTER TABULATION = horizontal tabulation (HT), tab Ⓣ C0 or C1 control code |
| 10 | 000A | <control> = LINE FEED (LF) = new line (NL), end of line (EOL) Ⓣ C0 or C1 control code |
| 11 | 000B | <control> = LINE TABULATION = vertical tabulation (VT) Ⓣ C0 or C1 control code |
| 12 | 000C | <control> = FORM FEED (FF) Ⓣ C0 or C1 control code |
| 13 | 000D | <control> = CARRIAGE RETURN (CR) Ⓣ C0 or C1 control code |
| 14 | 000E | <control> = SHIFT OUT • known as LOCKING-SHIFT ONE in 8-bit environments Ⓣ C0 or C1 control code |
| 15 | 000F | <control> = SHIFT IN • known as LOCKING-SHIFT ZERO in 8-bit environments Ⓣ C0 or C1 control code |
| 16 | 0010 | <control> = DATA LINK ESCAPE Ⓣ C0 or C1 control code |
| 17 | 0011 | <control> = DEVICE CONTROL ONE Ⓣ C0 or C1 control code |
| 18 | 0012 | <control> = DEVICE CONTROL TWO Ⓣ C0 or C1 control code |
| 19 | 0013 | <control> = DEVICE CONTROL THREE Ⓣ C0 or C1 control code |
| 20 | 0014 | <control> = DEVICE CONTROL FOUR Ⓣ C0 or C1 control code |
| 21 | 0015 | <control> = NEGATIVE ACKNOWLEDGE Ⓣ C0 or C1 control code |
| 22 | 0016 | <control> = SYNCHRONOUS IDLE Ⓣ C0 or C1 control code |
| 23 | 0017 | <control> = END OF TRANSMISSION BLOCK Ⓣ C0 or C1 control code |
| 24 | 0018 | <control> = CANCEL Ⓣ C0 or C1 control code |
| 25 | 0019 | <control> = END OF MEDIUM Ⓣ C0 or C1 control code |
| 26 | 001A | <control> = SUBSTITUTE → FFFD replacement character Ⓣ C0 or C1 control code |
| 27 | 001B | <control> = ESCAPE Ⓣ C0 or C1 control code |
| 28 | 001C | <control> = INFORMATION SEPARATOR FOUR = file separator (FS) Ⓣ C0 or C1 control code |
| 29 | 001D | <control> = INFORMATION SEPARATOR THREE = group separator (GS) Ⓣ C0 or C1 control code |
| 30 | 001E | <control> = INFORMATION SEPARATOR TWO = record separator (RS) Ⓣ C0 or C1 control code |
| 31 | 001F | <control> = INFORMATION SEPARATOR ONE = unit separator (US) Ⓣ C0 or C1 control code |
ASCII Punctuation and Symbols | |||
Based on ISO/IEC 646. | |||
Glyph | Dec | Hex | Unicode Name |
| 32 | 0020 | SPACE • sometimes considered a control code • other space characters: 2000–200A → 00A0 no-break space → 200B zero width space → 2060 word joiner → 3000 ideographic space → FEFF zero width no-break space Ⓣ space character (non-zero width) |
! | 33 | 0021 | EXCLAMATION MARK = factorial = bang → 00A1 ¡ inverted exclamation mark → 01C3 ǃ latin letter retroflex click → 203C ‼ double exclamation mark → 203D ‽ interrobang → 2762 ❢ heavy exclamation mark ornament Ⓣ punctuation mark of other type |
" | 34 | 0022 | QUOTATION MARK • neutral (vertical), used as opening or closing quotation mark • preferred characters in English for paired quotation marks are 201C “ & 201D ” • 05F4 ״ is preferred for gershayim when writing Hebrew → 02BA ʺ modifier letter double prime → 030B $̋ combining double acute accent → 030E $̎ combining double vertical line above → 05F4 ״ hebrew punctuation gershayim → 2033 ″ double prime → 3003 〃 ditto mark Ⓣ punctuation mark of other type |
# | 35 | 0023 | NUMBER SIGN = pound sign, hash, crosshatch, octothorpe → 2114 ℔ l b bar symbol → 2317 ⌗ viewdata square → 266F ♯ music sharp sign ⁓ 0023 FE0E text style ⁓ 0023 FE0F emoji style Ⓣ punctuation mark of other type |
$ | 36 | 0024 | DOLLAR SIGN = milréis, escudo • used for many peso currencies in Latin America and elsewhere • glyph may have one or two vertical bars • other currency symbol characters start at 20A0 ₠ → 00A4 ¤ currency sign → 20B1 ₱ peso sign → 1F4B2 heavy dollar sign Ⓣ currency symbol |
% | 37 | 0025 | PERCENT SIGN → 066A arabic percent sign → 2030 ‰ per mille sign → 2031 ‱ per ten thousand sign → 2052 ⁒ commercial minus sign Ⓣ punctuation mark of other type |
& | 38 | 0026 | AMPERSAND → 204A ⁊ tironian sign et → 214B ⅋ turned ampersand → 1F674 heavy ampersand ornament Ⓣ punctuation mark of other type |
' | 39 | 0027 | APOSTROPHE = apostrophe-quote (1.0) = APL quote • neutral (vertical) glyph with mixed usage • 2019 ’ is preferred for apostrophe • preferred characters in English for paired quotation marks are 2018 ‘ & 2019 ’ • 05F3 ׳ is preferred for geresh when writing Hebrew → 02B9 ʹ modifier letter prime → 02BC ʼ modifier letter apostrophe → 02C8 ˈ modifier letter vertical line → 0301 $́ combining acute accent → 05F3 ׳ hebrew punctuation geresh → 2032 ′ prime → A78C ꞌ latin small letter saltillo Ⓣ punctuation mark of other type |
( | 40 | 0028 | LEFT PARENTHESIS = opening parenthesis (1.0) Ⓣ opening punctuation mark (of a pair) |
) | 41 | 0029 | RIGHT PARENTHESIS = closing parenthesis (1.0) • see discussion on semantics of paired bracketing characters Ⓣ closing punctuation mark (of a pair) |
* | 42 | 002A | ASTERISK = star (on phone keypads) → 066D arabic five pointed star → 204E ⁎ low asterisk → 2217 ∗ asterisk operator → 26B9 ⚹ sextile → 2731 ✱ heavy asterisk Ⓣ punctuation mark of other type |
+ | 43 | 002B | PLUS SIGN → 2795 ➕ heavy plus sign Ⓣ mathematical symbol |
, | 44 | 002C | COMMA = decimal separator → 060C arabic comma → 201A ‚ single low-9 quotation mark → 2E41 ⹁ reversed comma → 3001 、 ideographic comma Ⓣ punctuation mark of other type |
- | 45 | 002D | HYPHEN-MINUS = hyphen or minus sign • used for either hyphen or minus sign → 2010 ‐ hyphen → 2011 non-breaking hyphen → 2012 ‒ figure dash → 2013 – en dash → 2043 ⁃ hyphen bullet → 2212 − minus sign → 10191 roman uncia sign Ⓣ dash or hyphen punctuation mark |
. | 46 | 002E | FULL STOP = period, dot, decimal point • may be rendered as a raised decimal point in old style numbers → 06D4 arabic full stop → 2E3C ⸼ stenographic full stop → 3002 。 ideographic full stop Ⓣ punctuation mark of other type |
/ | 47 | 002F | SOLIDUS = slash, virgule → 01C0 ǀ latin letter dental click → 0338 $̸ combining long solidus overlay → 2044 ⁄ fraction slash → 2215 ∕ division slash Ⓣ punctuation mark of other type |
ASCII Digits | |||
Glyph | Dec | Hex | Unicode Name |
0 | 48 | 0030 | DIGIT ZERO ⁓ 0030 FE0E text style ⁓ 0030 FE0F emoji style Ⓣ decimal digit |
1 | 49 | 0031 | DIGIT ONE ⁓ 0031 FE0E text style ⁓ 0031 FE0F emoji style Ⓣ decimal digit |
2 | 50 | 0032 | DIGIT TWO ⁓ 0032 FE0E text style ⁓ 0032 FE0F emoji style Ⓣ decimal digit |
3 | 51 | 0033 | DIGIT THREE ⁓ 0033 FE0E text style ⁓ 0033 FE0F emoji style Ⓣ decimal digit |
4 | 52 | 0034 | DIGIT FOUR ⁓ 0034 FE0E text style ⁓ 0034 FE0F emoji style Ⓣ decimal digit |
5 | 53 | 0035 | DIGIT FIVE ⁓ 0035 FE0E text style ⁓ 0035 FE0F emoji style Ⓣ decimal digit |
6 | 54 | 0036 | DIGIT SIX ⁓ 0036 FE0E text style ⁓ 0036 FE0F emoji style Ⓣ decimal digit |
7 | 55 | 0037 | DIGIT SEVEN ⁓ 0037 FE0E text style ⁓ 0037 FE0F emoji style Ⓣ decimal digit |
8 | 56 | 0038 | DIGIT EIGHT ⁓ 0038 FE0E text style ⁓ 0038 FE0F emoji style Ⓣ decimal digit |
9 | 57 | 0039 | DIGIT NINE ⁓ 0039 FE0E text style ⁓ 0039 FE0F emoji style Ⓣ decimal digit |
ASCII Punctuation and Symbols | |||
Glyph | Dec | Hex | Unicode Name |
: | 58 | 003A | COLON • also used to denote division or scale; for that mathematical use 2236 ∶ is preferred → 0589 ։ armenian full stop → 05C3 ׃ hebrew punctuation sof pasuq → 2236 ∶ ratio → A789 ꞉ modifier letter colon Ⓣ punctuation mark of other type |
; | 59 | 003B | SEMICOLON • this, and not 037E ;, is the preferred character for 'Greek question mark' → 037E ; greek question mark → 061B arabic semicolon → 204F ⁏ reversed semicolon Ⓣ punctuation mark of other type |
< | 60 | 003C | LESS-THAN SIGN → 2039 ‹ single left-pointing angle quotation mark → 2329 〈 left-pointing angle bracket → 27E8 ⟨ mathematical left angle bracket → 3008 〈 left angle bracket Ⓣ mathematical symbol |
= | 61 | 003D | EQUALS SIGN • other related characters: 2241 ≁–2263 ≣ → 2260 ≠ not equal to → 2261 ≡ identical to → A78A ꞊ modifier letter short equals sign → 10190 roman sextans sign Ⓣ mathematical symbol |
> | 62 | 003E | GREATER-THAN SIGN → 203A › single right-pointing angle quotation mark → 232A 〉 right-pointing angle bracket → 27E9 ⟩ mathematical right angle bracket → 3009 〉 right angle bracket Ⓣ mathematical symbol |
? | 63 | 003F | QUESTION MARK → 00BF ¿ inverted question mark → 037E ; greek question mark → 061F arabic question mark → 203D ‽ interrobang → 2048 ⁈ question exclamation mark → 2049 ⁉ exclamation question mark Ⓣ punctuation mark of other type |
@ | 64 | 0040 | COMMERCIAL AT = at sign Ⓣ punctuation mark of other type |
Uppercase Latin Alphabet | |||
Glyph | Dec | Hex | Unicode Name |
A | 65 | 0041 | LATIN CAPITAL LETTER A Ⓣ uppercase letter |
B | 66 | 0042 | LATIN CAPITAL LETTER B → 212C ℬ script capital b Ⓣ uppercase letter |
C | 67 | 0043 | LATIN CAPITAL LETTER C → 2102 ℂ double-struck capital c → 212D ℭ black-letter capital c Ⓣ uppercase letter |
D | 68 | 0044 | LATIN CAPITAL LETTER D Ⓣ uppercase letter |
E | 69 | 0045 | LATIN CAPITAL LETTER E → 2107 ℇ euler constant → 2130 ℰ script capital e Ⓣ uppercase letter |
F | 70 | 0046 | LATIN CAPITAL LETTER F → 2131 ℱ script capital f → 2132 Ⅎ turned capital f Ⓣ uppercase letter |
G | 71 | 0047 | LATIN CAPITAL LETTER G Ⓣ uppercase letter |
H | 72 | 0048 | LATIN CAPITAL LETTER H → 210B ℋ script capital h → 210C ℌ black-letter capital h → 210D ℍ double-struck capital h Ⓣ uppercase letter |
I | 73 | 0049 | LATIN CAPITAL LETTER I • Turkish and Azerbaijani use 0131 ı for lowercase → 0130 İ latin capital letter i with dot above → 0406 І cyrillic capital letter byelorussian-ukrainian i → 04C0 Ӏ cyrillic letter palochka → 2110 ℐ script capital i → 2111 ℑ black-letter capital i → 2160 Ⅰ roman numeral one Ⓣ uppercase letter |
J | 74 | 004A | LATIN CAPITAL LETTER J Ⓣ uppercase letter |
K | 75 | 004B | LATIN CAPITAL LETTER K → 212A K kelvin sign Ⓣ uppercase letter |
L | 76 | 004C | LATIN CAPITAL LETTER L → 2112 ℒ script capital l Ⓣ uppercase letter |
M | 77 | 004D | LATIN CAPITAL LETTER M → 2133 ℳ script capital m Ⓣ uppercase letter |
N | 78 | 004E | LATIN CAPITAL LETTER N → 2115 ℕ double-struck capital n Ⓣ uppercase letter |
O | 79 | 004F | LATIN CAPITAL LETTER O Ⓣ uppercase letter |
P | 80 | 0050 | LATIN CAPITAL LETTER P → 2119 ℙ double-struck capital p Ⓣ uppercase letter |
Q | 81 | 0051 | LATIN CAPITAL LETTER Q → 211A ℚ double-struck capital q Ⓣ uppercase letter |
R | 82 | 0052 | LATIN CAPITAL LETTER R → 211B ℛ script capital r → 211C ℜ black-letter capital r → 211D ℝ double-struck capital r Ⓣ uppercase letter |
S | 83 | 0053 | LATIN CAPITAL LETTER S Ⓣ uppercase letter |
T | 84 | 0054 | LATIN CAPITAL LETTER T Ⓣ uppercase letter |
U | 85 | 0055 | LATIN CAPITAL LETTER U Ⓣ uppercase letter |
V | 86 | 0056 | LATIN CAPITAL LETTER V → 2164 Ⅴ roman numeral five Ⓣ uppercase letter |
W | 87 | 0057 | LATIN CAPITAL LETTER W Ⓣ uppercase letter |
X | 88 | 0058 | LATIN CAPITAL LETTER X Ⓣ uppercase letter |
Y | 89 | 0059 | LATIN CAPITAL LETTER Y Ⓣ uppercase letter |
Z | 90 | 005A | LATIN CAPITAL LETTER Z → 2124 ℤ double-struck capital z → 2128 ℨ black-letter capital z Ⓣ uppercase letter |
ASCII Punctuation and Symbols | |||
Glyph | Dec | Hex | Unicode Name |
[ | 91 | 005B | LEFT SQUARE BRACKET = opening square bracket (1.0) • other bracket characters: 27E6 ⟦–27EB ⟫, 2983 ⦃–2998 ⦘, 3008 〈–301B 〛 Ⓣ opening punctuation mark (of a pair) |
\ | 92 | 005C | REVERSE SOLIDUS = backslash → 20E5 ⃥ combining reverse solidus overlay → 2216 ∖ set minus Ⓣ punctuation mark of other type |
] | 93 | 005D | RIGHT SQUARE BRACKET = closing square bracket (1.0) Ⓣ closing punctuation mark (of a pair) |
^ | 94 | 005E | CIRCUMFLEX ACCENT • this is a spacing character → 02C4 ˄ modifier letter up arrowhead → 02C6 ˆ modifier letter circumflex accent → 0302 $̂ combining circumflex accent → 2038 ‸ caret → 2303 ⌃ up arrowhead Ⓣ non-letterlike modifier symbol |
_ | 95 | 005F | LOW LINE = spacing underscore (1.0) • this is a spacing character → 02CD ˍ modifier letter low macron → 0331 $̱ combining macron below → 0332 $̲ combining low line → 2017 ‗ double low line Ⓣ connecting punctuation mark |
` | 96 | 0060 | GRAVE ACCENT • this is a spacing character → 02CB ˋ modifier letter grave accent → 0300 $̀ combining grave accent → 2035 ‵ reversed prime Ⓣ non-letterlike modifier symbol |
Lowercase Latin Alphabet | |||
Glyph | Dec | Hex | Unicode Name |
a | 97 | 0061 | LATIN SMALL LETTER A Ⓣ lowercase letter |
b | 98 | 0062 | LATIN SMALL LETTER B Ⓣ lowercase letter |
c | 99 | 0063 | LATIN SMALL LETTER C Ⓣ lowercase letter |
d | 100 | 0064 | LATIN SMALL LETTER D Ⓣ lowercase letter |
e | 101 | 0065 | LATIN SMALL LETTER E → 212E ℮ estimated symbol → 212F ℯ script small e Ⓣ lowercase letter |
f | 102 | 0066 | LATIN SMALL LETTER F Ⓣ lowercase letter |
g | 103 | 0067 | LATIN SMALL LETTER G → 0261 ɡ latin small letter script g → 210A ℊ script small g Ⓣ lowercase letter |
h | 104 | 0068 | LATIN SMALL LETTER H → 04BB һ cyrillic small letter shha → 210E ℎ planck constant Ⓣ lowercase letter |
i | 105 | 0069 | LATIN SMALL LETTER I • Turkish and Azerbaijani use 0130 İ for uppercase → 0131 ı latin small letter dotless i → 1D6A4 mathematical italic small dotless i Ⓣ lowercase letter |
j | 106 | 006A | LATIN SMALL LETTER J → 0237 ȷ latin small letter dotless j → 1D6A5 mathematical italic small dotless j Ⓣ lowercase letter |
k | 107 | 006B | LATIN SMALL LETTER K Ⓣ lowercase letter |
l | 108 | 006C | LATIN SMALL LETTER L → 2113 ℓ script small l → 1D4C1 mathematical script small l Ⓣ lowercase letter |
m | 109 | 006D | LATIN SMALL LETTER M Ⓣ lowercase letter |
n | 110 | 006E | LATIN SMALL LETTER N → 207F ⁿ superscript latin small letter n Ⓣ lowercase letter |
o | 111 | 006F | LATIN SMALL LETTER O → 2134 ℴ script small o Ⓣ lowercase letter |
p | 112 | 0070 | LATIN SMALL LETTER P Ⓣ lowercase letter |
q | 113 | 0071 | LATIN SMALL LETTER Q Ⓣ lowercase letter |
r | 114 | 0072 | LATIN SMALL LETTER R Ⓣ lowercase letter |
s | 115 | 0073 | LATIN SMALL LETTER S Ⓣ lowercase letter |
t | 116 | 0074 | LATIN SMALL LETTER T Ⓣ lowercase letter |
u | 117 | 0075 | LATIN SMALL LETTER U Ⓣ lowercase letter |
v | 118 | 0076 | LATIN SMALL LETTER V Ⓣ lowercase letter |
w | 119 | 0077 | LATIN SMALL LETTER W Ⓣ lowercase letter |
x | 120 | 0078 | LATIN SMALL LETTER X Ⓣ lowercase letter |
y | 121 | 0079 | LATIN SMALL LETTER Y Ⓣ lowercase letter |
z | 122 | 007A | LATIN SMALL LETTER Z → 01B6 ƶ latin small letter z with stroke Ⓣ lowercase letter |
ASCII Punctuation and Symbols | |||
Glyph | Dec | Hex | Unicode Name |
{ | 123 | 007B | LEFT CURLY BRACKET = opening curly bracket (1.0) = left brace Ⓣ opening punctuation mark (of a pair) |
| | 124 | 007C | VERTICAL LINE = vertical bar • used in pairs to indicate absolute value → 01C0 ǀ latin letter dental click → 05C0 ׀ hebrew punctuation paseq → 2223 ∣ divides → 2758 ❘ light vertical bar Ⓣ mathematical symbol |
} | 125 | 007D | RIGHT CURLY BRACKET = closing curly bracket (1.0) = right brace Ⓣ closing punctuation mark (of a pair) |
~ | 126 | 007E | TILDE • this is a spacing character → 02DC ˜ small tilde → 0303 $̃ combining tilde → 2053 ⁓ swung dash → 223C ∼ tilde operator → FF5E ～ fullwidth tilde Ⓣ mathematical symbol |
Control Character | |||
Glyph | Dec | Hex | Unicode Name |
| 127 | 007F | <control> = DELETE Ⓣ C0 or C1 control code |
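The Ⓣ entry in each row corresponds to the character's Unicode general category. As a minimal sketch, the category can be read with Python's standard `unicodedata` module. The `CATEGORY_LABELS` table and the `type_of` helper are hypothetical names introduced here, and pairing this page's wording with the two-letter category abbreviations is an assumption based on the rows above.

```python
import unicodedata

# Wording taken from the Ⓣ entries in the tables above, keyed by the
# two-letter general category abbreviation (assumed pairing).
CATEGORY_LABELS = {
    "Cc": "C0 or C1 control code",
    "Zs": "space character (non-zero width)",
    "Po": "punctuation mark of other type",
    "Sc": "currency symbol",
    "Ps": "opening punctuation mark (of a pair)",
    "Pe": "closing punctuation mark (of a pair)",
    "Sm": "mathematical symbol",
    "Pd": "dash or hyphen punctuation mark",
    "Nd": "decimal digit",
    "Lu": "uppercase letter",
    "Ll": "lowercase letter",
    "Sk": "non-letterlike modifier symbol",
    "Pc": "connecting punctuation mark",
}


def type_of(ch):
    """Return (category abbreviation, label as worded on this page)."""
    code = unicodedata.category(ch)
    return code, CATEGORY_LABELS.get(code, "not used in this block")


for ch in ["\x00", " ", "!", "$", "(", "+", "-", "7", "A", "a", "^", "_", "\x7f"]:
    code, label = type_of(ch)
    print(f"{ord(ch):04X} {code} {label}")
```

Only the thirteen categories that occur in the range 0000–007F are listed; characters outside this block may belong to other categories.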
Key to the Unicode Character Code Charts
Meaning | ||
---|---|---|
Ⓐ | Character name alias. For example, "2118 ℘: SCRIPT CAPITAL P Ⓐ WEIERSTRASS ELLIPTIC FUNCTION" means that, in addition to the character name SCRIPT CAPITAL P, WEIERSTRASS ELLIPTIC FUNCTION is an additional unique name associated with the character. | ※ |
Ⓒ | Compatibility decomposition mapping. For example, "0133 ĳ: LATIN SMALL LIGATURE IJ Ⓒ 0069 i 006A j" means that character 0133 ĳ can be decomposed into characters 0069 i and 006A j for compatibility purposes. | ≈ |
Ⓓ | Canonical decomposition mapping. For example, "00E4 ä: LATIN SMALL LETTER A WITH DIAERESIS Ⓓ 0061 a 0308 ̈" means that character 00E4 ä can be decomposed into characters 0061 a and 0308 ̈. | ~ |
Ⓘ | Informative alias(es). For example, "0023 #: NUMBER SIGN Ⓘ pound sign, hash, crosshatch, octothorpe" means that character 0023 # is also known as pound sign, hash, crosshatch, and octothorpe. | = |
Ⓝ | Informative note. For example, "0024 $: DOLLAR SIGN Ⓝ used for many peso currencies in Latin America and elsewhere Ⓝ glyph may have one or two vertical bars" provides additional information about the character, such as, in this example, where it is used and variations in the glyph. | • |
Ⓣ | Type. Unicode general category. | |
Ⓥ | Standardized variation sequence. For example, "00A9 ©: COPYRIGHT SIGN Ⓥ 00A9 FE0E text style Ⓥ 00A9 FE0F emoji style". See the Unicode FAQ: Variation Sequences for more detail about standardized variation sequences. | ~ |
Ⓧ | Cross-reference. For example, "002D -: HYPHEN-MINUS Ⓧ 2010 ‐ hyphen" means that character 002D - is similar to 2010 ‐. | → |
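The compatibility (Ⓒ) and canonical (Ⓓ) decomposition mappings described in the key can be inspected with Python's standard `unicodedata` module. The sketch below assumes Python 3; `decomposition_kind` is a hypothetical helper name, not part of the original page.

```python
import unicodedata


def decomposition_kind(ch):
    """Classify a character's decomposition mapping as in the key above."""
    mapping = unicodedata.decomposition(ch)
    if not mapping:
        return "none", []
    parts = mapping.split()
    if parts[0].startswith("<"):
        # A leading tag such as <compat> marks a compatibility mapping.
        return "compatibility", parts[1:]
    return "canonical", parts


for ch in ["\u00e4", "\u0133", "#"]:
    kind, parts = decomposition_kind(ch)
    print(f"{ord(ch):04X} {unicodedata.name(ch)}: {kind} {' '.join(parts)}")
```

For the key's examples, 00E4 is reported with the canonical mapping 0061 0308 and 0133 with the compatibility mapping 0069 006A, while 0023 has no decomposition.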