Latin-1 Supplement (Unicode block)
The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). Controls C1 (0080–009F) are not graphic. This block ranges from U+0080 to U+00FF, contains 128 characters and includes the C1 controls, Latin-1 punctuation and symbols, 30 pairs of majuscule and minuscule accented Latin characters and 2 mathematical operators.
C1 Controls and Latin-1 Supplement | |
---|---|
Range | U+0080..U+00FF (128 code points) |
Plane | BMP |
Scripts | Latin (64 char.) Common (64 char.) |
Major alphabets | French German Icelandic Spanish |
Symbol sets | Punctuation Mathematics Currency |
Assigned | 128 code points 33 Control or Format |
Unused | 0 reserved code points |
Source standards | ISO/IEC 8859-1 |
Unicode version history | |
1.0.0 | 128 (+128) |
Note: [1][2] |
The C1 controls and Latin-1 Supplement block has been included in its present form, with the same character repertoire since version 1.0 of the Unicode Standard.[3] Its block name in Unicode 1.0 was simply Latin1.[4]
Character table
Subheadings
The C1 Controls and Latin-1 Supplement block has four subheadings within its character collection: C1 controls, Latin-1 Punctuation and Symbols, Letters, and Mathematical operator(s).[5]
C1 controls
The C1 controls subheading contains 32 supplementary control codes inherited from ISO/IEC 8859-1 and many other 8-bit character standards. The alias names for the C0 and C1 control codes are taken from ISO/IEC 6429:1992.[5]
Latin-1 punctuation and symbols
The Latin-1 Punctuation and Symbols subheading contains 32 characters of common international punctuation characters, such as inverted exclamation and question marks, and a middle dot; and symbols like currency signs, spacing diacritic marks, vulgar fraction, and superscript numbers.[5]
Letters
The Letters subheading contains 30 pairs of majuscule and minuscule accented or novel Latin characters for western European languages, and two extra minuscule characters not commonly used word-initially.[5]
Mathematical operator
The Mathematical operator subheading is used for the multiplication and division signs.[5]
Number of symbols, letters and control codes
The table below shows the number of each letters, symbols and control codes in each subheadings in the C1 Controls and Latin-1 Supplement block.
Type of subheading | Number of symbols | Range of characters |
---|---|---|
C1 controls | 32 control codes | U+0080 to U+009F |
Latin-1 punctuation and symbols | 32 punctuation and symbols | U+00A0 to U+00BF |
Letters | 30 pairs of majuscule and minuscule accented Latin characters | U+00C0 to U+00D6, U+00D8 to U+00F6 and U+00F8 to U+00FF |
Mathematical operators | The U+00D7 × MULTIPLICATION SIGN and U+00F7 ÷ DIVISION SIGN symbols. | U+00D7 and U+00F7 |
Compact table
C1 Controls and Latin-1 Supplement[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+008x | XXX | XXX | BPH | NBH | IND | NEL | SSA | ESA | HTS | HTJ | VTS | PLD | PLU | RI | SS2 | SS3 |
U+009x | DCS | PU1 | PU2 | STS | CCH | MW | SPA | EPA | SOS | XXX | SCI | CSI | ST | OSC | PM | APC |
U+00Ax | NB SP |
¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY |
® | ¯ |
U+00Bx | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
U+00Cx | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
U+00Dx | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
U+00Ex | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
U+00Fx | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
Notes
|
Emoji
The Latin-1 Supplement block contains two emoji: U+00A9 and U+00AE.[6][7]
The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.[8]
U+ | 00A9 | 00AE |
base code point | © | ® |
base+VS15 (text) | ©︎ | ®︎ |
base+VS16 (emoji) | ©️ | ®️ |
History
The following Unicode-related documents record the purpose and process of defining specific characters in the Latin-1 Supplement block:
Version | Final code points[lower-alpha 1] | Count | L2 ID | WG2 ID | Document |
---|---|---|---|---|---|
1.0.0 | U+0080..009F | 32 | X3L2/95-002 | PDAM No. 3 to ISO/IEC 10646-1 on coding of C1 controls, 1994-11-01 | |
X3L2/95-028 | N1148 | Nine tables of replies to repeated/extended votes, 1995-02-22 | |||
N1203 | Umamaheswaran, V. S.; Ksar, Mike (1995-05-03), "5.3", Unconfirmed minutes of SC2/WG2 Meeting 27, Geneva | ||||
X3L2/95-061 | DAM no.3 to ISO/IEC 10646-1 (Coding of C1 controls), 1995-06-01 | ||||
N1307 | Table of replies to JTC1 letter ballot on 10646 DAM 3, Coding of C1 Controls, (SC2 N 2666), 1996-01-15 | ||||
N1309 | Paterson, Bruce (1996-01-17), Report and Disposition of Comments on DAM 1, UTF 16 and DAM 2, UTF-8, DAM 3, Coding of C1 Controls, and DAM 4, Removal of Annex G: UTF1 | ||||
N1312 | Paterson, Bruce (1996-01-17), Draft Final Text of 10646 AMD-3, Coding of C1 Controls | ||||
L2/99-048 | Umamaheswaran, V. S. (1999-02-04), C1 controls in the code charts | ||||
L2/99-054R | Aliprand, Joan (1999-06-21), "C1 Controls", Approved Minutes from the UTC/L2 meeting in Palo Alto, February 3-5, 1999 | ||||
N3046 | Suignard, Michel (2006-02-22), Improving formal definition for control characters | ||||
N3103 (pdf, doc) | Umamaheswaran, V. S. (2006-08-25), "M48.33", Unconfirmed minutes of WG 2 meeting 48, Mountain View, CA, USA; 2006-04-24/27 | ||||
U+00A0..00FF | 96 | (to be determined) | |||
X3L2/94-077 | N994 | Davis, Mark (1994-03-03), ISO/IEC 10646-1 - Proposed Draft Corrigendum 1 | |||
X3L2/94-098 | N1033 (pdf, doc) | Umamaheswaran, V. S.; Ksar, Mike (1994-06-01), "8.1.15", Unconfirmed Minutes of ISO/IEC JTC 1/SC 2/WG 2 Meeting 25, Falez Hotel, Antalya, Turkey, 1994-04-18--22 | |||
L2/11-016 | Moore, Lisa (2011-02-15), "Correct mistakes in property assignments for super and subscripted letters (B.13.4) [U+00AA, U+00BA]", UTC #126 / L2 #223 Minutes | ||||
L2/11-116 | Moore, Lisa (2011-05-17), "Consensus 127-C14", UTC #127 / L2 #224 Minutes, Change the general category of to U+00AA FEMININE ORDINAL INDICATOR and U+00BA MASCULINE ORDINAL INDICATOR "Lo" for Unicode 6.1. | ||||
L2/11-261R2 | Moore, Lisa (2011-08-16), "Consensus 128-C6", UTC #128 / L2 #225 Minutes, Change the general category from "So" to "Po" ... [U+00A7 and U+00B6] | ||||
L2/15-050R[lower-alpha 2][lower-alpha 3] | Davis, Mark; et al. (2015-01-29), Additional variation selectors for emoji | ||||
|
References
- "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
- "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
- The Unicode Standard Version 1.0, Volume 1. Addison-Wesley Publishing Company, Inc. 1991 [1990]. ISBN 0-201-56788-1.
- "3.8: Block-by-Block Charts" (PDF). The Unicode Standard. version 1.0. Unicode Consortium.
- "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013.
- "UTR #51: Unicode Emoji". Unicode Consortium. 2020-02-11.
- "UCD: Emoji Data for UTR #51". Unicode Consortium. 2020-01-28.
- "UTS #51 Emoji Variation Sequences". The Unicode Consortium.