ISO-IR-153
ISO-IR-153[3] (ST SEV 358-88) is an 8-bit character set that covers the Russian and Bulgarian alphabets. Unlike the KOI encodings, it lists the Cyrillic letters in their traditional alphabetical order. It became the basis for ISO/IEC 8859-5 and the Cyrillic Unicode block.
Language(s) | Russian, Bulgarian |
---|---|
Standard | ST SEV 358-88, GOST R 34.303-92 (see below) |
Classification | Extended ASCII |
Based on | Main code page[1] |
Extensions | ISO-8859-5, IBM-1124, ISO-IR-200, ISO-IR-201 |
Preceded by | KOI8-B[2] |
Standards and Naming
ISO-IR-153 is a subset of ISO/IEC 8859-5, which has been synchronised with ECMA-113 since 1988.[4] The ISO-IR-153 documentation cites ST SEV 358-88 as the source standard.[3] It also cites the earlier GOST 19768-74,[3] which defines KOI-8 and which the first version of ECMA-113 (ISO-IR-111) conformed to.[4] This second citation appears to be an error, because ISO-IR-153 does not follow the KOI-8 layout and instead uses a close modification of the letter layout from the Main code page.[1] The ISO-IR-153 encoding was intended to replace GOST 19768-74 and is sometimes referred to as GOST-19768-87.[2][5] This confusion has led to a common misconception that ISO-8859-5 was defined in or based on GOST 19768-74.[1]
Regardless of how accurate these names are, IANA lists GOST_19768-74, ST_SEV_358-88 and iso-ir-153 as labels that may be used for the ISO-IR-153 encoding on the Internet, with reference to RFC 1345, which assigns it those labels.[6][7]
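As a minimal sketch of how software might recognise these labels, the following Python helper matches a charset label against the three names given above. The function and constant names are hypothetical, and the case-insensitive comparison reflects the IANA registry's general rule that charset names are not case-sensitive.

```python
# The three labels named in the paragraph above, stored in lower case.
ISO_IR_153_LABELS = frozenset({"gost_19768-74", "st_sev_358-88", "iso-ir-153"})


def is_iso_ir_153_label(label: str) -> bool:
    """Return True if `label` is one of the listed labels, ignoring case."""
    return label.strip().lower() in ISO_IR_153_LABELS
```

With this helper, `is_iso_ir_153_label("ISO-IR-153")` is true, while a label for the larger superset such as `"iso-8859-5"` is not matched.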
GOST R 34.303-92 includes the ISO-IR-153 code page and dubs it KOI-8 V1 (in addition to using KOI-8 N1 and KOI-8 N2 for two Alternative code page/Code page 866 variants).[8]
Character set
The following table shows the ISO-IR-153 encoding. Each character is shown with its equivalent Unicode code point.
The encoding closely resembles the letter subset of the Cyrillic part of the Main code page, apart from the relocation of the uppercase Ё from 0xF0 to 0xA1. ISO-8859-5 is a superset.
| _0 | _1 | _2 | _3 | _4 | _5 | _6 | _7 | _8 | _9 | _A | _B | _C | _D | _E | _F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
8_ | | | | | | | | | | | | | | | | |
9_ | | | | | | | | | | | | | | | | |
A_ | NBSP 00A0 | Ё 0401 | | | | | | | | | | | | SHY 00AD | | |
B_ | А 0410 | Б 0411 | В 0412 | Г 0413 | Д 0414 | Е 0415 | Ж 0416 | З 0417 | И 0418 | Й 0419 | К 041A | Л 041B | М 041C | Н 041D | О 041E | П 041F |
C_ | Р 0420 | С 0421 | Т 0422 | У 0423 | Ф 0424 | Х 0425 | Ц 0426 | Ч 0427 | Ш 0428 | Щ 0429 | Ъ 042A | Ы 042B | Ь 042C | Э 042D | Ю 042E | Я 042F |
D_ | а 0430 | б 0431 | в 0432 | г 0433 | д 0434 | е 0435 | ж 0436 | з 0437 | и 0438 | й 0439 | к 043A | л 043B | м 043C | н 043D | о 043E | п 043F |
E_ | р 0440 | с 0441 | т 0442 | у 0443 | ф 0444 | х 0445 | ц 0446 | ч 0447 | ш 0448 | щ 0449 | ъ 044A | ы 044B | ь 044C | э 044D | ю 044E | я 044F |
F_ | | ё 0451 | | | | | | | | | | | | | | |
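The layout in the table above is regular enough to be generated rather than typed out. The Python sketch below builds the high-half mapping and decodes bytes with it. All function names are hypothetical, and treating bytes below 0x80 as plain ASCII is an assumption based on the "Extended ASCII" classification, not something the table itself states. The last helper compares each defined position with Python's built-in `iso8859_5` codec as one way to check the superset relationship.

```python
def build_iso_ir_153_table() -> dict:
    """Map each defined high-half byte to its Unicode character."""
    table = {
        0xA0: "\u00A0",  # NBSP
        0xA1: "\u0401",  # Ё
        0xAD: "\u00AD",  # SHY
        0xF1: "\u0451",  # ё
    }
    # 0xB0..0xEF hold А..я in alphabetical order, i.e. U+0410..U+044F.
    for byte in range(0xB0, 0xF0):
        table[byte] = chr(0x0410 + byte - 0xB0)
    return table


ISO_IR_153 = build_iso_ir_153_table()


def decode_iso_ir_153(data: bytes) -> str:
    """Decode bytes; raise ValueError on positions the table leaves undefined."""
    out = []
    for byte in data:
        if byte < 0x80:
            out.append(chr(byte))  # assumption: lower half is plain ASCII
        elif byte in ISO_IR_153:
            out.append(ISO_IR_153[byte])
        else:
            raise ValueError(f"byte 0x{byte:02X} is undefined in ISO-IR-153")
    return "".join(out)


def is_subset_of_iso_8859_5() -> bool:
    """Check every defined position against Python's iso8859_5 codec."""
    return all(
        bytes([byte]).decode("iso8859_5") == char
        for byte, char in ISO_IR_153.items()
    )
```

For example, the bytes `BF E0 D8 D2 D5 E2` decode to the word Привет. Raising an error on undefined positions is a design choice of this sketch; a more lenient decoder could substitute U+FFFD instead.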
References
- Nechayev, Valentin (2013) [2001]. "Review of 8-bit Cyrillic encodings universe". Archived from the original on 2016-12-05. Retrieved 2016-12-05.
- Czyborra, Roman (1998-11-30) [1998-05-25]. "The Cyrillic Charset Soup". Archived from the original on 2016-12-03. Retrieved 2016-12-03. "[…] in the meantime GOST had inhaled some perestroika and declared the installed base and KOI correspondence less important and revised its 19768 standard from 1974 in 1987 into an incompatible new GOST 19768-87 […]"
- ISO-IR-153 (1 December 1989)
- ECMA-113. 8-Bit Single-Byte Coded Graphic Character Sets - Latin/Cyrillic Alphabet (2nd ed., June 1988)
- http://czyborra.com/charsets/gost19768-87.txt.gz
- "Character Sets". IANA.
- Simonsen, Keld (1992). "Character Mnemonics & Character Sets". Requests for Comments. IETF. doi:10.17487/rfc1345. RFC 1345.
- (in Russian) ГОСТ Р 34.303-92. Наборы 8-битных кодированных символов. 8-битный код обмена и обработки информации. = 8-bit coded character sets. 8-bit code for information interchange.