Windows-1257
Windows-1257 (Windows Baltic) is a single byte code page used to support the Estonian, Latvian and Lithuanian languages under Microsoft Windows.
MIME / IANA | windows-1257 |
---|---|
Alias(es) | cp1257 (Code page 1257) |
Language(s) | Estonian, Latvian, Lithuanian (also supports Polish, Swedish, Finnish, Norwegian, Danish, German, English) |
Created by | Microsoft |
Standard | LST 1590-3, WHATWG Encoding Standard |
Classification | extended ASCII, Windows-125x |
Other related encoding(s) | ISO 8859-13, LST 1590-4 |
The label Windows-1257
was registered with the IANA in 1996, citing a publication of the specification in 1995 and inclusion with pan-European versions of Windows 95.[1] The later ISO 8859-13 encoding (first published in 1998) is similar, but differs in reserving the range 0x80–9F for control characters, and accordingly locating certain quotation marks at codepoints 0xA1, 0xA5, 0xB4 and 0xFF instead (the latter two are used for spacing diacritics in Windows-1257). Windows-1257 is not compatible with the older ISO 8859-4 and ISO 8859-10 encodings.
IBM uses code page 1257 (CCSID 1257, euro sign extended CCSID 5353, and the further extended CCSID 9449) for Windows-1257.[2][3][4][5]
As with many other code pages, the languages supported in this code page can be supported in other code pages. The Estonian language can be written with Windows-1252. It is possible, but unusual, to write Polish, Swedish, Finnish, Norwegian, Danish and German using this code page. The German specific characters will be identical to those encoded in Windows-1252.
In Lithuania, this code page is standardised as LST 1590-3, alongside a modified variant named LST 1590-4.[6][7]
Unicode is preferred to Windows-1257 in modern applications.
Character set
The following table shows Windows-1257. Each character is shown with its Unicode equivalent. Differences from Windows-1252 are shaded.
_0 | _1 | _2 | _3 | _4 | _5 | _6 | _7 | _8 | _9 | _A | _B | _C | _D | _E | _F | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0_ 0 |
NUL 0000 |
SOH 0001 |
STX 0002 |
ETX 0003 |
EOT 0004 |
ENQ 0005 |
ACK 0006 |
BEL 0007 |
BS 0008 |
HT 0009 |
LF 000A |
VT 000B |
FF 000C |
CR 000D |
SO 000E |
SI 000F |
1_ 16 |
DLE 0010 |
DC1 0011 |
DC2 0012 |
DC3 0013 |
DC4 0014 |
NAK 0015 |
SYN 0016 |
ETB 0017 |
CAN 0018 |
EM 0019 |
SUB 001A |
ESC 001B |
FS 001C |
GS 001D |
RS 001E |
US 001F |
2_ 32 |
SP 0020 |
! 0021 |
" 0022 |
# 0023 |
$ 0024 |
% 0025 |
& 0026 |
' 0027 |
( 0028 |
) 0029 |
* 002A |
+ 002B |
, 002C |
- 002D |
. 002E |
/ 002F |
3_ 48 |
0 0030 |
1 0031 |
2 0032 |
3 0033 |
4 0034 |
5 0035 |
6 0036 |
7 0037 |
8 0038 |
9 0039 |
: 003A |
; 003B |
< 003C |
= 003D |
> 003E |
? 003F |
4_ 64 |
@ 0040 |
A 0041 |
B 0042 |
C 0043 |
D 0044 |
E 0045 |
F 0046 |
G 0047 |
H 0048 |
I 0049 |
J 004A |
K 004B |
L 004C |
M 004D |
N 004E |
O 004F |
5_ 80 |
P 0050 |
Q 0051 |
R 0052 |
S 0053 |
T 0054 |
U 0055 |
V 0056 |
W 0057 |
X 0058 |
Y 0059 |
Z 005A |
[ 005B |
\ 005C |
] 005D |
^ 005E |
_ 005F |
6_ 96 |
` 0060 |
a 0061 |
b 0062 |
c 0063 |
d 0064 |
e 0065 |
f 0066 |
g 0067 |
h 0068 |
i 0069 |
j 006A |
k 006B |
l 006C |
m 006D |
n 006E |
o 006F |
7_ 112 |
p 0070 |
q 0071 |
r 0072 |
s 0073 |
t 0074 |
u 0075 |
v 0076 |
w 0077 |
x 0078 |
y 0079 |
z 007A |
{ 007B |
| 007C |
} 007D |
~ 007E |
DEL 007F |
8_ 128 |
€ 20AC |
‚ 201A |
„ 201E |
… 2026 |
† 2020 |
‡ 2021 |
‰ 2030 |
‹ 2039 |
¨ 00A8 |
ˇ 02C7 |
¸ 00B8 | |||||
9_ 144 |
‘ 2018 |
’ 2019 |
“ 201C |
” 201D |
• 2022 |
– 2013 |
— 2014 |
™ 2122 |
› 203A |
¯ 00AF |
˛ 02DB |
|||||
A_ 160 |
NBSP 00A0 |
¢ 00A2 |
£ 00A3 |
¤ 00A4 |
¦ 00A6 |
§ 00A7 |
Ø 00D8 |
© 00A9 |
Ŗ 0156 |
« 00AB |
¬ 00AC |
SHY 00AD |
® 00AE |
Æ 00C6 | ||
B_ 176 |
° 00B0 |
± 00B1 |
² 00B2 |
³ 00B3 |
´ 00B4 |
µ 00B5 |
¶ 00B6 |
· 00B7 |
ø 00F8 |
¹ 00B9 |
ŗ 0157 |
» 00BB |
¼ 00BC |
½ 00BD |
¾ 00BE |
æ 00E6 |
C_ 192 |
Ą 0104 |
Į 012E |
Ā 0100 |
Ć 0106 |
Ä 00C4 |
Å 00C5 |
Ę 0118 |
Ē 0112 |
Č 010C |
É 00C9 |
Ź 0179 |
Ė 0116 |
Ģ 0122 |
Ķ 0136 |
Ī 012A |
Ļ 013B |
D_ 208 |
Š 0160 |
Ń 0143 |
Ņ 0145 |
Ó 00D3 |
Ō 014C |
Õ 00D5 |
Ö 00D6 |
× 00D7 |
Ų 0172 |
Ł 0141 |
Ś 015A |
Ū 016A |
Ü 00DC |
Ż 017B |
Ž 017D |
ß 00DF |
E_ 224 |
ą 0105 |
į 012F |
ā 0101 |
ć 0107 |
ä 00E4 |
å 00E5 |
ę 0119 |
ē 0113 |
č 010D |
é 00E9 |
ź 017A |
ė 0117 |
ģ 0123 |
ķ 0137 |
ī 012B |
ļ 013C |
F_ 240 |
š 0161 |
ń 0144 |
ņ 0146 |
ó 00F3 |
ō 014D |
õ 00F5 |
ö 00F6 |
÷ 00F7 |
ų 0173 |
ł 0142 |
ś 015B |
ū 016B |
ü 00FC |
ż 017C |
ž 017E |
˙ 02D9 |
Letter Number Punctuation Symbol Other Undefined
References
- Lazhintseva, Katya (1996-05-03). "Registration of new MIME charset: Windows-1257". IANA.
- "Code page 1257 information document". Archived from the original on 2016-03-03.
- "CCSID 1257 information document". Archived from the original on 2016-03-26.
- "CCSID 5353 information document". Archived from the original on 2014-11-29.
- "CCSID 9449 information document". Archived from the original on 2014-11-29.
- "Rašmenų koduotės". Lietuvių kalba informacinėse technologijose.
- "LST 1590-3". Rašmenų koduotės.
- Unicode mapping table for Windows 1257
- Unicode mappings of windows 1257 with "best fit"
- Code Page CPGID 01257 (pdf) (PDF), IBM
- Code Page CPGID 01257 (txt), IBM
- International Components for Unicode (ICU), ibm-1257_P100-1995.ucm, 2002-12-03
- International Components for Unicode (ICU), ibm-5353_P100-1998.ucm, 2002-12-03
- International Components for Unicode (ICU), ibm-9449_P100-2002.ucm, 2003-10-07