Windows-1250
Windows-1250 is a code page used under Microsoft Windows to represent texts in Central European and Eastern European languages that use Latin script, such as Polish, Czech, Slovak, Hungarian, Slovene, Bosnian, Croatian, Serbian (Latin script), Romanian (before 1993 spelling reform) and Albanian. It may also be used with the German language; German-language texts encoded with Windows-1250 and Windows-1252 are identical.
In modern applications UTF-8 or UTF-16 is a preferred encoding; As of July 2020, under 0.1% of all web pages use Windows-1250.[1][2]
Windows-1250 is similar to ISO-8859-2 and has all the printable characters it has and more. However a few of them are rearranged (unlike Windows-1252, which keeps all printable characters from ISO-8859-1 in the same place). Most of the rearrangements seem to have been done to keep characters shared with Windows-1252 in the same place as in Windows-1252 but three of the characters moved (Ą, Ľ, ź) cannot be explained this way, since those do not occur in Windows-1252 and could have been put in the same positions as in ISO-8859-2 if ˇ had been put e.g. at 9F. The part that differs from ISO-8859-2 is compared with Windows-1252 in the table below:
position (hex) | 1252 | 1250 | position (hex) | 1252 | 1250 | position (hex) | 1252 | 1250 | position (hex) | 1252 | 1250 | position (hex) | 1252 | 1250 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
83 | ƒ U+0192 | 98 | ˜ U+02DC | A2 | ¢ U+00A2 | ˘ U+02D8 |
B2 | ² U+00B2 | ˛ U+02DB |
BD | ½ U+00BD | ˝ U+02DD | ||
88 | ˆ U+02C6 | 9C | œ U+0153 | ś U+015B |
A3 | £ U+00A3 | Ł U+0141 |
B3 | ³ U+00B3 | ł U+0142 |
BE | ¾ U+00BE | ľ U+013E | |
8C | Œ U+0152 | Ś U+015A |
9D | ť U+0165 |
A5 | ¥ U+00A5 | Ą U+0104 |
B9 | ¹ U+00B9 | ą U+0105 |
BF | ¿ U+00BF | ż U+017C | |
8D | Ť U+0164 |
9F | Ÿ U+0178 | ź U+017A |
AA | ª U+00AA | Ş U+015E |
BA | º U+00BA | ş U+015F | ||||
8F | Ź U+0179 |
A1 | ¡ U+00A1 | ˇ U+02C7 |
AF | ¯ U+00AF | Ż U+017B |
BC | ¼ U+00BC | Ľ U+013D |
- Note: The shaded positions at A2, A3, AA, AF, B2, B3, BA, BD and BF are the same as in ISO-8859-2. Positions which are identical in Windows-1252 and Windows-1250 are not shown.
IBM uses code page 1250 (CCSID 1250 and euro sign extended CCSID 5346) for Windows-1250.[3][4][5][6][7][8][9]
Character set
The following table shows Windows-1250. Each character is shown with its Unicode equivalent.
_0 | _1 | _2 | _3 | _4 | _5 | _6 | _7 | _8 | _9 | _A | _B | _C | _D | _E | _F | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0_ 0 |
NUL 0000 |
SOH 0001 |
STX 0002 |
ETX 0003 |
EOT 0004 |
ENQ 0005 |
ACK 0006 |
BEL 0007 |
BS 0008 |
HT 0009 |
LF 000A |
VT 000B |
FF 000C |
CR 000D |
SO 000E |
SI 000F |
1_ 16 |
DLE 0010 |
DC1 0011 |
DC2 0012 |
DC3 0013 |
DC4 0014 |
NAK 0015 |
SYN 0016 |
ETB 0017 |
CAN 0018 |
EM 0019 |
SUB 001A |
ESC 001B |
FS 001C |
GS 001D |
RS 001E |
US 001F |
2_ 32 |
SP 0020 |
! 0021 |
" 0022 |
# 0023 |
$ 0024 |
% 0025 |
& 0026 |
' 0027 |
( 0028 |
) 0029 |
* 002A |
+ 002B |
, 002C |
- 002D |
. 002E |
/ 002F |
3_ 48 |
0 0030 |
1 0031 |
2 0032 |
3 0033 |
4 0034 |
5 0035 |
6 0036 |
7 0037 |
8 0038 |
9 0039 |
: 003A |
; 003B |
< 003C |
= 003D |
> 003E |
? 003F |
4_ 64 |
@ 0040 |
A 0041 |
B 0042 |
C 0043 |
D 0044 |
E 0045 |
F 0046 |
G 0047 |
H 0048 |
I 0049 |
J 004A |
K 004B |
L 004C |
M 004D |
N 004E |
O 004F |
5_ 80 |
P 0050 |
Q 0051 |
R 0052 |
S 0053 |
T 0054 |
U 0055 |
V 0056 |
W 0057 |
X 0058 |
Y 0059 |
Z 005A |
[ 005B |
\ 005C |
] 005D |
^ 005E |
_ 005F |
6_ 96 |
` 0060 |
a 0061 |
b 0062 |
c 0063 |
d 0064 |
e 0065 |
f 0066 |
g 0067 |
h 0068 |
i 0069 |
j 006A |
k 006B |
l 006C |
m 006D |
n 006E |
o 006F |
7_ 112 |
p 0070 |
q 0071 |
r 0072 |
s 0073 |
t 0074 |
u 0075 |
v 0076 |
w 0077 |
x 0078 |
y 0079 |
z 007A |
{ 007B |
| 007C |
} 007D |
~ 007E |
DEL 007F |
8_ 128 |
€ 20AC |
‚ 201A |
„ 201E |
… 2026 |
† 2020 |
‡ 2021 |
‰ 2030 |
Š 0160 |
‹ 2039 |
Ś 015A |
Ť 0164 |
Ž 017D |
Ź 0179 | |||
9_ 144 |
‘ 2018 |
’ 2019 |
“ 201C |
” 201D |
• 2022 |
– 2013 |
— 2014 |
™ 2122 |
š 0161 |
› 203A |
ś 015B |
ť 0165 |
ž 017E |
ź 017A | ||
A_ 160 |
NBSP 00A0 |
ˇ 02C7 |
˘ 02D8 |
Ł 0141 |
¤ 00A4 |
Ą 0104 |
¦ 00A6 |
§ 00A7 |
¨ 00A8 |
© 00A9 |
Ş 015E |
« 00AB |
¬ 00AC |
SHY 00AD |
® 00AE |
Ż 017B |
B_ 176 |
° 00B0 |
± 00B1 |
˛ 02DB |
ł 0142 |
´ 00B4 |
µ 00B5 |
¶ 00B6 |
· 00B7 |
¸ 00B8 |
ą 0105 |
ş 015F |
» 00BB |
Ľ 013D |
˝ 02DD |
ľ 013E |
ż 017C |
C_ 192 |
Ŕ 0154 |
Á 00C1 |
 00C2 |
Ă 0102 |
Ä 00C4 |
Ĺ 0139 |
Ć 0106 |
Ç 00C7 |
Č 010C |
É 00C9 |
Ę 0118 |
Ë 00CB |
Ě 011A |
Í 00CD |
Î 00CE |
Ď 010E |
D_ 208 |
Đ 0110 |
Ń 0143 |
Ň 0147 |
Ó 00D3 |
Ô 00D4 |
Ő 0150 |
Ö 00D6 |
× 00D7 |
Ř 0158 |
Ů 016E |
Ú 00DA |
Ű 0170 |
Ü 00DC |
Ý 00DD |
Ţ 0162 |
ß 00DF |
E_ 224 |
ŕ 0155 |
á 00E1 |
â 00E2 |
ă 0103 |
ä 00E4 |
ĺ 013A |
ć 0107 |
ç 00E7 |
č 010D |
é 00E9 |
ę 0119 |
ë 00EB |
ě 011B |
í 00ED |
î 00EE |
ď 010F |
F_ 240 |
đ 0111 |
ń 0144 |
ň 0148 |
ó 00F3 |
ô 00F4 |
ő 0151 |
ö 00F6 |
÷ 00F7 |
ř 0159 |
ů 016F |
ú 00FA |
ű 0171 |
ü 00FC |
ý 00FD |
ţ 0163 |
˙ 02D9 |
Letter Number Punctuation Symbol Other Undefined
References
- "Historical trends in the usage of character encodings for websites, July 202". w3techs.com.
- "Frequently Asked Questions". w3techs.com.
- "Code page 1250 information document". Archived from the original on 2016-03-03.
- "CCSID 1250 information document". Archived from the original on 2016-03-27.
- "CCSID 5346 information document". Archived from the original on 2014-11-29.
- Code Page CPGID 01250 (pdf) (PDF), IBM
- Code Page CPGID 01250 (txt), IBM
- International Components for Unicode (ICU), ibm-1250_P100-1995.ucm, 2002-12-03
- International Components for Unicode (ICU), ibm-5346_P100-1998.ucm, 2002-12-03
- Steele, Shawn (1998), CP1250 to Unicode table, Unicode Consortium, CP1250.TXT