Windows-1250
Windows-1250 is a code page used under Microsoft Windows to represent texts in Central European and Eastern European languages that use the Latin script. It is primarily used by Czech,[1] though Czech has now moved to UTF-8[2] and mostly abandoned this legacy encoding. It is also used for Polish, Slovak, Hungarian, Slovene, Serbo-Croatian (Latin script), Romanian (before a 1993 spelling reform), Rotokas and Albanian. It may also be used with the German language, though it's missing uppercase ẞ.[lower-alpha 1] German-language texts encoded with Windows-1250 and Windows-1252 are identical.
MIME / IANA | windows-1250 |
---|---|
Alias(es) | cp1250 (Code page 1250) |
Language(s) | Czech, Polish, Slovak, Hungarian, Slovene, Serbo-Croatian (Latin script), Montenegrin, Romanian (before 1993 spelling reform), Gagauz, Rotokas, Albanian, English, German, Luxembourgish |
Created by | Microsoft |
Standard | WHATWG Encoding Standard |
Classification | extended ASCII, Windows-125x |
Other related encoding(s) | ISO-8859-2 |
This has been replaced by Unicode (such as UTF-8) far more than Windows-1252. As of October 2022, less than 0.04% of all web pages use Windows-1250.[3][4]
Windows-1250 is similar to ISO-8859-2 and has all the printable characters it has and more. However a few of them are rearranged (unlike Windows-1252, which keeps all printable characters from ISO-8859-1 in the same place). Most of the rearrangements seem to have been done to keep characters shared with Windows-1252 in the same place but three of the characters moved (Ą, Ľ, ź) cannot be explained this way, since those do not occur in Windows-1252 and could have been put in the same positions as in ISO-8859-2 if ˇ had been put e.g. at 9F.
IBM uses code page 1250 (CCSID 1250 and euro sign extended CCSID 5346) for Windows-1250.[5][6][7][8][9][10][11]
Character set
The following table shows Windows-1250. Each character is shown with its Unicode equivalent.
Windows-1250[12] | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | DEL |
8x | € | ‚ | „ | … | † | ‡ | ‰ | Š | ‹ | Ś | Ť | Ž | Ź | |||
9x | ‘ | ’ | “ | ” | • | – | — | ™ | š | › | ś | ť | ž | ź | ||
Ax | NBSP | ˇ | ˘ | Ł | ¤ | Ą | ¦ | § | ¨ | © | Ş | « | ¬ | SHY | ® | Ż |
Bx | ° | ± | ˛ | ł | ´ | µ | ¶ | · | ¸ | ą | ş | » | Ľ | ˝ | ľ | ż |
Cx | Ŕ | Á | Â | Ă | Ä | Ĺ | Ć | Ç | Č | É | Ę | Ë | Ě | Í | Î | Ď |
Dx | Đ | Ń | Ň | Ó | Ô | Ő | Ö | × | Ř | Ů | Ú | Ű | Ü | Ý | Ţ | ß |
Ex | ŕ | á | â | ă | ä | ĺ | ć | ç | č | é | ę | ë | ě | í | î | ď |
Fx | đ | ń | ň | ó | ô | ő | ö | ÷ | ř | ů | ú | ű | ü | ý | ţ | ˙ |
See also
Notes
- In 2017, the Council for German Orthography officially adopted a capital, ⟨ẞ⟩, before support for German was complete. Fully compatible with ISO/IEC 8859-1 for German texts.
References
- "Distribution of Content Languages among websites that use Windows-1250". w3techs.com. Retrieved 2022-10-23.
- "Distribution of Character Encodings among websites that use Czech". w3techs.com. Retrieved 2022-10-23.
- "Historical trends in the usage of character encodings for websites, October 2022". w3techs.com.
- "Frequently Asked Questions". w3techs.com.
- "Code page 1250 information document". Archived from the original on 2016-03-03.
- "CCSID 1250 information document". Archived from the original on 2016-03-27.
- "CCSID 5346 information document". Archived from the original on 2014-11-29.
- Code Page CPGID 01250 (pdf) (PDF), IBM
- Code Page CPGID 01250 (txt), IBM
- International Components for Unicode (ICU), ibm-1250_P100-1995.ucm, 2002-12-03
- International Components for Unicode (ICU), ibm-5346_P100-1998.ucm, 2002-12-03
- Steele, Shawn (1998), CP1250 to Unicode table, Unicode Consortium, CP1250.TXT