Character Encodings/Code Tables/MS-DOS/Code page 852

From Wikibooks, open books for an open world

Code page 852 (CCSID 852) (also known as CP 852, IBM 00852, OEM 852 (Latin II),[1][2] MS-DOS Latin 2[3]) is a code page used under DOS to write Central European languages that use Latin script (such as Serbo-Croatian, Czech, Hungarian, Polish, Romanian or Slovene).[4]
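
As a minimal sketch of what this means in practice, the snippet below encodes a few Central European words to code page 852 bytes and decodes them again. It assumes Python's built-in `cp852` codec, which the sources above do not mention; the helper name `to_cp852` and the sample words are purely illustrative.

```python
# Round-trip a few Central European words through code page 852.
# Assumption: Python's standard "cp852" codec stands in for a DOS system.
samples = ["Łódź", "žluťoučký", "őrült", "împărat"]


def to_cp852(text):
    """Encode text as code page 852 (hypothetical helper).

    Raises UnicodeEncodeError if a character has no CP 852 byte.
    """
    return text.encode("cp852")


for word in samples:
    raw = to_cp852(word)
    # CP 852 is a single-byte encoding: one byte per character.
    assert len(raw) == len(word)
    assert raw.decode("cp852") == word
    print(word, raw.hex())
```

Characters outside the code page (for example Cyrillic letters) would raise `UnicodeEncodeError` in this sketch rather than being silently replaced.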

CCSID 9044 is the euro currency update of code page/CCSID 852.[5] In that update, byte 0xAA encodes the euro sign (€) instead of the not sign (¬).[6][7]

Code page 852 (DOS Latin 2) uses a very different byte layout from ISO 8859-2 (ISO Latin-2), although both are informally referred to as "Latin-2" in different language regions.[8] However, all printable characters from ISO 8859-2 are included. The arrangement preserves a subset of the box-drawing characters of the original DOS code page 437 and sacrifices the others (those combining single and double lines) in order to include more letters with diacritics. Code page 850, the equivalent for ISO 8859-1, takes the same approach.
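
The coverage claim can be checked mechanically. The following is a sketch under the assumption that Python's `iso8859_2` and `cp852` codecs agree with the tables cited here; the function name `missing_from_cp852` is made up for this example.

```python
# Check that every printable ISO 8859-2 character also exists in CP 852.
# Assumption: Python's "iso8859_2" and "cp852" codecs match the cited tables.


def missing_from_cp852():
    """Return ISO 8859-2 characters (bytes 0xA0-0xFF) that CP 852 lacks."""
    missing = []
    for byte in range(0xA0, 0x100):
        char = bytes([byte]).decode("iso8859_2")
        try:
            char.encode("cp852")
        except UnicodeEncodeError:
            missing.append(char)
    return missing


print(missing_from_cp852())

# Same character, different byte: the two "Latin-2" layouts do not line up.
letter = "\u0141"  # LATIN CAPITAL LETTER L WITH STROKE
print(letter.encode("iso8859_2").hex(), letter.encode("cp852").hex())
```

An empty list means the repertoire is fully covered, while the last line illustrates that text still has to be transcoded between the two encodings because the byte values differ.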

This reduced box-drawing support caused display glitches in DOS applications that used box-drawing characters to display a GUI-like interface in text mode (e.g. Norton Commander). Several local, more language-specific encodings were invented to avoid the problem, for example the Kamenický encoding for Czech and Slovak[9] or the Mazovia encoding for Polish.
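
To illustrate the glitch, the sketch below lists the code page 437 box-drawing characters that code page 852 cannot represent, and decodes one CP 437 frame edge as CP 852. It assumes Python's `cp437` and `cp852` codecs match the DOS code pages; `lost_box_drawing` and `frame_top` are hypothetical names.

```python
# List the CP 437 box-drawing characters that CP 852 cannot represent.
# Assumption: Python's "cp437" and "cp852" codecs match the DOS code pages.


def lost_box_drawing():
    """Return CP 437 box-drawing characters that have no CP 852 byte."""
    lost = []
    for byte in range(0x80, 0x100):
        char = bytes([byte]).decode("cp437")
        if "\u2500" <= char <= "\u257f":  # Unicode "Box Drawing" block
            try:
                char.encode("cp852")
            except UnicodeEncodeError:
                lost.append(char)
    return "".join(lost)


# The top edge of a frame that mixes single and double lines in CP 437.
frame_top = bytes([0xD5, 0xCD, 0xCD, 0xB8])

print(frame_top.decode("cp437"))  # as the application intended
print(frame_top.decode("cp852"))  # as shown with CP 852 active
print(lost_box_drawing())
```

With code page 852 active, the two corner bytes of this frame render as letters with diacritics, while the double horizontal line in the middle survives because both code pages share it.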

A variant (used by FreeDOS, for example) replaces the not sign (¬) at code point 0xAA with the euro sign (€).
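
A minimal sketch of handling the euro variant follows. It assumes Python's `cp852` codec implements the original table (0xAA as the not sign); the functions `decode_cp852` and `encode_cp852` and the `euro` flag are hypothetical and not part of any library.

```python
# Decode or encode CP 852, optionally treating byte 0xAA as the euro sign.
# Assumption: Python's "cp852" codec implements the original (non-euro) table.

NOT_SIGN = "\u00ac"
EURO_SIGN = "\u20ac"


def decode_cp852(data, euro=False):
    text = data.decode("cp852")
    if euro:
        # Only byte 0xAA decodes to the not sign, so replacing the decoded
        # character is equivalent to remapping that one byte.
        text = text.replace(NOT_SIGN, EURO_SIGN)
    return text


def encode_cp852(text, euro=False):
    if euro:
        if NOT_SIGN in text:
            raise ValueError("the euro variant has no byte for the not sign")
        text = text.replace(EURO_SIGN, NOT_SIGN)
    return text.encode("cp852")


price = bytes([0x31, 0x30, 0x20, 0xAA])  # "10 " followed by byte 0xAA
print(decode_cp852(price))
print(decode_cp852(price, euro=True))
```

Because the two variants differ in a single byte, nothing in the data itself says which one was intended; in this sketch the caller has to supply that knowledge through the `euro` flag.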

Character set

The following table shows code page 852.[1][10] Each character is shown with its equivalent Unicode code point. Only the second half of the table (128–255) is shown, the first half (0–127) being the same as code page 437.

Code page 852[3][6][7][11]

          0    1    2    3    4    5    6    7    8    9    A    B    C    D    E    F
8x (128)  Ç    ü    é    â    ä    ů    ć    ç    ł    ë    Ő    ő    î    Ź    Ä    Ć
          00C7 00FC 00E9 00E2 00E4 016F 0107 00E7 0142 00EB 0150 0151 00EE 0179 00C4 0106
9x (144)  É    Ĺ    ĺ    ô    ö    Ľ    ľ    Ś    ś    Ö    Ü    Ť    ť    Ł    ×    č
          00C9 0139 013A 00F4 00F6 013D 013E 015A 015B 00D6 00DC 0164 0165 0141 00D7 010D
Ax (160)  á    í    ó    ú    Ą    ą    Ž    ž    Ę    ę    ¬    ź    Č    ş    «    »
          00E1 00ED 00F3 00FA 0104 0105 017D 017E 0118 0119 00AC 017A 010C 015F 00AB 00BB
Bx (176)  ░    ▒    ▓    │    ┤    Á    Â    Ě    Ş    ╣    ║    ╗    ╝    Ż    ż    ┐
          2591 2592 2593 2502 2524 00C1 00C2 011A 015E 2563 2551 2557 255D 017B 017C 2510
Cx (192)  └    ┴    ┬    ├    ─    ┼    Ă    ă    ╚    ╔    ╩    ╦    ╠    ═    ╬    ¤
          2514 2534 252C 251C 2500 253C 0102 0103 255A 2554 2569 2566 2560 2550 256C 00A4
Dx (208)  đ    Đ    Ď    Ë    ď    Ň    Í    Î    ě    ┘    ┌    █    ▄    Ţ    Ů    ▀
          0111 0110 010E 00CB 010F 0147 00CD 00CE 011B 2518 250C 2588 2584 0162 016E 2580
Ex (224)  Ó    ß    Ô    Ń    ń    ň    Š    š    Ŕ    Ú    ŕ    Ű    ý    Ý    ţ    ´
          00D3 00DF 00D4 0143 0144 0148 0160 0161 0154 00DA 0155 0170 00FD 00DD 0163 00B4
Fx (240)  SHY  ˝    ˛    ˇ    ˘    §    ÷    ¸    °    ¨    ˙    ű    Ř    ř    ■    NBSP
          00AD 02DD 02DB 02C7 02D8 00A7 00F7 00B8 00B0 00A8 02D9 0171 0158 0159 25A0 00A0

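
The upper half of the table can also be regenerated from a codec, which is a convenient cross-check. This is only a sketch: it assumes Python's `cp852` and `cp437` codecs match the tables referenced above, and `cp852_rows` is a hypothetical helper name.

```python
# Rebuild the upper half (128-255) of the CP 852 table from a codec.
# Assumption: Python's "cp852" codec matches the table shown in this section.


def cp852_rows():
    """Return 8 rows (8x..Fx) of 16 (character, hex code point) pairs."""
    rows = []
    for high in range(0x8, 0x10):
        row = []
        for low in range(0x10):
            char = bytes([high * 16 + low]).decode("cp852")
            row.append((char, f"{ord(char):04X}"))
        rows.append(row)
    return rows


for high, row in zip(range(0x8, 0x10), cp852_rows()):
    print(f"{high:X}x", " ".join(code for _, code in row))

# The lower half (0-127) decodes identically under both codecs.
lower = bytes(range(128))
assert lower.decode("cp852") == lower.decode("cp437")
```

Only the code points are printed here, since control-like entries such as the soft hyphen and the no-break space have no visible glyph.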

See also

  • LMBCS-6

References

  1. "OEM 852". Go Global Developer Center. Microsoft. Retrieved 11 Nov 2011.
  2. "Code Pages Supported by Windows: OEM Code Pages". Go Global Developer Center. Microsoft. Archived from the original on 2 November 2011. Retrieved 11 Oct 2011.
  3. "Code Page 852 DOS Latin 2". Developing International Software. Microsoft. 6 February 2008. Retrieved 11 Nov 2011.
  4. "CCSID 852 information document". Archived from the original on 2016-03-27.
  5. "CCSID 9044 information document". Archived from the original on 2016-03-27.
  6. Code Page CPGID 00852 (PDF), IBM
  7. Code Page CPGID 00852 (TXT), IBM
  8. "The Czech and Slovak Character Encoding Mess Explained". luki.sdf-eu.org. Retrieved 2022-02-27.
  9. The Czech and Slovak Character Encoding Mess Explained / Kamenicky
  10. "cp852_DOSLatin2 to Unicode table" (TXT). The Unicode Consortium. Retrieved 11 Nov 2011.
  11. International Components for Unicode (ICU), ibm-852_P100-1995.ucm, 2002-12-03