Sybase Business Intelligence Solutions - Database Management, Data Warehousing Software, Mobile Enterprise Applications and Messaging
Sybase Brand Color Bar
delete

Search for    in all of Sybase.com
view all search results right arrow
  blank
 
 
 
 
 
 
 
 
 
 

 
 
CLICK TO EXPAND NAVIGATION
CLICK TO EXPAND NAVIGATION
Downloads > Code Samples & Utilities > Character Set Support for Sybase Products  
RSS Feed
 
 
 

Character Set Support for Sybase Products

Character sets include maps of characters (alphabetic, numeric, symbols) and their assigned numerical values. There are over 250 native character sets that support the different languages in the world. Supported character sets for Sybase products are available by download on this page. Recently updated, this download includes support for the *Euro symbol (uses charsets: iso15 and roman9) as well.

Click here to download Character Sets for Sybase products.

The remainder of this document discusses the addition of roman9 and iso15 to Adaptive Server Enterprise supported character sets, and updates , "Configuring Character Sets, Sort Orders, and Languages" Chapter in the Sybase System Administration Guide.

 

Adaptive Server versions affected

These versions of Adaptive Server are affected by the introduction of these character sets:

 

  • 11.9.2.5 ESD#4
  • 12.0.0.4 IR
  • 12.5 ESD#2 ONE OFF
  • 12.5.0.1 IR

The roman9 and iso15 character sets are supported on all platforms. The new server binary and files are located in $SYBASE/charsets.

roman8 and roman9 character sets

The roman9 character set is the same as roman8, with one exception. Codepoint 0xBA, previously treated as the universal currency symbol, now represents the Euro currency symbol. The same sort orders are supported for roman9 as for roman8.

 

roman9 data conversions are supported for all other Sybase-supported character sets, but the Euro symbol is defined only in: utf8, iso15, cp1250 through cp1258, and cp874, with one exception. In Adaptive Server 11.9.2.5 ESD#4, the cp874 character set does not support the Euro symbol.

 

iso_1 and iso15 character sets

 

The iso15 character set was included in Adaptive Server 12.0 and 12.5, but previously required CR#222710 to be operational. iso15 (based on ISO 8859-15) is similar to iso_1 (based on ISO 8859-1). The same sort orders are supported for iso15 as for iso_1. These changes have been made to the character set:

 

Table 1-1: iso_1 to iso15 conversions



Charset
EURO
11.0
11.5.1
11.9
12.0
12.5
ascii_8
-
SB-
SBU
SBU
SBU
SBU
big5
-
S--
S-U
S-U
S-U
S-U
big5hk 1
-
S--
S--
S--
S--
S-U
cp1026
-
---
--U
--U
--U
--U
cp1047
-
---
--U
--U
--U
--U
cp1140
-
---
---
---
---
--U
cp1141
-
---
---
---
---
--U
cp1142
-
---
---
---
---
--U
cp1143
-
---
---
---
---
--U
cp1144
-
---
---
---
---
--U
cp1145
-
---
---
---
---
--U
cp1146
-
---
---
---
---
--U
cp1147
-
---
---
---
---
--U
cp1148
-
---
---
---
---
--U
cp1149
-
---
---
---
---
--U
cp1250
0x80
SB-
SBU
SBU
SBU
SBU
cp1251
0x80
SB-
SBU
SBU
SBU
SBU
cp1252
0x80
S--
S-U
S-U
S-U
S-U
cp1253
0x80
SB-
SBU
BU
BU
BU
cp1254
0x80
SB-
SBU
BU
BU
BU
cp1255
0x80
S--
S-U
S-U
S-U
S-U
cp1256
0x80
S--
S-U
S-U
S-U
S-U
cp1257
0x80
S--
S-U
S-U
S-U
S-U
cp1258
0x80
S--
S-U
S-U
S-U
S-U
cp273
-
---
--U
--U
--U
--U
cp277
-
---
--U
--U
--U
--U
cp278
-
---
--U
--U
--U
--U
cp280
-
---
--U
--U
--U
--U
cp284
-
---
--U
--U
--U
--U
cp285
-
---
--U
--U
--U
--U
cp297
-
---
--U
--U
--U
--U
cp420
-
---
---
---
---
--U
cp424
-
---
---
---
---
--U
cp437
-
SB-
SBU
SBU
SBU
SBU
cp500
-
---
--U
--U
--U
--U
cp5026
-
---
--U
--U
--U
--U
cp5026yen
-
---
--U
--U
--U
--U
cp5035
-
---
--U
--U
--U
--U
cp5035yen
-
---
--U
--U
--U
--U
cp737
-
---
--U
--U
--U
--U
cp775
-
---
--U
--U
--U
--U
cp850
-
SB-
SBU
SBU
SBU
SBU
cp852
-
SB-
SBU
SBU
SBU
SBU
cp855
-
SB-
SBU
SBU
SBU
SBU
cp857
-
SBU
SBU
SBU
SBU
SBU
cp858 2
0xD5
S--
S--
S--
S-U  
--U
cp860
-
S--
S-U
S-U
S-U
S-U
cp861
-
---
--U
--U
--U
--U
cp862
-
---
--U
--U
--U
--U
cp863
-
---
--U
--U
--U
--U
cp864
-
S--
S-U
S-U
S-U
S-U
cp865
-
---
--U
--U
--U
--U
cp866
-
SB-
SBU
SBU
SBU
SBU
cp869
-
SB-
SBU
SBU
SBU
SBU
cp870
-
---
---
---
---
--U
cp871
-
---
---
---
---
--U
cp874
0x80
S--
S-U
S-U
S-U
S-U
cp874ibm
-
---
--U
--U
--U
--U
cp875
-
---
--U
--U
--U
--U
cp921
-
---
---
---
---
--U
cp923
-
---
---
---
---
--U
cp930
-
---
--U
--U
--U
--U
cp930yen
-
---
--U
--U
--U
--U
cp932
-
S--
S-U
S-U
S-U
S-U
cp932ms
-
---
--U
--U
--U
--U
cp933
-
---
--U
--U
--U
--U
cp935
-
---
--U
--U
--U
--U
cp936
-
S--
S-U
S-U
S-U
S-U
cp937
-
---
--U
--U
--U
--U
cp939
-
---
--U
--U
--U
--U
cp939yen
-
---
--U
--U
--U
--U
cp949
-
---
---
---
--U
--U
cp950
-
S--
S-U
S-U
S-U
S-U
cp954
-
---
---
---
--U
--U
deckanji
-
SB-
SBU
SBU
SBU
SBU
euccns
-
S--
S-U
S-U
S-U
S-U
eucgb
-
S--
S-U
S-U
S-U
S-U
eucjis
-
SB-
SBU
SBU
SBU
SBU
eucksc
-
S--
S-U
S-U
S-U
S-U
greek8
-
SB-
SBU
SBU
SBU
SBU
iso10
-
---
---
---
--U
--U
iso13
-
---
---
---
--U
--U
iso14
-
---
---
---
--U
--U
iso15 3
0xA4
S--
S--
S-U
S-U
S-U
iso646
-
---
--U
--U
--U
--U
iso88592
-
SB-
SBU
SBU
SBU
SBU
iso88595
-
SB-
SBU
SBU
SBU
SBU
iso88596
-
S--
S-U
S-U
S-U
S-U
iso88597
-
SB-
SBU
SBU
SBU
SBU
iso88598
-
S--
S-U
S-U
S-U
S-U
iso88599
-
S--
SBU
SBU
SBU
SBU

   iso_1

-
SB-
SBU
SBU
SBU
SBU
koi8
-
SB-
SBU
SBU
SBU
SBU
mac
-
SB-
SBU
SBU
SBU
SBU
mac_cyr
-
SB-
SBU
SBU
SBU
SBU
mac_ee
-
SB-
SBU
SBU
SBU
SBU
macgrk2
-
SB-
SBU
SBU
SBU
SBU
macgreek
-
---
--U
--U
--U
--U
macthai
-
---
---
---
--U
--U
macturk
-
SB-
SBU
SBU
SBU
SBU
roman8
-
SB-
SBU
SBU
SBU
SBU
roman9 3
0xBA
S--
S--
S-U
S-U
S-U
rcsu
-
---
---
---
---
--U
sjis
-
SB-
SBU
SBU
SBU
SBU
tis620
-
S--
S-U
S-U
S-U
S-U
turkish8
-
SB-
SBU
SBU
SBU
SBU
utf8
0xE282AC
--
S-U
S-U
S-U
S-U

Notes:
S  The charset can be used as server’s default character
B  Can be client charset with built-in conversion available (to at least one other character set).
U  Can be client charset with Unicode conversion available (to any other character set).  

Special Cases:
1. Unicode conversion for big5hk is available in 12.5.0.1.
2. Unicode conversion for cp858 will be available in 12.0.0.5.
3. Unicode conversion for roman9 and iso15 is available in 11.9.2.5 (ESD#4), 12.0.0.4 (IR), 12.5 (N-OFF), and 12.5.0.1 (IR).


iso15 data conversions are supported for all other Sybase-supported character sets, but the Euro symbol is defined only in: utf8, roman9, cp1250 through cp1258, and cp874, with one exception. In Adaptive Server 11.9.2.5 ESD#4, the cp874 character set does not support the Euro symbol.

 

Changing default character sets

For conversions to work properly, you must set:

sp_configure "enable unicode conversions", 1

Note: You can also set "enable unicode conversions" to 2

To change the server's default character set:

 

% charset -Usa -P binary.srt iso15 (replace "binary.srt" as appropriate)
% isql -Usa -P
> sp_configure "enable unicode conversions", 1
> go
> sp_configure "default character set id", 14
> go
> shutdown
> go

You can also use the UNIX sqlloc utility or the NT sybconfig tool to change the default character set. See the Adaptive Server Enterprise System Administration Guide for more information.

 

You must reboot the system twice for character conversion to take effect.

 

Sort order and roman9
If you use the same sort order, you do not need to rebuild indexes when changing from roman8 to roman9, or vice-versa. The collation sequences are identical between the two character sets. The codepoint 0xBA (universal currency symbol) in existing roman8 character data is interpreted in roman9 as 0xBA (Euro symbol).

 

Note: There is no representation for the universal currency symbol in roman9.

 

For existing data to be interpreted correctly, you must translate all character data from roman8 to roman9 by removing all instances of 0xBA, or substituting a replacement character, such as '?', for the universal currency symbol.

The roman9 character set uses the value of 218 for SYSCHARSETS.ID, though only ID values 0-200 were reserved for Sybase. If you defined your character set with the ID set to 218, you must change it before you can use roman9.

Note that this does not apply to customer-defined sort orders that are also contained in SYSCHARSETS. A customer-defined sort order with ID of 218 can be used with the roman9 character set with ID of 218, because the corresponding CSID and TYPE values are different.

 

Sort order and iso15

 

You must rebuild indexes when changing from iso_1 to iso15 (or vice-versa), except when binary sort order is used. iso15 has new alphabetic characters, so the sort orders are not identical between the two character sets.

Existing iso_1 character data that contains the changed codepoints are interpreted in iso15.

 

Note: There is no representation in iso15 for the changed characters in iso_1 (CEDILLA, for example) . For existing data to be interpreted correctly, you must translate all character data from iso_1 to iso15 by removing all character values mentioned above or substituting a replacement character, such as '?', for the characters that cannot be represented in iso15.

The iso15 character set uses SYSCHARSETS.ID == 14.

 

Converting client applications

To avoid unnecessary conversion overhead, Sybase recommends that when you switch the Adaptive Server character set from roman8 to roman9, or from iso_1 to iso15, you do the same for client applications. Use the -J flag to explicitly declare a client character set:

 

% isql -Usa -P -Jroman9
% isql -Usa -P -Jiso15

This explicitly declares the client character set to be roman9 or iso15.


 

DOCUMENT ATTRIBUTES
Last Revised: Feb 08, 2002
Part Number: 
Product: Open Client, Adaptive Server Enterprise
Technical Topics: Migration
  
Business or Technical: Technical
Content Id: 1016214
Infotype: Download
 
 
 

 
© Copyright 2014, Sybase Inc. - v 7.6 Home / Contact Us / Help / Jobs / Legal / Privacy / Code of Ethics