Oracle8i National Language Support Guide
Release 8.1.5

A67789-01

Library

Product

Contents

Index

Prev Next

A
Locale Data

This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server. It includes these topics:

You can also obtain information about supported character sets, languages, territories, and sorting orders by querying the dynamic data view V$NLS_VALID_VALUES. For more information on the data which can be returned by this view, see Oracle8i Reference.

Languages

Table A-1 lists the languages supported by the Oracle server.

Table A-1 Oracle Supported Languages
Name  Abbreviation 

AMERICAN  

us  

ARABIC  

ar  

BENGALI  

bn  

BRAZILIAN PORTUGUESE  

ptb  

BULGARIAN  

bg  

CANADIAN FRENCH  

frc  

CATALAN  

ca  

CROATIAN  

hr  

CZECH  

cs  

DANISH  

dk  

DUTCH  

nl  

EGYPTIAN  

eg  

ENGLISH  

gb  

ESTONIAN  

et  

FINNISH  

sf  

FRENCH  

f  

GERMAN DIN  

din  

GERMAN  

d  

GREEK  

el  

HEBREW  

iw  

HUNGARIAN  

hu  

ICELANDIC  

is  

INDONESIAN  

in  

ITALIAN  

i  

JAPANESE  

ja  

KOREAN  

ko  

LATIN AMERICAN SPANISH  

esa  

LATVIAN  

lv  

LITHUANIAN  

lt  

MALAY  

ms  

MEXICAN SPANISH  

esm  

NORWEGIAN  

n  

POLISH  

pl  

PORTUGUESE  

pt  

ROMANIAN  

ro  

RUSSIAN  

ru  

SIMPLIFIED CHINESE  

zhs  

SLOVAK  

sk  

SLOVENIAN  

sl  

SPANISH  

e  

SWEDISH  

s  

THAI  

th  

TRADITIONAL CHINESE  

zht  

TURKISH  

tr  

UKRAINIAN  

uk  

VIETNAMESE  

vn  

Translated Messages

Oracle error messages and user interfaces have been translated into the languages which are listed in Table A-2.

Table A-2 Oracle Supported Messages
Name  Abbreviation 

ARABIC  

ar  

BRAZILIAN PORTUGUESE  

ptb  

CATALAN  

ca  

CZECH  

cs  

DANISH  

dk  

DUTCH  

nl  

FINNISH  

sf  

FRENCH  

f  

GERMAN  

d  

GREEK  

el  

HEBREW  

iw  

HUNGARIAN  

hu  

ITALIAN  

i  

JAPANESE  

ja  

KOREAN  

ko  

LATIN AMERICAN SPANISH  

esa  

NORWEGIAN  

n  

POLISH  

pl  

PORTUGUESE  

pt  

ROMANIAN  

ro  

RUSSIAN  

ru  

SIMPLIFIED CHINESE  

zhs  

SLOVAK  

sk  

SPANISH  

e  

SWEDISH  

s  

TRADITIONAL CHINESE  

zht  

TURKISH  

tr  

Territories

Table A-3 lists the territories supported by the Oracle server.

Table A-3 Oracle Supported Territories
Name     

ALGERIA  

HUNGARY  

QATAR  

AMERICA  

ICELAND  

ROMANIA  

AUSTRALIA  

INDONESIA  

SAUDI ARABIA  

AUSTRIA  

IRAQ  

SINGAPORE  

BAHRAIN  

IRELAND  

SLOVAKIA  

BANGLADESH  

ISRAEL  

SLOVENIA  

BELGIUM  

ITALY  

SOMALIA  

BRAZIL  

JAPAN  

SOUTH AFRICA  

BULGARIA  

JORDAN  

SPAIN  

CANADA  

KAZAKHSTAN  

SUDAN  

CATALONIA  

KUWAIT  

SWEDEN  

CHINA  

LATVIA  

SWITZERLAND  

CIS  

LEBANON  

SYRIA  

CROATIA  

LIBYA  

TAIWAN  

CYPRUS  

KOREA  

THAILAND  

CZECH  

LITHUANIA  

THE NETHERLANDS  

CZECHOSLOVAKIA  

LUXEMBOURG  

TUNISIA  

DENMARK  

MALAYSIA  

TURKEY  

DJIBOUTI  

MAURITANIA  

UKRAINE  

EGYPT  

MEXICO  

UNITED ARAB EMIRATES  

ESTONIA  

MOROCCO  

UNITED KINGDOM  

FINLAND  

NEW ZEALAND  

UZBEKISTAN  

FRANCE  

NORWAY  

VIETNAM  

GERMANY  

OMAN  

YEMEN  

GREECE  

POLAND  

 

HONG KONG  

PORTUGAL  

 

Character Sets

Oracle-supported character sets are listed below, for easy reference, according to three broad language groups:

Note that some character sets may be listed under multiple language groups because they provide multilingual support. For instance, Unicode spans the Asian, European, and Middle Eastern language groups because it supports most of the major scripts of the world.

The comment section indicates the type of encoding used:

SB = Single-byte encoding

MB = Multi-byte encoding

FIXED = Fixed-width multi-byte encoding

As mentioned in Chapter 3, "Choosing a Character Set", the type of encoding will affect performance so you should use the most efficient encoding that meets your language needs. Also, some encoding types can only be used with certain data types. For instance, fixed-width multibyte encoded character sets can only be used as an NCHAR character set, and not as a database character set.

Also documented in the comment section are other unique features of the character set that may be important to users or your database administrator. For instance, whether the character set supports the new Euro currency symbol, whether user defined characters are supported for character set customization, and whether the character set is a strict superset of ASCII (which will allow you to make use of the ALTER DATABASE [NATIONAL] CHARACTER SET command in case of migration.)

EURO = Euro symbol supported

UDC = User-defined Characters supported

ASCII = Strict Superset of ASCII

Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you should refer to the actual national, international, or vendor-specific standards.

Asian Language Character Sets

Table A-4 lists the Oracle character sets that can support Asian languages.

Table A-4 Asian Language Character Sets
Name  Description  Comments 

BN8BSCII  

Bangladesh National Code 8-bit BSCII  

SB, ASCII  

ZHT16BIG5  

BIG5 16-bit Traditional Chinese  

MB, ASCII  

ZHS16CGB231280  

CGB2312-80 16-bit Simplified Chinese  

MB. ASCII  

JA16EUC  

EUC 24-bit Japanese  

MB, ASCII  

JA16EUCYEN  

EUC 24-bit Japanese with '\' mapped to the Japanese yen character  

MB  

JA16EUCFIXED  

EUC 16-bit Japanese. A fixed-width subset of JA16EUC (contains only the 2-byte characters of JA16EUC). Contains no 7- or 8-bit ASCII characters  

FIXED  

ZHT32EUC  

EUC 32-bit Traditional Chinese  

MB, ASCII  

ZHS16GBK  

GBK 16-bit Simplified Chinese  

MB, ASCII, UDC  

ZHS16GBKFIXED  

GBK 16-bit Simplified Chinese (16-bit fixed-width, no single byte)  

FIXED, UDC  

ZHT16CCDC  

HP CCDC 16-bit Traditional Chinese  

MB, ASCII  

JA16DBCS  

IBM EBCDIC 16-bit Japanese  

MB, UDC  

JA16EBCDIC930  

IBM DBCS Code Page 290 16-bit Japanese  

MB, UDC  

JA16DBCSFIXED  

IBM EBCDIC 16-bit Japanese (16-bit fixed width, no single byte)  

FIXED, UDC  

KO16DBCS  

IBM EBCDIC 16-bit Korean  

MB, UDC  

KO16DBCSFIXED  

IBM EBCDIC 16-bit Korean (16-bit fixed-width, no single byte)  

FIXED, UDC  

ZHS16DBCS  

IBM EBCDIC 16-bit Simplified Chinese  

MB, UDC  

ZHS16DBCSFIXED  

IBM EBCDIC 16-bit Simplified Chinese (16-bit fixed-width, no single byte)  

FIXED, UDC  

ZHT16DBCS  

IBM EBCDIC 16-bit Traditional Chinese  

MB, UDC  

KO16KSC5601  

KSC5601 16-bit Korean  

MB, ASCII  

KO16KSCCS  

KSCCS 16-bit Korean  

MB, ASCII  

JA16VMS  

JVMS 16-bit Japanese  

MB, ASCII  

ZHS16MACCGB231280  

Mac client CGB2312-80 16-bit Simplified Chinese  

MB  

JA16MACSJIS  

Mac client Shift-JIS 16-bit Japanese  

MB  

TH8MACTHAI  

Mac Client 8-bit Latin/Thai  

SB  

TH8MACTHAIS  

Mac Server 8-bit Latin/Thai  

SB, ASCII  

ZHT16MSWIN950  

MS Windows Code Page 950 Traditional Chinese  

MB, ASCII, UDC  

KO16MSWIN949  

MS Windows Code Page 949 Korean  

MB, ASCII, UDC  

VN8MSWIN1258  

MS Windows Code Page 1258 8-bit Vietnamese  

SB, ASCII, EURO  

IN8ISCII  

Multiple-Script Indian Standard 8-bit Latin/Indian
Languages  

SB, ASCII  

JA16SJIS  

Shift-JIS 16-bit Japanese  

MB, ASCII, UDC  

JA16SJISFIXED  

Shift-JIS 16-bit Japanese. A fixed-width subset of JA16SJIS (contains only the 2-byte characters of JA16JIS). Contains no 7- or 8-bit ASCII characters  

FIXED, UDC  

JA16SJISYEN  

Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character  

MB, UDC  

ZHT32SOPS  

SOPS 32-bit Traditional Chinese  

MB, ASCII  

ZHT16DBT  

Taiwan Taxation 16-bit Traditional Chinese  

MB, ASCII  

TH8TISASCII  

Thai Industrial Standard 620-2533 - ASCII 8-bit  

SB, ASCII, EURO  

TH8TISEBCDIC  

Thai Industrial Standard 620-2533 - EBCDIC 8-bit  

SB  

ZHT32TRIS  

TRIS 32-bit Traditional Chinese  

MB, ASCII  

ZHT32TRISFIXED  

TRIS 32-bit Fixed-width Traditional Chinese  

FIXED  

AL24UTFFSS  

Unicode 1.1 UTF-8 Universal character set  

MB, ASCII, EURO  

UTF8  

Unicode 2.0 UTF-8 Universal character set  

MB, ASCII, EURO  

VN8VN3  

VN3 8-bit Vietnamese  

SB, ASCII  

European Language Character Sets

Table A-5 lists the Oracle character sets that can support European languages.

Table A-5 European Language Character Sets
Name  Description  Comments 

US7ASCII  

ASCII 7-bit American  

SB, ASCII  

SF7ASCII  

ASCII 7-bit Finnish  

SB  

YUG7ASCII  

ASCII 7-bit Yugoslavian  

SB  

RU8BESTA  

BESTA 8-bit Latin/Cyrillic  

SB, ASCII  

EL8GCOS7  

Bull EBCDIC GCOS7 8-bit Greek  

SB  

WE8GCOS7  

Bull EBCDIC GCOS7 8-bit West European  

SB  

EL8DEC  

DEC 8-bit Latin/Greek  

SB  

TR7DEC  

DEC VT100 7-bit Turkish  

SB  

TR8DEC  

DEC 8-bit Turkish  

SB, ASCII  

TR8EBCDIC  

EBCDIC Code Page 1026 8-bit Turkish  

SB  

TR8PC857  

IBM-PC Code Page 857 8-bit Turkish  

SB, ASCII  

TR8MACTURKISH  

MAC Client 8-bit Turkish  

SB  

TR8MACTURKISHS  

MAC Server 8-bit Turkish  

SB, ASCII  

TR8MSWIN1254  

MS Windows Code Page 1254 8-bit Turkish  

SB, ASCII, EURO  

WE8BS2000L5  

Siemens EBCDIC.DF.L5 8-bit West European/Turkish  

SB  

WE8DEC  

DEC 8-bit West European  

SB, ASCII  

D7DEC  

DEC VT100 7-bit German  

SB  

F7DEC  

DEC VT100 7-bit French  

SB  

S7DEC  

DEC VT100 7-bit Swedish  

SB  

E7DEC  

DEC VT100 7-bit Spanish  

SB  

NDK7DEC  

DEC VT100 7-bit Norwegian/Danish  

SB  

I7DEC  

DEC VT100 7-bit Italian  

SB  

NL7DEC  

DEC VT100 7-bit Dutch  

SB  

CH7DEC  

DEC VT100 7-bit Swiss (German/French)  

SB  

SF7DEC  

DEC VT100 7-bit Finnish  

SB  

WE8DG  

DG 8-bit West European  

SB, ASCII  

WE8EBCDIC37C  

EBCDIC Code Page 37 8-bit Oracle/c  

SB  

WE8EBCDIC37  

EBCDIC Code Page 37 8-bit West European  

SB  

D8EBCDIC273  

EBCDIC Code Page 273/1 8-bit Austrian German  

SB  

DK8EBCDIC277  

EBCDIC Code Page 277/1 8-bit Danish  

SB  

S8EBCDIC278  

EBCDIC Code Page 278/1 8-bit Swedish  

SB  

I8EBCDIC280  

EBCDIC Code Page 280/1 8-bit Italian  

SB  

WE8EBCDIC284  

EBCDIC Code Page 284 8-bit Latin American/Spanish  

SB  

WE8EBCDIC285  

EBCDIC Code Page 285 8-bit West European  

SB  

F8EBCDIC297  

EBCDIC Code Page 297 8-bit French  

SB  

WE8EBCDIC500C  

EBCDIC Code Page 500 8-bit Oracle/c  

SB  

WE8EBCDIC500  

EBCDIC Code Page 500 8-bit West European  

SB  

EE8EBCDIC870  

EBCDIC Code Page 870 8-bit East European  

SB  

WE8EBCDIC871  

EBCDIC Code Page 871 8-bit Icelandic  

SB  

EL8EBCDIC875  

EBCDIC Code Page 875 8-bit Greek  

SB  

CL8EBCDIC1025  

EBCDIC Code Page 1025 8-bit Cyrillic  

SB  

CL8EBCDIC1025X  

EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic  

SB  

BLT8EBCDIC1112  

EBCDIC Code Page 1112 8-bit Baltic Multilingual  

SB  

D8EBCDIC1141  

EBCDIC Code Page 1141 8-bit Austrian German  

SB, EURO  

DK8EBCDIC1142  

EBCDIC Code Page 1142 8-bit Danish  

SB, EURO  

S8EBCDIC1143  

EBCDIC Code Page 1143 8-bit Swedish  

SB, EURO  

I8EBCDIC1144  

EBCDIC Code Page 1144 8-bit Italian  

SB, EURO  

F8EBCDIC1147  

EBCDIC Code Page 1147 8-bit French  

SB, EURO  

EEC8EUROASCI  

EEC Targon 35 ASCI West European/Greek  

SB  

EEC8EUROPA3  

EEC EUROPA3 8-bit West European/Greek  

SB  

LA8PASSPORT  

German Government Printer 8-bit All-European Latin  

SB, ASCII  

WE8HP  

HP LaserJet 8-bit West European  

SB  

WE8ROMAN8  

HP Roman8 8-bit West European  

SB, ASCII  

HU8CWI2  

Hungarian 8-bit CWI-2  

SB, ASCII  

HU8ABMOD  

Hungarian 8-bit Special AB Mod  

SB, ASCII  

LV8RST104090  

IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic)  

SB, ASCII  

US8PC437  

IBM-PC Code Page 437 8-bit American  

SB, ASCII  

BG8PC437S  

IBM-PC Code Page 437 8-bit (Bulgarian Modification)  

SB, ASCII  

EL8PC437S  

IBM-PC Code Page 437 8-bit (Greek modification)  

SB, ASCII  

EL8PC737  

IBM-PC Code Page 737 8-bit Greek/Latin  

SB  

LT8PC772  

IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic)  

SB, ASCII  

LT8PC774  

IBM-PC Code Page 774 8-bit Lithuanian (Latin)  

SB, ASCII  

BLT8PC775  

IBM-PC Code Page 775 8-bit Baltic  

SB, ASCII  

WE8PC850  

IBM-PC Code Page 850 8-bit West European  

SB, ASCII  

EL8PC851  

IBM-PC Code Page 851 8-bit Greek/Latin  

SB, ASCII  

EE8PC852  

IBM-PC Code Page 852 8-bit East European  

SB, ASCII  

RU8PC855  

IBM-PC Code Page 855 8-bit Latin/Cyrillic  

SB, ASCII  

WE8PC858  

IBM-PC Code Page 858 8-bit West European  

SB, ASCII, EURO  

WE8PC860  

IBM-PC Code Page 860 8-bit West European  

SB. ASII  

IS8PC861  

IBM-PC Code Page 861 8-bit Icelandic  

SB, ASCII  

CDN8PC863  

IBM-PC Code Page 863 8-bit Canadian French  

SB, ASCII  

N8PC865  

IBM-PC Code Page 865 8-bit Norwegian  

SB. ASCII  

RU8PC866  

IBM-PC Code Page 866 8-bit Latin/Cyrillic  

SB, ASCII  

EL8PC869  

IBM-PC Code Page 869 8-bit Greek/Latin  

SB, ASCII  

LV8PC1117  

IBM-PC Code Page 1117 8-bit Latvian  

SB, ASCII  

US8ICL  

ICL EBCDIC 8-bit American  

SB  

WE8ICL  

ICL EBCDIC 8-bit West European  

SB  

WE8ISOICLUK  

ICL special version ISO8859-1  

SB  

WE8ISO8859P1  

ISO 8859-1 West European  

SB, ASCII  

EE8ISO8859P2  

ISO 8859-2 East European  

SB, ASCII  

SE8ISO8859P3  

ISO 8859-3 South European  

SB, ASCII  

NEE8ISO8859P4  

ISO 8859-4 North and North-East European  

SB, ASCII  

CL8ISO8859P5  

ISO 8859-5 Latin/Cyrillic  

SB, ASCII  

AR8ISO8859P6  

ISO 8859-6 Latin/Arabic  

SB, ASCII  

EL8ISO8859P7  

ISO 8859-7 Latin/Greek  

SB, ASCII  

IW8ISO8859P8  

ISO 8859-8 Latin/Hebrew  

SB, ASCII  

NE8ISO8859P10  

ISO 8859-10 North European  

SB, ASCII  

WE8ISO8859P15  

ISO 8859-15 West European  

SB, ASCII, EURO  

LA8ISO6937  

ISO 6937 8-bit Coded Character Set for Text Communication  

SB, ASCII  

IW7IS960  

Israeli Standard 960 7-bit Latin/Hebrew  

SB  

AR8ARABICMAC  

Mac Server 8-bit Latin/Arabic  

SB  

EE8MACCE  

Mac Client 8-bit Central European  

SB  

EE8MACCROATIAN  

Mac Client 8-bit Croatian  

SB  

WE8MACROMAN8  

Mac Client 8-bit Extended Roman8 West European  

SB  

EL8MACGREEK  

Mac Client 8-bit Greek  

SB  

IS8MACICELANDIC  

Mac Client 8-bit Icelandic  

SB  

CL8MACCYRILLIC  

Mac Client 8-bit Latin/Cyrillic  

SB  

AR8ARABICMACS  

Mac Server 8-bit Latin/Arabic  

SB, ASCII  

EE8MACCES  

Mac Server 8-bit Central European  

SB, ASCII  

EE8MACCROATIANS  

Mac Server 8-bit Croatian  

SB, ASCII  

WE8MACROMAN8S  

Mac Server 8-bit Extended Roman8 West European  

SB, ASCII  

CL8MACCYRILLICS  

Mac Server 8-bit Latin/Cyrillic  

SB, ASCII  

EL8MACGREEKS  

Mac Server 8-bit Greek  

SB, ASCII  

IS8MACICELANDICS  

Mac Server 8-bit Icelandic  

SB  

BG8MSWIN  

MS Windows 8-bit Bulgarian Cyrillic  

SB, ASCII  

LT8MSWIN921  

MS Windows Code Page 921 8-bit Lithuanian  

SB, ASCII  

ET8MSWIN923  

MS Windows Code Page 923 8-bit Estonian  

SB, ASCII  

EE8MSWIN1250  

MS Windows Code Page 1250 8-bit East European  

SB, ASCII, EURO  

CL8MSWIN1251  

MS Windows Code Page 1251 8-bit Latin/Cyrillic  

SB, ASCII, EURO  

WE8MSWIN1252  

MS Windows Code Page 1252 8-bit West European  

SB, ASCII, EURO  

EL8MSWIN1253  

MS Windows Code Page 1253 8-bit Latin/Greek  

SB, ASCII, EURO  

BLT8MSWIN1257  

MS Windows Code Page 1257 8-bit Baltic  

SB, ASCII, EURO  

BLT8CP921  

Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic  

SB, ASCII  

LV8PC8LR  

Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic  

SB, ASCII  

WE8NCR4970  

NCR 4970 8-bit West European  

SB, ASCII  

WE8NEXTSTEP  

NeXTSTEP PostScript 8-bit West European  

SB, ASCII  

CL8KOI8R  

RELCOM Internet Standard 8-bit Latin/Cyrillic  

SB, ASCII  

US8BS2000  

Siemens 9750-62 EBCDIC 8-bit American  

SB  

DK8BS2000  

Siemens 9750-62 EBCDIC 8-bit Danish  

SB  

F8BS2000  

Siemens 9750-62 EBCDIC 8-bit French  

SB  

D8BS2000  

Siemens 9750-62 EBCDIC 8-bit German  

SB  

E8BS2000  

Siemens 9750-62 EBCDIC 8-bit Spanish  

SB  

S8BS2000  

Siemens 9750-62 EBCDIC 8-bit Swedish  

SB  

DK7SIEMENS9780X  

Siemens 97801/97808 7-bit Danish  

SB  

F7SIEMENS9780X  

Siemens 97801/97808 7-bit French  

SB  

D7SIEMENS9780X  

Siemens 97801/97808 7-bit German  

SB  

I7SIEMENS9780X  

Siemens 97801/97808 7-bit Italian  

SB  

N7SIEMENS9780X  

Siemens 97801/97808 7-bit Norwegian  

SB  

E7SIEMENS9780X  

Siemens 97801/97808 7-bit Spanish  

SB  

S7SIEMENS9780X  

Siemens 97801/97808 7-bit Swedish  

SB  

WE8BS2000  

Siemens EBCDIC.DF.04 8-bit West European  

SB  

CL8BS2000  

Siemens EBCDIC.EHC.LC 8-bit Cyrillic  

SB  

AL24UTFFSS  

Unicode 1.1 UTF-8 Universal character set  

MB, ASCII, EURO  

UTF8  

Unicode 2.0 UTF-8 Universal character set  

MB, ASCII, EURO  

Middle Eastern Language Character Sets

Table A-6 lists the Oracle character sets that can support Middle Eastern languages.

Table A-6 Middle Eastern Character Sets
Name  Description  Comments 

AR8APTEC715  

APTEC 715 Server 8-bit Latin/Arabic  

SB, ASCII  

AR8ASMO708PLUS  

ASMO 708 Plus 8-bit Latin/Arabic  

SB, ASCII  

AR8ASMO8X  

ASMO Extended 708 8-bit Latin/Arabic  

SB, ASCII  

AR8ADOS710  

Arabic MS-DOS 710 Server 8-bit Latin/Arabic  

SB, ASCII  

AR8ADOS720  

Arabic MS-DOS 720 Server 8-bit Latin/Arabic  

SB, ASCII  

TR7DEC  

DEC VT100 7-bit Turkish  

SB  

TR8DEC  

DEC 8-bit Turkish  

SB  

WE8EBCDIC37C  

EBCDIC Code Page 37 8-bit Oracle/c  

SB  

IW8EBCDIC424  

EBCDIC Code Page 424 8-bit Latin/Hebrew  

SB  

WE8EBCDIC500C  

EBCDIC Code Page 500 8-bit Oracle/c  

SB  

IW8EBCDIC1086  

EBCDIC Code Page 1086 8-bit Hebrew  

SB  

AR8EBCDICX  

EBCDIC XBASIC Server 8-bit Latin/Arabic  

SB  

TR8EBCDIC1026  

EBCDIC Code Page 1026 8-bit Turkish  

SB  

TR8PC857  

IBM-PC Code Page 857 8-bit Turkish  

SB, ASCII  

IW8PC1507  

IBM-PC Code Page 1507/862 8-bit Latin/Hebrew  

SB, ASCII  

AR8ISO8859P6  

ISO 8859-6 Latin/Arabic  

SB, ASCII  

IW8ISO8859P8  

ISO 8859-8 Latin/Hebrew  

SB, ASCII  

WE8ISO8859P9  

ISO 8859-9 West European & Turkish  

SB, ASCII  

LA8ISO6937  

ISO 6937 8-bit Coded Character Set for Text Communication  

SB, ASCII  

IW7IS960  

Israeli Standard 960 7-bit Latin/Hebrew  

SB  

IW8MACHEBREW  

Mac Client 8-bit Hebrew  

SB  

AR8ARABICMAC  

Mac Client 8-bit Latin/Arabic  

SB  

TR8MACTURKISH  

Mac Client 8-bit Turkish  

SB  

IW8MACHEBREWS  

Mac Server 8-bit Hebrew  

SB, ASCII  

AR8ARABICMACS  

Mac Server 8-bit Latin/Arabic  

SB, ASCII  

TR8MACTURKISHS  

Mac Server 8-bit Turkish  

SB, ASCII  

TR8MSWIN1254  

MS Windows Code Page 1254 8-bit Turkish  

SB, ASCII, EURO  

IW8MSWIN1255  

MS Windows Code Page 1255 8-bit Latin/Hebrew  

SB, ASCII, EURO  

AR8MSWIN1256  

MS Windows Code Page 1256 8-Bit Latin/Arabic  

SB. ASCII, EURO  

IN8ISCII  

Multiple-Script Indian Standard 8-bit Latin/Indian
Languages  

SB  

AR8MUSSAD768  

Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic  

SB, ASCII  

AR8NAFITHA711  

Nafitha Enhanced 711 Server 8-bit Latin/Arabic  

SB, ASCII  

AR8NAFITHA721  

Nafitha International 721 Server 8-bit Latin/Arabic  

SB, ASCII  

AR8SAKHR706  

SAKHR 706 Server 8-bit Latin/Arabic  

SB, ASCII  

AR8SAKHR707  

SAKHR 707 Server 8-bit Latin/Arabic  

SB, ASCII  

WE8BS2000L5  

Siemens EBCDIC.DF.04.L5 8-bit West European/Turkish  

SB  

AL24UTFFSS  

Unicode 1.1 UTF-8 Universal character set  

MB. ASCII, EURO  

UTF8  

Unicode 2.0 UTF-8 Universal character set  

MB, ASCII, EURO  

Universal Character Sets

Table A-7 lists the Oracle character sets that provide universal language support, that is, they attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.

Table A-7 Universal Character Sets
Name  Description  Comments 

AL24UTFFSS  

Unicode 1.1 UTF-8 Universal character set  

MB, ASCII, EURO  

UTF8  

Unicode 2.0 UTF-8 Universal character set  

MB, ASCII, EURO  

Note: The Unicode 1.1 character set has been superseded by Unicode 2.0. One of the major differences between version 1.1 and 2.0 is the redefinition and addition of 11,172 Korean characters. Whenever possible, you should use the latest version of the Unicode standard. The primary scripts currently supported by Unicode 2.0 are:

Arabic  

Gujarati  

Latin  

Armenian  

Gurmukhi  

Lao  

Bengali  

Han  

Malayalam  

Bopomofo  

Hangul  

Oriya  

Cyrillic  

Hebrew  

Tamil  

Devanagari  

Hiragana  

Telugu  

Georgian  

Kannada  

Thai  

Greek  

Katakana  

Tibetan  

For details on the Unicode standard, see http://www.unicode.org or refer to the Unicode Standard, defined by the Unicode consortium.

Linguistic Definitions

Linguistic definitions define linguistic cases for particular languages. Extended linguistic definitions include some special linguistic cases for the language. Typically, using the extended definition means that characters will be sorted differently from their ASCII values. For example, ch and ll are treated as only one character in XSPANISH. Table A-8 lists the linguistic definitions supported by the Oracle server.

Table A-8 Linguistic Definitions
Basic Name  Extended Name  Special Cases 

ARABIC  

--  

 

ARABIC_MATCH  

--  

 

ARABIC_ABJ_SORT  

--  

 

ARABIC_ABJ_MATCH  

--  

 

ASCII7  

--  

 

BENGALI  

--  

 

BULGARIAN  

--  

 

CANADIAN FRENCH  

--  

 

CATALAN  

XCATALAN  

æ, AE, ß  

CROATIAN  

XCROATIAN  

D, L, N, d, l, n, ß  

CZECH  

XCZECH  

ch, CH, Ch, ß  

DANISH  

XDANISH  

A, ß, Å , å  

DUTCH  

XDUTCH  

ij, IJ  

EEC_EURO  

--  

 

EEC_EUROPA3  

--  

 

ESTONIAN  

--  

 

FINNISH  

--  

 

FRENCH  

XFRENCH  

 

GERMAN  

XGERMAN  

ß  

GERMAN_DIN  

XGERMAN_DIN  

ß, ä, ö, ü, Ä, Ö, Ü  

GREEK  

--  

 

HEBREW  

--  

 

HUNGARIAN  

XHUNGARIAN  

cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs  

ICELANDIC  

--  

 

INDONESIAN  

--  

 

ITALIAN  

--  

 

JAPANESE  

--  

 

LATIN  

--  

 

LATVIAN  

--  

 

LITHUANIAN  

--  

 

MALAY  

--  

 

NORWEGIAN  

--  

 

POLISH  

--  

 

PUNCTUATION  

XPUNCTUATION  

 

ROMANIAN  

--  

 

RUSSIAN  

--  

 

SLOVAK  

XSLOVAK  

dz, DZ, Dz, ß (caron)  

SLOVENIAN  

XSLOVENIAN  

ß  

SPANISH  

XSPANISH  

ch, ll, CH, Ch, LL, Ll  

SWEDISH  

--  

 

SWISS  

XSWISS  

ß  

THAI_DICTIONARY  

--  

 

THAI_TELEPHONE  

--  

 

TURKISH  

XTURKISH  

æ, AE, ß  

UKRAINIAN  

--  

 

UNICODE_BINARY  

 

 

VIETNAMESE  

--  

 

WEST_EUROPEAN  

XWEST_EUROPEAN  

ß  

Calendar Systems

By default, most territory definitions use the Gregorian calendar system. Table A-9 lists the other calendar systems supported by the Oracle server.

Table A-9 NLS Supported Calendars
Name  Default Format  Character Set Used
For Default Format
 

Japanese Imperial  

EEYY"\307\257"MM"\267\356"DD"\306\374"  

JA16EUC  

ROC Official  

EEyy"\310\241"mm"\305\314"dd"\305\312"  

ZHT32EUC  

Thai Buddha  

dd month EE yyyy  

TH8TISASCII  

Persian  

DD Month YYYY  

AR8ASMO8X  

Arabic Hijrah  

DD Month YYYY  

AR8ISO8859P6  

English Hijrah  

DD Month YYYY  

AR8ISO8859P6  

March 20, 1998 looks like this in ROC Official:


March 27, 1998 looks like this in Japanese Imperial:


Character Sets that Support the Euro Symbol

Table A-10 lists the character sets that support the Euro symbol.

Table A-10 Character Sets with Euro Support
Name  Description  Euro Code Value 

D8EBCDIC1141  

EBCDIC Code Page 1141 8-bit Austrian German  

0x9F  

DK8EBCDIC1142  

EBCDIC Code Page 1142 8-bit Danish  

0x5A  

S8EBCDIC1142  

EBCDIC Code Page 1143 8-bit Swedish  

0x5A  

I8EBCDIC1144  

EBCDIC Code Page 1144 8-bit Italian  

0x9F  

F8EBCDIC1147  

EBCDIC Code Page 1147 8-bit French  

0x9F  

WE8PC858  

IBM-PC Code Page 858 8-bit West European  

0xD5  

WE8ISO8859P15  

ISO 8859-15 West European  

0xA4  

EE8MSWIN1250  

MS Windows Code Page 1250 8-bit East European  

0x80  

CL8MSWIN1251  

MS Windows Code Page 1251 8-bit Latin/Cyrillic  

0x88  

WE8MSWIN1252  

MS Windows Code Page 1252 8-bit West European  

0x80  

EL8MSWIN1253  

MS Windows Code Page 1253 8-bit Latin/Greek  

0x80  

TR8MSWIN1254  

MS Windows Code Page 1254 8-bit Turkish  

0x80  

BLT8MSWIN1257  

MS Windows Code Page 1257 Baltic  

0x80  

VN8MSWIN1258  

MS Windows Code Page 1258 8-bit Vietnamese  

0xA0  

TH8TISASCII  

Thai Industrial 520-2533 - ASCII 8-bit  

0x80  

AL24UTFFSS  

Unicode 1.1 UTF-8 Universal character set  

U+20AC  

UTF8  

Unicode 2.0 UTF-8 Universal character set  

U+20AC  




Prev

Next
Oracle
Copyright © 1999 Oracle Corporation.

All Rights Reserved.

Library

Product

Contents

Index