0xx

Contents   Fixed field   0xx   1xx   2xx   3xx   4xx   5xx   6xx   7xx   8xx   9xx

066  Character Sets Present (NR)

       

Input Standards

System Supplied
1st Indicator  Undefined
blank character  Undefined
2nd Indicator  Undefined
blank character  Undefined
Subfields (R=Repeatable  NR=Nonrepeatable) Input Standards 
‡c Alternate graphic character set identification (R) Mandatory/Mandatory




Definition

Field 066 is system-supplied and identifies the presence of any character sets for non-Latin scripts in the record. You cannot add, edit, or delete field 066.

1st Indicator

Undefined. The 1st indicator position is undefined and contains a blank ( blank character ).
 
blank character Undefined 

2nd Indicator

Undefined. The 2nd indicator position is undefined and contains a blank ( blank character ).
 
blank character Undefined 

Subfields

 
‡c Alternate graphic character set identification   Subfield ‡c contains a code identifying the alternative character set used in the record. The subfield is repeated for each additional character set present. The following codes display:
 

$1 Chinese, Japanese, Korean vernacular present

(3 Basic Arabic present

(4 Extended Arabic present

(N Basic Cyrillic present

(Q Extended Cyrillic present

(S Extended Greek present

(2 Basic Hebrew present

Note: These character sets encode language data in the script of the language. They do not encode romanized data in Latin script. The dollar sign ( "$" ) means the character set has multiple bytes per character. The left paragraph mark ( "(" ) means the character set has one byte per character.

Character sets for Bengali, Devanagari, Tamil, and Thai. There are no MARC-8 character sets for Bengali, Devanagari, Tamil, and Thai. OCLC implemented the following script identification codes for these scripts based on the ISO 15924 Code Lists ( http://www.unicode.org/iso15924/codelists.html) and supports Unicode UTF-8 characters for these scripts.

Beng Bengali present.

Deva Deva present.

Taml Tamil present.

Thai Thai present.

Note: Records containing non-MARC-8 characters are expected to be output in the UTF-8 (Unicode) data format. If multiple non-Latin scripts exist in a single field or a single record and the MARC-8 data format is used, all non MARC-8 characters are expressed by numeric character reference (NCR) using the form , where x is in the lower case and 091A indicates the Unicode code point of the target script. Non-MARC-8 script code does not appear in subfield ‡6 of the 880 linkage field.

Printing

Field 066 does not print.

back to top