|
|
|
|
|
Technical Bulletin 240: Pinyin Conversion ProjectISSN: 1097-9654August 2000 About Technical Bulletin 240Route toCataloging and local system staff. Why Read This?To learn about the OCLC® conversion of transliterated Chinese characters in OCLC-MARC records from the Wade-Giles to the pinyin romanization system. This TB also defines the conversion markers for authority records (field 008/07, Romanization Scheme) and bibliographic records (field 987, Local Romanization/Conversion History). Record processing informationImportant: Sections 4 and 5 describe changes that may affect local system processing of OCLC-MARC records. Conversion start dateIn phases between August 1, 2000 and April 2001. OCLC will notify users of project progess via logon Messages of the Day. Manuals affectedOCLC Cataloging Service User Guide 1 Overview of Pinyin Conversion
BackgroundOCLC, in close cooperation with the Library of Congress and the Research Libraries Group, has developed plans for the conversion of authority and bibliographic records that use the Wade-Giles Chinese romanization scheme to reflect pinyin romanization. For the romanization of Chinese, most North American libraries have been using the Wade-Giles system, especially since its adoption by the Library of Congress in 1957. The pinyin romanization system, adopted by the People's Republic of China in 1958 and accepted by the government of Taiwan in 1999, in the meantime has become the standard for the rest of the world. It is recognized by the International Standards Organization (ISO) and the United Nations. Because pinyin is now the world standard, existing authority and bibliographic files will be converted to reflect pinyin. LC, RLG, and OCLC have been consulting regularly, especially since June 1999, in the planning, coding, testing, and implementation of the conversion project.
Project scheduleThe table below outlines the schedule of the Pinyin Conversion Project components discussed in this Technical Bulletin.
Impact on OCLC CJK softwareAside from updates to the validation tables to accommodate the pinyin authority and bibliographic record markers, there will be no major changes to OCLC CJK software. The Wade-Giles to Pinyin search key conversion program in OCLC CJK software will remain unchanged until most institutions have had a chance to convert their local catalogs. At that time, the direction of the conversion program will be reversed, in keeping with the request of the OCLC CJK Users Group. As always, this program should be used with descretion depending on individual institutional needs.
Pinyin Day OneOctober 1, 2000 is the mutually agreed upon date for the implementation of pinyin in United States bibliographic systems. Users should refrain from using pinyin for the formulation of systematically romanized access points and other data in bibliographic and authority records until that date. On October 1, 2000, all users should cease using Wade-Giles romanization and begin using pinyin romanization exclusively for all cataloging. See the Library of Congress's New Chinese Romanization Guidelines for the Pinyin Conversion Project. 2 Authority Records
Authority record conversionOCLC, LC and RLG have identified authority records that represent headings used in Chinese bibliographic records. OCLC will convert authority records for use in the national authority file. This conversion began on August 1, 2000 and will be completed by October 1, 2000 ("Day One"), at which time all LC Chinese current cataloging will begin to reflect pinyin romanization. Converted authority records will be marked with appropriate codes in the 008/07 (Romanization Scheme) fixed field.
Moratorium on record creation and editingFrom August 1 to September 30, 2000 there will be a moratorium on the creation of, or change to any authority record that contains (or will contain) systematically romanized Chinese language data in 1xx, 4xx, 5xx, 64x, or 663 fields. LC will not delete authority records with systematically romanized Chinese language data in 1xx, 4xx, 5xx, 64x, or 663 fields during this period. This moratorium will ensure that
End of moratorium on record creation and editingOCLC and LC expect to complete the conversion and distribution of the estimated 180,000 authority records by mid-September 2000. Since October 1, 2000 is the mutually agreed upon start date for the implementation of pinyin, LC and PCC participants will not use pinyin for the formulation of systematically romanized access points in bibliographic and authority records until October 1. The moratorium on the creation, change, and deletion of authority records will continue through September 30, 2000.
NACO participantsNACO participants should use the appropriate Pinyin Conversion Marker in new authority records created starting October 1, 2000. If updates to an existing authority record involve pinyin romanization, the record should include the appropriate Pinyin Conversion Marker starting October 1, 2000. 3 Bibliographic Records
Record conversionConversion of bibliographic records is based on the specifications developed in cooperation with LC and RLG. Converted bibliographic records will be marked with a locally defined MARC 987 (Local Romanization/Conversion History) field so that risk of double conversion is minimized.
CONSER serial record conversionOCLC will convert all the Chinese language CONSER serial records (estimated to be about 7,000 records) in one group, early in the conversion process. These will be reloaded and distributed to LC separately. Additional Wade-Giles text in other CONSER records will be converted to pinyin as encountered and will be distributed as part of the regular CONSER distribution.
Library of Congress record conversion scheduleRLG will convert LC's Chinese language bibliographic records and distribute them to LC and OCLC. This conversion is planned to be completed by October 1, 2000 ("Day One"), at which time all LC Chinese current cataloging will begin to reflect pinyin romanization.
OCLC record conversion scheduleOCLC will begin to convert the WorldCat bibliographic file, working backwards from the most recent records, soon after October 1, 2000. The expected completion date is April 2001.
New record creationAll OCLC Cataloging users should include an appropriate Pinyin Conversion Marker 987 in new bibliographic records created beginning October 1, 2000 that contain romanized Chinese characters. They should also add an appropriate field 987 to any existing bibliographic record containing pinyin romanized Chinese characters that is locked and replaced beginning October 1, 2000. Records that include field 987 will not be converted subsequently by OCLC's pinyin conversion programs, eliminating the potential for erroneous conversion.
Batchload processingOCLC will modify batchload processing to ensure that incoming bibliographic data is appropriately evaluated and converted to pinyin if necessary. This conversion processing ensures that existing matching software works correctly following conversion of Wade-Giles data in WorldCat. It also prevents the re-introduction of Wade-Giles data into WorldCat starting October 1, 2000.
Local data conversion optionsOCLC will offer a variety of local data conversion options to both members and non-members. The options are variations on the three scenarios below, each of which provides the option of including corrected authority records.
4 Pinyin Conversion Marker for Authority Records: Field 008/07
Field 008/07The 008/07 field ("Romanization scheme") will indicate whether an authority record has been converted to pinyin. Name, series and subject authority records will be given a pinyin marker during the conversion process. When the Library of Congress declares that all authority records in the name and subject authority files have been converted, the marker will no longer be used. If authority record conversion has not been completed by October 1, 2000 ("Day One"), catalogers may be asked to code new authority records 'c' or 'n' until conversion has been completed.
Value 'c'Use value 'c' to indicate headings which have been converted to the pinyin form according to the romanization guidelines prescribed by the Library of Congress. Note: No category of headings that are "already in pinyin form" exists, since technically no headings have been romanized according to the new guidelines. Current headings which appear to be in pinyin form either also represent valid Wade-Giles forms, or have not been romanized but established according to usage.
Value 'n'Use value 'n' to indicate headings which were considered for conversion but were not converted because they were not romanized according to Wade-Giles guidelines. Note: Use value 'n' for headings which appear to be in pinyin form but in fact have not been romanized. For example, the heading: Liu, Chang, 1954- appears to be in Wade-Giles form. In fact, the heading was established for the author of an English-language monograph. The form of heading was based on how the author's name appeared on the title page. After evaluation, code this heading 'n' because it has not been romanized according to Wade-Giles guidelines. It would not be converted to pinyin.
More informationFor the most current text of the Pinyin Conversion Marker for Authority Records, see the Library of Congress Pinyin Web site. 5 Pinyin Conversion Marker for Bibliographic Records: Field 987
Field 987Use local MARC field 987 to record temporary information about the conversion status of MARC records that contain romanized Chinese data. Participants in the Pinyin Conversion Project, an effort to convert romanized Chinese data from the Wade-Giles to pinyin romanization schemes, will use field 987. Beginning October 1, 2000, add this field to all records that include romanized Chinese characters.
Production of field 987Field 987 may originate from manual input or system generation.
Effect on record processingIf a record includes field 987, that record will not be converted by the utilities' conversion programs. Thus, adding field 987 to a record eliminates the potential for erroneous conversion.
Local system implementationThis local field will be implemented in the local library systems of the Library of Congress, OCLC, RLG, and others to store conversion status information until it is no longer needed. Because field 987 is a locally defined data element, all local systems are not required to implement this field. Its definition for use with the Pinyin Conversion Project may conflict with the local definition of field 987 in local systems not involved with the project. This should not be a problem for non-participants who do not receive cataloging records for Chinese materials from participants.
Field 987 definitionThe table below defines OCLC-MARC field 987. Note: NR=nonrepeatable field.
Field 987 examplesThe table below contains some common examples of 987 fileds added to converted bibliographic records.
Guidelines for manual inputThe table below provides guidelines for manually inputting values in the subfields of field 987.
More informationFor the most current text of the Pinyin Conversion Marker for bibliographic records, see the Library of Congress Pinyin Web site. |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||