|
|
|
|
|
OCLC Concise Batch Processing GuideAbout This GuideThe OCLC Concise Batch Processing Guide is intended to provide high-level information about different types of Batch Processing projects and how they may be initiated. For detailed information about OCLC local holdings record updating, refer to the Local Holdings Record Updating Service User Guide. back to topWhat is Batch Processing?Maintain holdings in WorldCatBatch processing is a way for your institution to maintain your holdings and add records offline. You can send files of MARC or non-MARC bibliographic records or numeric search keys to OCLC, to set and cancel holdings. Bibliographic records may also be added to WorldCat. Local data records (LDRs) may be added and updated through batch processing. No matter how incomplete your cataloging is, files can be prepared for batch processing by OCLC if some basic access points are available. These basic access points include:
OCLC-MARC outputYou may also choose to receive a copy of the matching OCLC bibliographic records found during Batch Processing. back to topTypes of Batch Processing ProjectsGeneral informationOCLC's Batch Services staff assigns a unique project ID for all batchload projects. This unique project ID is used for tracking project file submittals and processing them through completion. Records are billed under differing product codes, depending on the type of project and type of records being submitted for processing and record output. An evaluation fee for new projects (TAP 3921 or TAP 3845) may be charged to cover record analysis and setup. Batch post-processing reports are available on OCLC Product Services Web (PSWeb) after every batch of records is processed. OCLC-MARC records from Batch Processing projects are also available on OCLC Product Services Web (PSWeb). Retrieving records and reports via OCLC Product Services WebTo retrieve files using OCLC Product Services Web (PSWeb):
Retrospective (retro) batchloadYour institution can submit all of the records in your local system for Batch Processing. This is normally referred to as retro load. All records read are billed under one product code, TAP 3981 for single institutions, or TAP 3983 for groups. After the retro load is complete, some institutions maintain their holdings online locally. Therefore this type of Batch Processing project is referred to as one-time retro. In this case, the institution does no further batchloading. Retrospective (retro) batchload followed by ongoing batchloadYou may also choose to continue with batchload on an ongoing basis. This would be referred to as retro followed by ongoing. For tracking purposes, a separate definition is created for both the retro and the ongoing. When going directly from retro batchload to ongoing batchload, there are no additional evaluation fees since the only differences between the definitions are project IDs and product codes. The setup remains the same. Ongoing batchloadYou can submit files to OCLC for batchloading on an ongoing basis. When a matching record is found in WorldCat, holdings are set or canceled according to the Record Status of each record read. The Record Status code (Leader byte 5) must be coded appropriately so that OCLC's batchload software will know when to cancel or set holdings. Since all records read are billed during pre-processing, any holdings canceled receive a credit. If your institution cannot identify "cancel holding" transactions by the record status code, these records must be submitted for batch processing separately. (See Cancel holdings.) Product codes for ongoing projects vary according to the type of records submitted:
Commonly, ongoing batchload projects consist of both OCLC-derived and non-OCLC-derived records. These projects are referred to as mixed-batch. During batchload pre-processing, files are split and billed according to the presence of an OCLC control number. Records that contain an OCLC control number are billed under the OCLC-derived product codes; records without an OCLC control number are billed under the non OCLC-derived product codes. Cancel holdingsThere are two ways to perform cancel holding or deletes:
If your institution's goal is to remove ALL holdings in WorldCat, a scan/delete should be requested. (See Scan/delete processing.) Batchload with outputWhen your institution sets its holding symbol in WorldCat through batchloading, you can also elect to receive a copy of the matching OCLC MARC bibliographic record. In addition to the per-record read and evaluation fee billed in pre-processing, a per-record matched (record returned) charge is incurred. The "output" product code varies depending on the institution or group, and if it is billable. Batchload output is billed at one of the following product codes:
Local data retentionIt is important to think about local data retention when requesting output from a batchload project. For example, you may need to retain your system control number in the output record so you can overlay the old record in your local system with the new OCLC MARC bibliographic records. For single institution batchload projects, any local data can be retained. If the data is not already contained in one of the fields listed below, it will need to be mapped to and retained in one of them:
When local data is retained, the entire fields are retained. You cannot request partial fields or selected subfields. Although most commonly used when libraries request batchload output, there are other reasons for retaining local data. Any single institution can request local data retention. This will ensure that your local data appears in the OCLC archive record should you ever need to request copies of these records through the Bibliographic Record Snapshot service. There are limitations on local data retention for group batchload projects (see Group batchload). You can also choose between the following options to determine how you would like to receive your file of matching OCLC MARC records:
Reclamation projectThere are times when an institution's holdings have not been maintained in WorldCat for one reason or another. If this is the case, you may have no easy way of identifying the records for which you need to cancel or set holdings. When this occurs, a reclamation project may be the easiest way to get your holdings in WorldCat in sync with the holdings in your local system. This type of project is meant to be a one-time solution. In a reclamation project, an institution sends the records from their local system, for which they want holdings set. After batch processing is complete, any holdings not updated by this effort will be deleted from WorldCat. This is done using a Date Stamp (typically the date on which you extract your records to submit them to OCLC). These "untouched" holdings will be deleted in a separate process called a Scan/Delete. A scan is done on WorldCat, looking for your holdings symbol. When your symbol is found and the Set Holdings Date Stamp is earlier than the Date Stamp agreed upon, your holdings symbol is removed. Any holdings set on the Date Stamp date or after, will remain set on the record in WorldCat. To initiate a reclamation project, first complete a Batchload Order form. After receiving a Batchload Project ID, submit a file (or files) containing ALL of your current holdings. Based on information supplied on the Batchload Order form, an OCLC database specialist will set up your project for processing. During the setup process they will ask you what date to use for the Scan/Delete of "untouched" holdings. OCLC will perform a Scan/Delete after processing your files, and remove holdings only from those records that have not been touched by the dataload. Any holdings set online after you've submitted your records for batchload (the Date Stamp) will NOT be removed by the Scan/Delete. There is no reason to stop online cataloging, if that is part of your regular workflow. OCLC does recommend that you stop deleting holdings until the Reclamation is complete. OCLC has offered members a one-time free reclamation project. Members are expected to keep their holdings in WorldCat current from that point forward, either online through Connexion or using ongoing Batchload. Scan/delete processingScan/delete processing removes all occurrences of an OCLC institution symbol. All local data records (LDRs) can also be removed with this process. OCLC does not require a file of records from your institution to perform this function. It is actually a scan process that looks for a given OCLC institution holding symbol and removes it from WorldCat. This scan process can only be done at the institution holding level, not the local holding library level. To remove a subset of your institution's holdings, a file of records must be submitted for a cancel holdings project. (See Cancel holdings.) To request a scan/delete, the following information must be submitted to OCLC Batch Services in writing (letter or e-mail):
OCLC does not have a specified turnaround time for completing scan/delete requests. If there is a need to have a scan/delete done within a specific time period, please include this information with the request. Scan/delete is a permanent process that cannot be reversed once it has been performed. There is no charge or related product code for this service. Adding original recordsOriginally cataloged records can be added to WorldCat through batchloading. Currently, this is only an option for single institution batchload projects. Adding this option for groups is currently not available. Unresolved records that do not find a matching record in WorldCat after batchloading unique-key and extended matching can be examined to determine if they are 1) original records to WorldCat and 2) meet minimum cataloging requirements. Records should adhere to national standards for cataloging, including those for descriptive cataloging and forms of headings. They should also meet OCLC K-level standards for completeness, as described in Bibliographic Formats and Standards, Third Edition. It should be noted that records can fail to find a match through batchloading for other reasons besides being original to WorldCat. Significant differences between the institution record and the WorldCat record can prevent a record from finding its match. Manual searching is needed to determine the true duplication rate. If fewer than 20% of the non-matching records duplicate records in WorldCat, and if these records adhere to OCLC's cataloging standards, a new definition and setup will be created to add records that do not contain validation errors. This type of processing is called conditional adds processing (CA). The desirability of adding records is determined for the entire file, not record-by-record. Conditional adds analysis is a time consuming process that has to be "worked in" to the daily workload of Batch Support staff, while they continue to provide ongoing support for existing projects and define new projects. Group batchloadBatchloading for groups has many similarities to single institution projects, but there are some distinct differences that need to be taken into account when planning a group batchload project. A group of institutions can submit files of consolidated records for batchloading; that is, a single file of records can be processed to set or cancel multiple OCLC holding symbols in WorldCat. To do this, the records must contain library identifiers within the bibliographic records themselves. It does not matter where these codes (library identifiers) reside within the bibliographic record as long as they are in the same field and subfield throughout the file. A translation table is required for group processing. This table lists the library identifiers (in the bibliographic records) and the corresponding OCLC institution symbols that are to be set or canceled during batchloading. A "unique" group batchload symbol will be assigned for batchload projects when the group symbol also represents one of the members of the group for whom holdings will be set. The OCLC database specialist assigned to the group will also request that the unique symbol be profiled at OCLC, as it is for internal batchloading purposes only. No holdings will ever be set for that unique group symbol; it is used primarily for tracking files and billing. Batchload projects are very limited in the area of local data retention. A group project can be set up to retain, for example, a system control number in an 035 field or a 9XX field. When MARC record output is requested with a group batchload project, it should be noted that a copy of the matching OCLC MARC record would be created for each institution setting holdings on a record. For example, one record containing three library identifiers will set three holdings, and three copies of that same matching OCLC record will be found in the resulting output file. The group will also be billed for three OCLC MARC records returned (CTR 3338). Group post-processing reports contain a list of holdings set or cancelled for each member of the group (symbol on translation table). That report gives the group members statistics of that specific batch processed and the accumulated set or cancelled stats from the beginning of the project. On OCLC Product Services Web (PSWeb), group post-processing reports end with a .GROUP extension. PCC BIBCO participants and batchloadingBatchloading has been enhanced to allow PCC BIBCO participants to:
PCC BIBCO participants can include their PCC records in the same file as non-PCC records. Using the criteria below, OCLC will split the PCC records out of your incoming batchload file. This process creates two files, one containing PCC records and another containing non-PCC records. Criteria used to identify PCC records:
Two separate batchload projects and setups will be created for your institution. One is for the processing of the PCC records and another for the non-PCC records. The non-PCC records are processed through the normal batchload job stream. PCC records are processed through a separate job stream created just for PCC records. This job stream allows for setting holdings, replacing records and adding records as described above. There is no additional evaluation fee for the separate PCC definition and setup. Of course, you may separate PCC from non-PCC locally prior to submission to OCLC for separate Batch Processing. Batchloading of non-MARC records, or numeric search keysIn response to member library requests, OCLC has expanded the scope of batchloading by developing custom processes that help "build" usable MARC records from "patterned data" received in non-MARC formats, and from numeric search keys. Patterned data
None of these data forms are mutually exclusive. Sometimes records can be a composite of all three of these basic data types. Examples of commonly used non-MARC data formats are Microsoft©Excel spreadsheets, Access© databases, and Microsoft©Word documents. Once the MARC records are built, the file is analyzed like any other file submitted for batchloading. There is no additional cost to have non-MARC data converted to MARC for batchloading, although it should be noted this pre-processing could extend the 90-day turnaround time. Numeric search key
Send numeric search key files via e-mail attachment to: batchload@oclc.org. Local data record updatingOCLC local data record updating (previously known as LDRUS), provides automated batch processing that allows OCLC Union List or Group Access/Union List group participants to add and update copy-specific-holdings data in WorldCat. The service takes library serials holdings records, batch processes them, and creates local data records (LDRs) that are then added to WorldCat for use in OCLC Union List, WorldCat Resource Sharing, and OCLC FirstSearch. Local data record updating is intended to be an ongoing service. Regular updates are suggested, but the schedule varies according to individual needs. The service can also be used as a one-time project, with holdings maintained manually thereafter. There is currently no charge for batch processing of serials holdings data via local data record updating. Additionally, the batchload evaluation fee (TAP 3921) is currently waived for libraries whose holdings data is compliant with the MARC 21 Format for Holdings Data. The batchload evaluation fee (TAP 3921) will be charged for libraries submitting non-MARC files. Additional charges may apply if the data does not meet the local data record updating specifications and requires custom programming. Post-processing reports are made available on the OCLC Product Services Web after each batch of records is processed. There is no record output available from this product. General Guidelines
Please consult the OCLC Batch Services: Local Data Record Updating Guide for detailed information about record requirements, procedures for initiating projects and procedures for submitting files. How to Start a New Batch Processing ProjectGetting StartedTo initiate any new Batch Processing project, two items are necessary:
Note: An Application and Agreement for Tapeloading Membership Agreement and EDX Assessment form may also need to be completed, depending on the type of project and delivery method selected. Once OCLC Batch Services receives both a completed order form and a file of records, the project is considered "in-house." Many times these two items are sent to OCLC on different dates, which is perfectly acceptable. It should be noted that the date the last item is received is used as the "in-house" date for that new project. Input media for Batch ProcessingPhysical media When putting data onto a physical media for Batch Processing the following should be considered:
Always label the tape, cartridge or diskette with the OCLC institution symbol, project ID and the record count. A sticker label that will adhere to the flat portion of the tape or cartridge should be used. Electronic transmission An institution that is submitting data via FTP for a one-time Batch Processing project does not need to establish an EDX account. For one-time transfers, please contact OCLC at batchservices@oclc.org. Files submitted electronically via e-mail attachment should be sent to batchload@oclc.org. Preparing files for electronic transmission via EDXWhen submitting records to OCLC for Batch Processing via EDX, a DATA file containing MARC records, and a LABEL file containing characteristics of the MARC record file must be created for each file submitted. Requirements for a LABEL file:
Example of how the LABEL file should look: Note: If you are using an Innovative/III system, do not follow the LABEL file instructions above. Refer to your systems manual or contact the Batch Administration Section for further details. Internet FTP Sample Session Note: Your session may look different. If you have trouble, contact OCLC Batch Administration (ext. 6026 or e-mail batchservices@oclc.org ) or OCLC User & Network Support at (800) 848-5800 or (614) 793-8682, or e-mail support@oclc.org. A sample FTP session follows:
E-mail notification Setting expectations for new batchload projectsA new batchload project can take anywhere from 30 to 90 days to set up (see Diagram 1). The 90-day turnaround time starts when the project is "in-house." On average, new projects are turned around in 50 days. If there is some type of funding deadline or other time period that needs to be considered, OCLC must be made aware of this upfront. This can be achieved by including this information in the Special Instructions area of the Batchload Order form, or by having your regional service provider send an e-mail message to OCLC Batch Services. Special circumstances will be considered whenever possible. However, the best course of action is to get the project to OCLC at least 90 days prior to any needed completion date. Many times an institution will ask if sending a smaller file of records—a subset of the entire file—will speed up the initiation process. The answer to this question is no. Smaller files are less likely to be representative of the records an institution sends to OCLC for the project. As a result, the processing of subsequent files may be delayed or be in error. OCLC recommends that you send all the records you have ready for batch processing when you initiate the new project. It is not necessary for you to send a "sample file" of records prior to sending live data for analysis and subsequent processing, but you do have this option. Why do new batchload projects take up to 90 days to evaluate and process?It is often asked, "Why do new batchload projects take up to 90 days to evaluate and process?" The diagram below will help answer this question. All new batchload projects go through the following steps: Diagram 1: Workflow for new batchload projects Unresolved RecordsUnresolved or non-matching records from batchload can be handled in the following ways:
|