Page 133 - Building Digital Libraries
P. 133

CHAPTER 6


                                                   the way that we both interact with and describe materials in the new and
                                                   emerging digital repository environments.




                                                   MARC
                                                   As previously noted, MARC was originally developed in 1969 by the
                                                   Library of Congress as a standard method for transferring bibliographic
                                                   data between systems on electromagnetic tapes. Today, MARC is the lingua
                                                   franca for transmitting data between ILS systems within the library commu-
                                                   nity. And while very few digital repositories handle MARC records directly,
                                                   a handful can export their bibliographic metadata in MARC or provide an
                                                   easy path to the creation of MARC records for stored content.
                                                      Within the library community, MARC itself is often misunderstood.
                                                   When many think MARC, they are often thinking of the fields and tags
                                                   defined by the physical rules governing the input of data, like RDA. Figure
                                                   6.1 shows a representation of a MARC21 record of an electronic thesis from
                                                   Ohio State University.
                                                      These rules define what data elements can be placed within a par-
                                                   ticular MARC field, creating specific “flavors” of MARC. For example, one
                                                   could find flavors for MARC21 (a merging of USMARC, CANMARC and
                                                   UKMARC), CHMARC, FINMARC, UNIMARC, and so on. Currently, the
                                                   Library of Congress recognizes approximately forty  different MARC flavors
                                                                                               7
                                                   that are actively utilized around the world.
                                                      Technically, MARC is nothing more than a binary data format made up
                                                   of three distinct parts: (1) the leader, (2) the directory, and (3) the biblio-
                                                   graphic data. The leader makes up the first twenty-four bytes and contains
                                                   information about the MARC record itself. The leader will include the total
                                                   length of the record, note the start position of the field data, the character
                                                   encoding, the record type, and the encoding level of the record. However,
                                                   it is within the leader that one of MARC’s most glaring limitations becomes
                                                   visible. A MARC record reserves just five bytes to define the total length of






















            FIGURE 6.1
            MARC21 Example

            118
   128   129   130   131   132   133   134   135   136   137   138