Page 195 - Building Digital Libraries
P. 195

CHAPTER 7


                                                   Metadata Repurposing

                                                   Digital repositories supporting OAI-PMH also offer an organization a num-
                                                   ber of opportunities to repurpose metadata between various systems. Within
                                                   the library community, many organizations still create MARC records for
                                                   items housed within their digital repository for indexing with the organiza-
                                                   tion’s ILS (integrated library system). This often means that organizations
                                                   are creating multiple copies of a metadata record—an original record cre-
                                                   ated in MARC and a record created within the digital repository. However,
                                                   by utilizing their OAI-PMH server, organizations can reduce the need for
                                                   duplicate metadata creation by simply deriving all metadata surrogates from
                                                   the metadata stored within the digital repository.



                                                   The Oregon State University Electronic Theses Process
                                                   Like many organizations, the Oregon State University Libraries (ORST)
                                                   requires graduate students to submit their thesis into the libraries’ institu-
                                                   tional repository (IR). The IR is where the primary metadata record for an
                                                   individual thesis is created; however, a MARC metadata record still must
                                                   be created for indexing into OCLC’s WorldCat database and the local ILS.
                                                   Previously, library technical services staff would simply re-create the MARC
                                                   records by hand utilizing the metadata from the IR as a template. And while
                                                   this process took only a handful of minutes to complete, when multiplied
                                                   over the course of 50–100 documents, it was found that a significant number
                                                   of resources were being used to re-create these metadata records.
                                                      In order to streamline the process of generating MARC metadata, a
                                                   process was developed that allowed technical services staff to utilize the
                                                   OAI-PMH server to output an item’s metadata and automatically generate
                                                   the necessary MARC records. Moreover, given the ability to harvest sets of
                                                   records in one-month increments, the harvesting process would only need
                                                   to be done at the end of each month, or twelve times a year.
                                                      To develop the process, ORST developed a custom XSLT crosswalk that
                                                   is specific to the electronic theses collection  and MarcEdit’s  built-in OAI-
                                                                                        10
                                                                                                      11
                                                   PMH harvester. Utilizing MarcEdit, library staff simply initialize the built-in
                                                   OAI-PMH harvester and provide the necessary harvesting information.
                                                      Using the MarcEdit OAI-PMH harvester, technical services staff are
                                                   able to set the harvesting range and the desired character set of the MARC
                                                   records. In figure 7.5, one can see that MarcEdit’s OAI-PMH harvester
                                                   enables the user to harvest from multiple known metadata types, as well as
                                                   convert XML data encoded in the UTF-8 character set into the more tra-
                                                   ditional MARC-8 character set. With the options set, staff simply run the
                                                   harvester, which returns a file of generated MARC records.








            180
   190   191   192   193   194   195   196   197   198   199   200