Page 94 - Building Digital Libraries
P. 94

General-Purpose Technologies Useful for Digital Repositories


                 digital repository will eventually be the storehouse for many important
                 documents and research. As such, an organization must ensure that its
                 repository will make the organization’s metadata available to it in whatever
                 format that it might need, both for the present and the future.



                 Why Use XML-Based Metadata

                 A common question that comes up many times during the planning stages
                 of many digital projects is: why should we utilize XML-based schemas at all
                 for bibliographic description? Libraries have seemed to get along just fine
                 with MARC for the last fifty years—why the sudden change? Why should
                 an organization adopt a new way of doing bibliographic description when
                 no one sees a problem with the current one? It’s a fair question and one
                 that an organization needs to be prepared to answer. This is especially true
                 for organizations that may have no other projects utilizing a non-MARC
                 markup format. From the perspective of building a digital repository, here
                 are five reasons to support such a transition:



                 XML Is Human-Readable

                 One of the primary benefits associated with XML is that the generated
                 metadata is human-readable. This is very different from binary formats like
                 PDF, MS Word, or even MARC21, which do not allow for easy inspection of
                 data. Of course, many XML records are never meant to be natively read by
                 an individual without the help of an XML reader—but this characteristic of
                 XML makes data (1) more transparent, (2) makes the data less susceptible
                 to data corruption, and (3) reduces the likelihood of data lockup.
                        1.  Data transparency provides a number of immediate ben-
                           efits. Within a closed binary system, the actual document
                           schema, or the rules that give the document meaning,
                           are hidden from the user. Within an XML-based meta-
                           data system, the document schema is always readily
                           available. Users can inspect the data markup as well as
                           any DTDs (document type definitions) or namespace
                           files that are defined within the document schema. This
                           is a powerful aspect of XML in that it opens one’s data,
                           making it easier for organizations or individuals to reuse
                           the metadata within other systems. As noted, this is very
                           different from other binary data formats—including
                           MARC. Binary data formats, like MARC, are computer-
                           generated formats, meaning that while they can be read
                           with some difficulty by humans, they are not human-
                           readable. Take the MARC record below:





                                                                                                                      79
   89   90   91   92   93   94   95   96   97   98   99