{"id":412,"date":"2013-03-20T22:07:52","date_gmt":"2013-03-21T05:07:52","guid":{"rendered":"http:\/\/galencharlton.com\/blog\/?p=412"},"modified":"2013-03-20T22:07:52","modified_gmt":"2013-03-21T05:07:52","slug":"bestiary-of-monstrous-data-2-im-not-dead-yet","status":"publish","type":"post","link":"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/","title":{"rendered":"Bestiary of Monstrous Data #2: I&#8217;m not dead yet!"},"content":{"rendered":"<p><em>This is the second part in an <a href=\"\/blog\/category\/bestiary-of-monstrous-data\/\">occasional series<\/a> on how good data can go bad.<\/em><\/p>\n<p><img loading=\"lazy\" class=\"alignleft size-full wp-image-297\" alt=\"bestiary_viper_thumbnail\" src=\"https:\/\/galencharlton.com\/blog\/wp-content\/uploads\/2013\/03\/bestiary_viper_thumbnail.jpg\" width=\"93\" height=\"100\" \/><span style=\"line-height: 1.714285714; font-size: 1rem;\">One aspect of the MARC standard that sometimes is forgotten is that it was meant to be a cataloging <\/span><em style=\"line-height: 1.714285714; font-size: 1rem;\">communications<\/em><span style=\"line-height: 1.714285714; font-size: 1rem;\"> format. One could design an ILS that doesn&#8217;t use anything resembling MARC 21 to store or express bibliographic data, but as long as its internal structure is sufficiently expressive to keep track of the distinctions called for by AACR2, in principle it could relegate MARC handling strictly to import and export functionality. By doing so, it would follow a conception of MARC as a <\/span><em style=\"line-height: 1.714285714; font-size: 1rem;\">lingua franca<\/em><span style=\"line-height: 1.714285714; font-size: 1rem;\"> for bibliographic software.<\/span><\/p>\n<p>In practice, of course, MARC isn&#8217;t just a common language for machines &#8212; it&#8217;s also part of a common language for catalogers. &nbsp;If you say &#8220;222&#8221; or &#8220;245&#8221; or &#8220;780&#8221; to one, you&#8217;ve communicated a reasonably precise (in the context of AACR2) identification of a metadata attribute. &nbsp;Sure, it&#8217;s arcane, but then again so is most professional jargon to non-practitioners. &nbsp;MARC also become the basis of record storage and editing in most ILSs, to the point where the act of cataloging is sometimes conflated with the act of creating and editing MARC records.<\/p>\n<p>But MARC&#8217;s origins as a communications format can sometimes conflict with its ad hoc role as a storage format. &nbsp;Consider this record:<\/p>\n<pre class=\"crayon:false\">00528<span style=\"color: red;\">d<\/span>am  22001577u 4500\r\n001 123\r\n100 1  $a Strang, Elizabeth Leonard.\r\n245 10 $a Lectures on landscape and gardening design \/ $c by Elizabeth Leonard Strang.<\/pre>\n<p>A brief bibliographic record, right? &nbsp;Look at the Leader\/05, which stores the the record status. &nbsp;The value &#8216;d&#8217; means that the record is deleted; other values for that position include &#8216;n&#8217; for new and &#8216;c&#8217; for corrected.<\/p>\n<p>But unlike, say, the 245, the Leader\/05 isn&#8217;t making an assertion about a bibliographic entity. &nbsp;It&#8217;s making an assertion about the metadata record itself, and one that requires more context to make sense. &nbsp;There can&#8217;t be a globally valid assertion that a record is deleted; my public library may have deaccessioned&nbsp;<em>Lectures on landscape and gardening design<\/em>, but your horticultural library may keep that title&nbsp;indefinitely.<\/p>\n<p>Consequently, the Leader\/05 is often ignored when creating or modifying records in an ILS. &nbsp;For example, if a bib record is present in an Evergreen or Koha database, setting its Leader\/05 to &#8216;d&#8217; does not affect its indexing or display.<\/p>\n<p>However, such records can become undead &#8212; not in the context of the ILS, but in the context of exporting them for loading into a discovery layer or a union catalog. Some discovery layers do look at the Leader\/05. &nbsp;If an incoming record is marked as deleted, that is taken as a signal to remove the matching record from the discovery layer&#8217;s own indexes. &nbsp;If there is no matching record, the discovery layer would be reasonable to ignore an incoming &#8220;deleted&#8221; record &#8212; and I know of at least that does exactly that.<\/p>\n<p>The result?  A record that appears to be perfectly good in the ILS doesn&#8217;t show up in the discovery layer.<\/p>\n<p>Context matters.<\/p>\n<p>I&#8217;ll finish with a couple SQL queries for finding such undead records, one for Evergreen:<\/p>\n<pre class=\"lang:pgsql decode:true\" title=\"Find undead bibs in Evergreen\">SELECT record\r\nFROM metabib.full_rec mfr\r\nJOIN biblio.record_entry bre ON (bre.id = mfr.record)\r\nWHERE tag = 'LDR'\r\nAND SUBSTRING(value, 6, 1) = 'd'\r\nAND NOT bre.deleted;<\/pre>\n<p>and one for Koha:<\/p>\n<pre class=\"lang:pgsql decode:true\" title=\"Find undead bibs in Koha\">SELECT biblionumber\r\nFROM biblioitems \r\nWHERE ExtractValue(marcxml, 'substring(\/\/leader, 6, 1)') = 'd';<\/pre>\n<p>&nbsp;<\/p>\n<p><small><em>CC-BY image of a <a href=\"http:\/\/www.flickr.com\/photos\/58558794@N07\/5436869194\/\">woodcut of a viper<\/a> courtesy of the <a href=\"http:\/\/www.flickr.com\/photos\/58558794@N07\/\">Penn Provenance Project<\/a>.<\/em><\/small><\/p>\n<div class=\"sharedaddy sd-sharing-enabled\"><div class=\"robots-nocontent sd-block sd-social sd-social-icon-text sd-sharing\"><h3 class=\"sd-title\">Share this:<\/h3><div class=\"sd-content\"><ul><li class=\"share-twitter\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-twitter-412\" class=\"share-twitter sd-button share-icon\" href=\"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/?share=twitter\" target=\"_blank\" title=\"Click to share on Twitter\"><span>Twitter<\/span><\/a><\/li><li><a href=\"#\" class=\"sharing-anchor sd-button share-more\"><span>More<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><div class=\"sharing-hidden\"><div class=\"inner\" style=\"display: none;\"><ul><li class=\"share-tumblr\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-tumblr sd-button share-icon\" href=\"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/?share=tumblr\" target=\"_blank\" title=\"Click to share on Tumblr\"><span>Tumblr<\/span><\/a><\/li><li class=\"share-reddit\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-reddit sd-button share-icon\" href=\"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/?share=reddit\" target=\"_blank\" title=\"Click to share on Reddit\"><span>Reddit<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-print\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-print sd-button share-icon\" href=\"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/\" target=\"_blank\" title=\"Click to print\"><span>Print<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><\/div><\/div><\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>This is the second part in an occasional series on how good data can go bad. One aspect of the MARC standard that sometimes is&#8230;<\/p>\n<div class=\"sharedaddy sd-sharing-enabled\"><div class=\"robots-nocontent sd-block sd-social sd-social-icon-text sd-sharing\"><h3 class=\"sd-title\">Share this:<\/h3><div class=\"sd-content\"><ul><li class=\"share-twitter\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"sharing-twitter-412\" class=\"share-twitter sd-button share-icon\" href=\"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/?share=twitter\" target=\"_blank\" title=\"Click to share on Twitter\"><span>Twitter<\/span><\/a><\/li><li><a href=\"#\" class=\"sharing-anchor sd-button share-more\"><span>More<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><div class=\"sharing-hidden\"><div class=\"inner\" style=\"display: none;\"><ul><li class=\"share-tumblr\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-tumblr sd-button share-icon\" href=\"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/?share=tumblr\" target=\"_blank\" title=\"Click to share on Tumblr\"><span>Tumblr<\/span><\/a><\/li><li class=\"share-reddit\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-reddit sd-button share-icon\" href=\"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/?share=reddit\" target=\"_blank\" title=\"Click to share on Reddit\"><span>Reddit<\/span><\/a><\/li><li class=\"share-end\"><\/li><li class=\"share-print\"><a rel=\"nofollow noopener noreferrer\" data-shared=\"\" class=\"share-print sd-button share-icon\" href=\"https:\/\/galencharlton.com\/blog\/2013\/03\/bestiary-of-monstrous-data-2-im-not-dead-yet\/\" target=\"_blank\" title=\"Click to print\"><span>Print<\/span><\/a><\/li><li class=\"share-end\"><\/li><\/ul><\/div><\/div><\/div><\/div><\/div>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","jetpack_publicize_message":"Bestiary of Monstrous Data #2: I'm not dead yet! http:\/\/wp.me\/p3gJ9y-6E #monstrousdata #cataloging","jetpack_is_tweetstorm":false},"categories":[29,6],"tags":[],"jetpack_featured_media_url":"","jetpack_publicize_connections":[],"jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3gJ9y-6E","_links":{"self":[{"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/posts\/412"}],"collection":[{"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/comments?post=412"}],"version-history":[{"count":41,"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/posts\/412\/revisions"}],"predecessor-version":[{"id":463,"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/posts\/412\/revisions\/463"}],"wp:attachment":[{"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/media?parent=412"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/categories?post=412"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/galencharlton.com\/blog\/wp-json\/wp\/v2\/tags?post=412"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}