Data Munging Tools in Preparation for RDF: Catmandu and LODRefine

Die neue Code4Lib-Ausgabe 30 ist da. Besonders interessant:

Data Munging Tools in Preparation for RDF: Catmandu and LODRefine von Christina Harlow


Data munging, or the work of remediating, enhancing and transforming library datasets for new or improved uses, has become more important and staff-inclusive in many library technology discussions and projects. Many times we know how we want our data to look, as well as how we want our data to act in discovery interfaces or when exposed, but we are uncertain how to make the data we have into the data we want. This article introduces and compares two library data munging tools that can help: LODRefine (OpenRefine with the DERI RDF Extension) and Catmandu.

The strengths and best practices of each tool are discussed in the context of metadata munging use cases for an institution’s metadata migration workflow. There is a focus on Linked Open Data modeling and transformation applications of each tool, in particular how metadataists, catalogers, and programmers can create metadata quality reports, enhance existing data with LOD sets, and transform that data to a RDF model. Integration of these tools with other systems and projects, the use of domain specific transformation languages, and the expansion of vocabulary reconciliation services are mentioned.

Lagepläne in VuFind

Kathleen Bauer, Michael Friscia und Scott Matheson in in Code4Lib: From the Catalog to the Book on the Shelf: Building a Mapping Application for Vufind

At Yale University Library (YUL), recorded reference transactions revealed that after finding a book in the catalog patrons had difficulty knowing how to use the call number to find the book on the shelf. The Library created a mobile service to help locate the call number in the library stacks. From any call number of a book in Sterling Memorial Library at YUL, a map will be displayed which highlights that call number’s general area on a floor in the stacks. YUL introduced the mapping application in Yufind, a catalog in place at Yale since 2008 which is based on Vufind.

Die Fragen nach Standorten sind nach Implementierung stark zurück gegangen:

In the fall semester 2010, 7.3% of all questions at the Information Desk at Sterling Memorial Library were questions relating to finding a specific call number. After the mapping project was fully implemented in fall 2011 the number of call number related questions was reduced to 5.8% of all questions at the same desk, a 20.6% reduction in patron inquiries on this topic. More publicity for the mapping functionality and the iPad kiosk should further help reduce the number of requests. Based solely on the reduction in requests for help finding call numbers in Sterling Memorial Library the improved mapping can be judged successful and worth replicating elsewhere in the Yale Library system.

Eines Tages begab es sich, dass sich eine Programmiererin (m/w) und ein Bibliothekar (m/w) trafen. Der Bibliothekar sprach: “Du hast Deinen Code in meinen Metadaten!”
Die Programmiererin antwortete: “Deine Metadaten sind in meinem Code!”

So oder ähnlich soll sich die Gründung von zugetragen haben. Libcatcode ist eine Q&A-Seite für alle Fragen rund um “cataloging/coding integration and dialogue”.

Die Software dahinter ist übrigens OSQA von Bitnami, das auch für die Answers-Sektion von eingesetzt wird.

Ideen und Entwickler gesucht: Open Book Widget

John Miedema freut sich über 1000 Downloads seines Open Book Widgets. Gleichzeitig läd er zur Mitarbeit ein:

What is most compelling is that OpenBook is free and open source — the entire codebase is available for inspection and innovation by the library world. To facilitate that sharing, I invite you to join a conversation that is starting about open source book widgets. No techie experience required, only an interest in book widgets, those little interfaces for sharing book data. Join our discussion list. Edit our wiki. Or just have a peek. More information to follow later.

Dem Aufruf möchte ich mich anschließen. Entstehen kann im Idealfall eine Programmbibliothek, die Zugriff, Verbreitung und Verarbeitung bibliographischer Daten im Web einfacher und gebräuchlicher macht. Das Projekt befindet sich noch ganz am Anfang. Einflußnahme auf die Ziele und die Struktur sind also noch möglich und sogar erwünscht.