The authors describe IN Harmony: Sheet Music from Indiana (http://www.dlib.indiana.edu/projects/inharmony/), a collaborative project to catalog and digitize sheet music from four cultural heritage institutions in the state, Indiana State Library, Indiana Historical Society, Indiana State Museum, and Indiana University. Now halfway through a threeyear project, the partners have developed worked together to coordinate their workflows to facilitate standardsbased digitization and cataloging. The lead partner, Indiana University, is performing automated quality control on digitized images for all partners and has created a flexible cataloging tool for sheet music that will be made available to other institutions at the conclusion of the project.
IN Harmony Project description
Metadata standards and cataloging tools
Current Project status
IN Harmony Project description
In October 2004, the Indiana University Digital Library Program received a grant from the Institute of Museum and Library Services to fund IN Harmony: Sheet Music from Indiana, a threeyear project to catalog and digitize sheet music from the collections of four partner institutions — Indiana Universitys Lilly Library, Indiana State Museum, Indiana Historical Society, and Indiana State Library. The purpose of the Project is to create an online sheet music collection that will demonstrate how museums and libraries with complementary materials can work cooperatively to create shared digital resources. By concentrating initially on the collections of American sheet music owned by each of the partners, the Project will accomplish two highly adaptable goals: 1) it will demonstrate how approximately 10,000 digitized pieces of sheet music and their attendant metadata can be presented on a single Web site, offering federated searching of all collections or access to one or more selected collections; and, 2) it will demonstrate how collaborative digital library development can provide online access to the important regional collections of museums, libraries, and historical societies. These collections may, in fact, be sheet music, or they may be important materials in other formats, such as photographs, maps, manuscripts, or artifacts.
The partners hold diverse sheet music collections, with some focused exclusively on Indiana composer, lyricist, subject matter and others with no particular Indiana connection. The project will digitize all Indianarelated sheet music in these collections, then move out more broadly with priorities established by the partner institutions. Not all of the sheet music from partner institutions will have an Indiana connection other the location of the repository for the physical objects.
Indiana Historical Society (IHS)
The IHS sheet music collection is comprised of items gathered from a number of other collections, with about 950 items related to Indiana, by composers including Irving Berlin, Paul Dresser, Al Jolson, Jerome Kern, Fritz Krull, John A. McGee, Cole Porter, and Albert and Harry Von Tilzer. The Historical Society recently acquired the Jane B. Anderson Sheet Music Collection, with approximately 12,000 items, dating from 18541980, ranging from semiclassical to vaudeville. Selections from this collection will also be cataloged and digitized.
Indiana State Library
The Indiana Division owns a collection of approximately 1,600 pieces of sheet music, all with an Indiana connection. These pieces date from 1840 (a campaign song for William Henry Harrison) through the 1960s, with the bulk of the music dating from 1890 through 1950. The collection includes a wide variety of pieces: booster songs promoting a particular city or railroad stop, centennial songs from statewide, county, and city celebrations, religious works, college songs, and songs based on the literary works of wellknown Indiana authors such as James Whitcomb Riley.
Indiana State Museum (ISL)
The ISL owns about 1,800 pieces of Indianarelated sheet pieces from the Sam DeVincent collection and other sources. The collections include many of the published works of the Von Tilzer brothers, who were born in Indianapolis. Their popular songs include standards such as Take Me Out to the Ball Game, I Want a Girl, Apple Blossom Time, and Put Your Arms Around Me. The Museum also holds a very good selection of Indiana Ragtime, 19101925, and representative collections of composers Hoagy Carmichael and Cole Porter.
Indiana Universitys Lilly Library contains one of the nations largest collections of American sheet music: in addition to the DeVincent Collection, the Lilly Library owns the Starr Sheet Music Collection, containing more than 100,000 separate items. The DeVincent Collection had already been cataloged and selections had been digitized (http://www.letrs.indiana.edu/web/s/sheetmusic/). The IN Harmony Project builds upon and extends this work.
Each page of sheet music is scanned at 400 PPI, and the master image file is saved as uncompressed TIFF. We have created an automated Quality Control process that allows the partners to control their workflow. We developed a tool that will automatically validate the images (i.e. perform a technical quality control process), create the derivatives for delivery online (including downloadable and printable PDF files) and store all of the images safely (Figure 1).
Figure 1: IUDLP image validation and processing.
Studies revealed that users want to print sheet music for a variety of reasons the cover art, the lyrics, the advisements, but most importantly, for the music. People do not want to print each page separately; they want a full score sized to print on 8.5 x 11 paper. The best way to fulfill that functional requirement is via PDF. We want to provide a PDF of the full score optimized for both printing speed and readability. After much experimentation, we have developed a process for creating PDFs that have sufficient clarity for performance while maintaining a file size of approximately one megabyte. Using a temporary intermediate JPG file for each page image, we resample down from the original 400 PPI to 125 PPI while maintaining the dimensionality of the original. For grayscale images, we include two extra steps to increase the printability and the readability of the scores a 60 percent threshold to increase the disparity between black and white and a contrast filter to sharpen the edges.
Metadata standards and cataloging tools
A primary goal for this project is to provide robust, consistent browse and search access across collections. As a result, cataloging guidelines and tools for sheet music description have been developed to aide the project partners in a) cataloging sheet music not yet described in their respective collections; and, b) mapping existing cataloging records to a format that will facilitate crosscollection searching.
In order to generate cataloging guidelines and requirements, we needed to understand how sheet music is browsed and searched. We evaluated a subset of the query logs captured by the Indiana University Sheet Music Collection (housed at Indiana University) and the Sheet Music Consortium (housed at University of California, Los Angeles) Web sites to learn how people search sheet music, then performed a series of studies with real users of sheet music to help inform our choice of metadata elements. The complete usability studies can be found on the project Web site. Using the information gathered from the user studies, we developed a metadata model to describe sheet music that is flexible enough to meet the needs of the diverse project partners. The data will be stored in a relational database, but for purposes of interoperability with other digital library collections, records will be exportable into MODS. The table below (Table 1) is the complete data elements set with a brief description of the MODS transform.
Table 1: IN Harmony metadata elements. Title elements Repeatable? Required? MODS field title N Y titleInfo/title uniform title N N titleInfo/title/@type="uniform" first line N N titleInfo/title/@type="alternative"/@displayLabel="First line" first line of chorus N N titleInfo/title/@type="alternative"/@displayLabel="First line of chorus" alternative title Y N titleInfo/title/@type= "alternative"/@displayLabel="Alternative title" title of larger work N N relatedItem/@type= "host"/titleInfo/title series title Y N relatedItem/@type="series"/titleInfo/title/ Name elements Repeatable? Required? MODS field composer Y N name/namePart [and <role> indication] [@authority="naf" when applicable] [@type="personal|corporate"] arranger Y N name/namePart [and <role> indication] [@authority="naf" when applicable] [@type="personal|corporate"] lyricist Y N name/namePart [and <role>indication] [@authority="naf" when applicable] [@type="personal|corporate"] performer Y N name/namePart [and <role> indication] [@authority="naf" when applicable] [@type="personal|corporate"] dedicatee Y N engraver Y N relatedItem/@displayLabel="Cover information"/name/namePart [and <role> indication] lithographer Y N relatedItem/@displayLabel="Cover information"/name/namePart [and <role> indication] artist Y N relatedItem/@displayLabel= "Cover information"/name/namePart [and <role> indication] other name Y N name/namePart [and <role> indication?] [@authority= "naf" when applicable] [@type="personal|corporate"] Publication elements Repeatable? Required? MODS field publisher name Y N originInfo/publisher publishing place Y N originInfo/place/placeTerm/@type="text" date Y N originInfo/dateIssued OR originInfo/copyrightDate Subject elements Repeatable? Required? MODS field topical subject Y N subject/topic name as subject Y N subject/name/namePart/@type="personal|corporate" form/genre Y N genre temporal subject Y N subject/temporal instrumentation Y N note AND extension/inharmony/instruments/instrument place name subject Y N subject/hierarchicalGeographic cover subject Y N relatedItem/@displayLabel= "Cover"/subject/topic other geographic subject Y N subject/geographic language Y N language/languageTerm local subject Y N subject/topic Identification elements Repeatable? Required? MODS field IN Harmony ID number N Y recordInfo/recordIdentifier plate number Y N identifier/@type="music plate"/@displayLabel="Plate number" publisher number Y N identifier/@type= "music publisher"/@displayLabel="Publisher number" catalog number Y N identifier/@type= "catalog"/@displayLabel="Catalog number" call number N N identifier/@type= "call number"/@displayLabel= "Call number" Other elements Repeatable? Required? MODS field note Y N note
To facilitate the data collection, we developed a cataloging tool. This tool is a Java client application with an Oracle database on a central server, is designed to maximize efficiency and assist authority control by allowing catalogers to automatically see names, subjects, locations, and titles of other works as they type. Catalogers can easily reuse the information in existing records (Figure 2). If the institution chooses to scan prior to cataloging, the PDF of the piece is available to the cataloger through the interface.
Figure 2: Recycling of information in records.
Current Project status
The IN Harmony: Sheet Music from Indiana Project is at the halfway point. Digitizing is in full production mode. More than 5,000 pieces of sheet music have been scanned and processed through the quality control procedure. The cataloging tool has just been released to production. The partners have cataloged more than 200 items to date. Currently, we are developing the functional and technical requirements for the public delivery system that will constitute the next phase of the Project. We expect to complete the Project on schedule in August 2007.
About the authors
Kristine R. Brancolini is Director of the Indiana University Digital Library Program (http://www.dlib.indiana.edu) and Project Director on IN Harmony: Sheet Music from Indiana. This Project grew out of her work the Sheet Music Consortium, a metadata harvesting service for sheet music operated by UCLA (http://digital.library.ucla.edu/sheetmusic/); the Indiana University Sheet Music digitization program (http://www.letrs.indiana.edu/s/sheetmusic/); and, the Indiana State Library-sponsored effort to create a collaborative Indiana Digital Library (http://www.statelib.lib.in.us/www/isl/diglibin/). On 3 July 2006, she became Dean of Libraries at Loyola Marymount University.
Stacy Kowalczyk is a visiting librarian for the Digital Library Program and the Project Manager on the IN Harmony Project. She is a Ph.D. student at the Indiana University School of Library and Information Science with a focus on digital library architectures and digital preservation. Before entering the doctoral program at IU, she spent seven years as Manager, Applications Programming, for the Harvard University Library, Office for Information Systems, where she managed the development of the technical infrastructure, both systems and process, for the Harvard Library Digital Initiative (LDI).
Jenn Riley is responsible for overseeing all metadata activities for the IN Harmony Project. She is the Metadata Librarian with the Digital Library Program at Indiana UniversityBloomington, where she is responsible for planning metadata strategy for digital library projects and participates in the collaborative design of digital library systems. Jenns research interests include shareable metadata, the incorporation of thesaurus structures into search and browse systems, music digital libraries, and FRBR. Jenn is a member of the Metadata Working Group of the Digital Library Federations Aquifer Initiative and is an active contributor to the Best Practices for Open Archives Initiative Data Provider Implementations and Shareable Metadata. Jenn is the author of the blog Inquiring Librarian (http://inquiringlibrarian.blogspot.com), where her posts frequently center around improving intellectual access to library materials, and a contributor to the collaborative Blog and Wiki TechEssence (http://www.techessence.info), a technology resource for library administrators.
Paper received 4 June 2006; accepted 18 July 2006.
Copyright ©2006, First Monday.
Copyright ©2006, Kristine R. Brancolini, Stacy Kowalczyk, and Jenn Riley.
IN Harmony: Sheet Music from Indiana by Kristine R. Brancolini, Stacy Kowalczyk, and Jenn Riley
First Monday, volume 11, number 8 (August 2006),