Data Exchange and Strategic Planning

[Working Group 5]

WG 5 aims to negotiate principles and standards of data exchange as well as plans for digitizing and making freely accessible learned correspondence from the early modern period. Building on the survey of collections of printed and manuscript letters, inventories, and finding aids undertaken in WG 4, WG 5 will develop a master plan for a joint digitization program. In addition, the WG will address issues of Open Access and legal restrictions, content syndication (e.g. to EMLO, Europeana, etc.), the development of worldwide unique persistent identifiers for letters, the connecting of letters through semantic web techniques, and the long-term archiving of digitized letters.

WG 5 is led by Dr Thomas Stäcker, the Deputy Director of the Herzog August Bibliothek, Wolfenbüttel, responsible for the library's programme of digitization and the Wolfenbüttel Digital Library.

Agenda

The census of collections of early modern learned correspondence (pursued in WG 4) should be complemented by investigation into the most efficient and reliable methods of generating large quantities of digital catalogue records.

For generating metadata on uncatalogued collections, WG 5 needs to explore the rapidly developing area of crowd-sourcing (scholarly, semi-scholarly, and otherwise). Zooniverse is piloting flexible software for this purpose and is looking for humanistic projects on which to test it (Oxford to lead).
For existing catalogue data in traditional card files and related formats, experience of the most cost-effective and reliable means of scanning and keying should be pooled (library community to lead).

This Action devolves the negotiation of individual components of the data model to the specialists assembled in WGs 1–4. WG 5 will chair the Action subcommittee within which these individual components will be integrated into a comprehensive standard data model.
Since individual letters can exist in multiple manifestations (drafts, copies, extracts, abstracts, etc. in manuscript, print, and digital form), a unique identifier scheme for individual letters is needed. This project should build on the experience of similar schemes for people (e.g. VIAF) and printed books (e.g. VD 16).
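The relationship between one letter and its many manifestations can be sketched as a small data structure. This is only an illustration under stated assumptions, not the Action's data model: the class names, the `example.org` namespace, the placeholder person identifiers, and the rule of deriving the identifier from sender, recipient, and date are all hypothetical. A real scheme would also need to handle undated letters and several letters exchanged on the same day.

```python
import uuid
from dataclasses import dataclass, field
from typing import List

# Hypothetical namespace, used only for this illustration.
LETTER_NAMESPACE = uuid.uuid5(uuid.NAMESPACE_URL, "https://example.org/letters/")


@dataclass(frozen=True)
class Manifestation:
    """One surviving version of a letter."""
    kind: str    # e.g. "draft", "copy", "extract", "abstract"
    medium: str  # "manuscript", "print", or "digital"
    holder: str  # repository or edition that holds this version


@dataclass
class Letter:
    """The abstract letter to which every manifestation points."""
    sender: str     # assumed to be an authority identifier for the person
    recipient: str  # likewise
    date: str       # ISO 8601 date, e.g. "1642-03-15"
    manifestations: List[Manifestation] = field(default_factory=list)

    @property
    def letter_id(self) -> str:
        # Assumption: the identifier depends only on the letter itself,
        # never on which manifestations happen to be recorded.
        key = "|".join([self.sender, self.recipient, self.date])
        return str(uuid.uuid5(LETTER_NAMESPACE, key))
```

Under these assumptions, two repositories that catalogue different versions of the same letter independently arrive at the same identifier, so their records can be merged without manual matching.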

The census of correspondence collections (produced by WG 4) could also provide one basis for drawing up a master plan to digitize collections of learned correspondence.

Collections of printed correspondence now out of copyright can be digitized and web-mounted in the manner pioneered by CERA. WG 5’s initial role is to recommend standards and best practice.
Collections of manuscript correspondence can be digitized and web-mounted.
The challenge of funding such an enterprise could be assisted by drawing together information on various relevant funding schemes at the European, national, regional, civic, and institutional levels.

With such arrangements in place, WG 5 will coordinate a campaign to encourage contributions of relevant metadata, images, texts, and editions from a range of potential contributors.

Repositories contributing metadata render their collections more visible and discoverable.
Publishers of copyrighted editions of correspondence may regard digital catalogue records as advertisements for their products.
Collaborative research projects gain access to the digital tools and larger pools of data available on shared infrastructure.
Individual researchers render contributed data accessible and future-proof at minimal trouble and cost.

Issues of long-term preservation and sustainability require careful consideration.

One technical challenge is to develop means to allow central repositories of digitized data to be regularly upgraded and updated without contaminating the data or disrupting the functionality of the digital tools developed to process them.
Another is to develop an ontology cloud which allows data models, standards, and authorities to be incrementally refined over time as well. Here the experience of Aalto’s Semantic Computing Research Group will be particularly valuable, linking this strategic strand with work being undertaken in WG 2.
The financial challenge is to develop funding sources and mechanisms which allow the preservation and upgrading of both data and platforms to be sustained indefinitely.

People

Anna Skolimowska
Arno Bosse
Gregor Pobezin
Istvan Monok
Jeanine De Landtsheer
Karen Skovgaard-Petersen
Martin Lhoták
Neil Jeffries
Neven Jovanović
Plamena Popova
Stefan Schmunk
Thomas Stäcker
Wolfram Horstmann
Zeljka Salopek

Agenda

I. Generating digital metadata

II. Unifying metadata standards

III. Digitizing learned correspondence

IV. Sharing digital metadata: legal agreements and scholarly conventions

V. Recruiting contributions

VI. Preservation and sustainability
