Project objectives

tranScriptorium aims to develop innovative, efficient and cost-effective solutions for the indexing, search and full transcription of historical handwritten document images.

Project audience

tranScriptorium will focus on four languages: Spanish, German, English and Dutch. This not only to illustrate the applicability of the technology to different languages, but also because it will stimulate the uptake and validation of the technology for a wider audience.

Historia de las Plantas

Historia de las Plantas is a manuscript written by Bernardo Cienfuegos, who was one of the most outstanding Spanish botanists in the 17th century.

Hattem collection

Hattem manuscripts is a Dutch document of late medieval manuscripts belonging to the “Artes Liberales” literature, which were mostly written in the cursive style.

Bentham collection

The Bentham collection consists of a set of images of a collection of works on law and moral philosophy written by the philosopher Jeremy Bentham.

Demonstrations

Some project demonstrations are available at the Demonstrations section.

 

Home

tranScriptorium is a STREP of the Seventh Framework Programme in the ICT for Learning and Access to Cultural Resources challenge. tranScriptorium is planned to last from 1 January 2013 to 31 December 2015.

tranScriptorium aims to develop innovative, efficient and cost-effective solutions for the indexing, search and full transcription of historical handwritten document images, using modern, holistic Handwritten Text Recognition (HTR) technology.

tranScriptorium will turn HTR into a mature technology by addressing the following objectives:

  1. Enhancing HTR technology for efficient transcription
    Departing from state-of-the-art HTR approaches, tranScriptorium will capitalize on interactive-predictive techniques for effective and user-friendly computer-assisted transcrition.
  2. Bringing the HTR technology to users
    Expected users of the HTR technology belong mainly to two groups: a) individual reserachers with experience in handwritten documents transcription interested in transcribing specific documents. b) volunteers which collaborate in large transcription projects.
  3. Integrating the HTR results in public web portals
    The HTR technology will become a support in the digitization of the handwritten materials. The outcomes of the tranScriptorium tools will be attached to the published handwritten document images. This includes not only full, correct transcriptions, but also partially correct transcription and other kinds of automatically produced metadata, useful for indexing and searching.

Project objectives

Project structure