DWork – Heidelberg Digitization Workflow
Heidelberg University Library uses its in-house software DWork (Heidelberg Digitization Workflow) to support the digitization process and the web presentation of the digitized works.
As a web application, the software supports every step of the workflow: creation of metadata, scan processing (including image conversion and automatic text recognition), creation of the web presentation, and storage of scans and metadata.
The DWork concept aims to implement the guidelines of the Deutsche Forschungsgemeinschaft (DFG) and to provide a transparent, easy-to-use user interface.
Workflow, modules and technical requirements
The individual workflow steps (General - Bibliography - Sequence - Structure - Converting + OCR - Export - Storage) are selected through a file card (tab) system. Progress within the workflow is visualised by a traffic light system, so that an overview is available at any time.
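The step sequence and the traffic light overview described above can be pictured with a small sketch. This is not DWork's actual code (DWork is written in Perl and its internals are not described here). Only the step names come from the text. The meaning of the three colours, and the names `Light`, `new_progress`, `overview` and `next_open_step`, are assumptions made for illustration.

```python
from enum import Enum

# Step names are taken from the description above; everything else
# in this sketch is hypothetical and only illustrates the idea.
WORKFLOW_STEPS = [
    "General",
    "Bibliography",
    "Sequence",
    "Structure",
    "Converting + OCR",
    "Export",
    "Storage",
]


class Light(Enum):
    """Assumed traffic light states for a single workflow step."""
    RED = "not started"
    YELLOW = "in progress"
    GREEN = "done"


def new_progress():
    """Return a fresh progress record with every step set to RED."""
    return {step: Light.RED for step in WORKFLOW_STEPS}


def overview(progress):
    """Render a one-line overview of all steps in workflow order."""
    return " | ".join(
        f"{step}: {progress[step].name}" for step in WORKFLOW_STEPS
    )


def next_open_step(progress):
    """Return the first step that is not GREEN, or None if all are done."""
    for step in WORKFLOW_STEPS:
        if progress[step] is not Light.GREEN:
            return step
    return None


if __name__ == "__main__":
    progress = new_progress()
    progress["General"] = Light.GREEN
    progress["Bibliography"] = Light.YELLOW
    print(overview(progress))
    print("Next open step:", next_open_step(progress))
```

Keeping the steps in an ordered list and the state in a separate mapping means the overview always follows the workflow order, whatever order the steps were updated in.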
The technical basis of the software consists of three modules:
- Digitization workflow
- Presentation
- Storage
The software is written in Perl. Further software requirements are:
- Web server: Apache 2.x
- Database: MySQL 5.x
- Full-text search: Solr
OCR conversion
We use ABBYY FineReader for Linux for OCR conversion.
Further information
- Workflow and technology used in the Digitization Department of Heidelberg University Library (in German)
- Overview of digitized volumes: “Heidelberg historic literature - digitized”.
Contact
Dr. Thomas Wolf
Head of the Digitization Department
Dr. Maria Effinger
Head of the Department of Manuscripts and Early Printed Books
Central project management
Technical staff
Leonhard Maylein
Head of the IT Department
Jochen Barth
DWork developer

