Word Spotting in Historical Manuscripts

Dnr:

SNIC 2017/7-97

Type:

SNAC Small

Principal Investigator:

Anders Hast

Affiliation:

Uppsala universitet

Start Date:

2017-06-13

End Date:

2018-07-01

Primary Classification:

10299: Other Computer and Information Science

Webpage:

http://www.it.uu.se/research/project/q2b

Abstract

Word spotting in historical manuscripts is a computationally heavy task. Finding a single word in a document can take up to a couple of minutes per page. Thus, finding hundreds of words in hundreds of pages takes weeks. Therefore, HPC resources are necessary in order to test new algorithms and parameter settings. The overall goal of the project is to find ways to do semi-automatic transcription of historical manuscripts.