Dimosthenis Karatzas


Refereed Papers

Text Segmentation in Colour Posters from the Spanish Civil War Era

A. Clavelli and D. Karatzas

Proceedings of the 10th International Conference on Document Analysis and Recognition, IEEE Computer Society, Vol. 1, pp. 181-185, Barcelona, Spain, 2009


Extracting textual content from colour documents of a graphical nature is a complicated task. The text can be rendered in any colour, size and orientation, and complex background graphics with repetitive patterns can make its localization and segmentation extremely difficult. Here, we propose a new method for extracting textual content from such colour images that makes no assumptions about the size, orientation or colour of the characters, and that tolerates characters that do not follow a straight baseline. We evaluate this method on a collection of historical documents: posters from the Spanish Civil War.



