Please use this identifier to cite or link to this item: http://localhost:8080/xmlui/handle/123456789/6
Title: Arabic Word Spotting Based on Key-Points Features
Authors: Tari, Abdelkamel
CHERIET, Mohamed
GAGAOUA, Meriem
GHILAS, Hamza
Keywords: word
spotting
historical Arabic documents
features extraction; word matching
Issue Date: 2017
Publisher: IEEE Xplore
Series/Report no.: aestin3;3
Abstract: Abstract Word spotting is an efficient alternative to OCR (Optical Character Recognition) for understanding historical manuscripts by matching features of words. However, due to the large variability in the Arabic script, the retrieving results are still not satisfactory. In this work a novel method based on key-points features is proposed. Key-points features can capture both topological and local characteristics. Feature vectors were extracted from key-points and a distance function was proposed for word matching. To reduce space matching, the connected components (CCs) were clustered into metaclasses in a soft manner using a Gaussian mixture model, then a query CC was spotted by matching it only with CCs which belong to its meta-class. The experiments were carried out on a benchmark consisting of Arabic historical manuscripts by IBEN SINA and shows promising results.
Description: The digitization of historical documents is an efficient way to preserve them. Nowadays a large amount of digitized documents is available; thus the need to make them searchable becomes necessary. Since manual transcription of these documents is time consuming and the automatic transcription by Optical Character Recognition (OCR) is far from to be practical, the word spotting technique becomes a promising alternative. Word spotting aims to locate in a target document, regions that are most similar to a query word without recognizing its characters.
URI: http://localhost:8080/xmlui/handle/123456789/6
Appears in Collections:Articles

Files in This Item:
File Description SizeFormat 
(3)Arabic_word_spotting_based_on_key-points_features.pdf181,73 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.