IAD Index of Academic Documents
  • Home Page
  • About
    • About Izmir Academy Association
    • About IAD Index
    • IAD Team
    • IAD Logos and Links
    • Policies
    • Contact
  • Submit A Journal
  • Submit A Conference
  • Submit Paper/Book
    • Submit a Preprint
    • Submit a Book
  • Contact
  • Turkish Journal of Electrical Engineering and Computer Science
  • Volume:21 Issue:3
  • A word spotting method for Farsi machine-printed document images

A word spotting method for Farsi machine-printed document images

Authors : Yaghoub POURASAD, Houshang HASSIBI, Azam GHORBANI
Pages : 734-746
Doi:10.3906/elk-1107-26
View : 13 | Download : 12
Publication Date : 0000-00-00
Article Type : Research Paper
Abstract :In this paper, a word spotting approach for Farsi printed document images has been presented. The main idea of the paper is the font recognition of Farsi document images and query word modification according to the document image`s font before searching. This operation increases the similarity between the query word image and its instances in the document image; therefore, the performance of the word spotting system increases. In the proposed word spotting approach, after the query word modification, the query word image rectangle is searched in the text lines of the document image using XNOR similarity measurement. In order to increase the recall rate, we considered an almost low value as an acceptance/rejection threshold insert ignore into journalissuearticles values(d); and in order to increase precision rate, we used some other features, e.g., number of holes, ascenders, descenders, and dots. With multilevel matching and considering the mentioned features, the problem of justifying the operation insert ignore into journalissuearticles values(aligning the text to both the left and right); that occurs during the writing of Farsi documents has been solved. This approach was applied on a computer-made dataset consisting of 440 Farsi printed document images, and a precision rate of 97.5% at a recall rate of 92.1% was obtained. Moreover, when applying this approach on a dataset consisting of 224 Farsi scanned document images, a precision rate of 87.6% at recall rate of 79.3% was obtained.
Keywords : Farsi document image, font recognition, word spotting, retrieval

ORIGINAL ARTICLE URL
VIEW PAPER (PDF)

* There may have been changes in the journal, article,conference, book, preprint etc. informations. Therefore, it would be appropriate to follow the information on the official page of the source. The information here is shared for informational purposes. IAD is not responsible for incorrect or missing information.


Index of Academic Documents
İzmir Academy Association
CopyRight © 2023-2025