[Home ] [Archive]   [ فارسی ]  
:: Volume 21, Issue 3 (10-2006) ::
2006, 21(3): 1-34 Back to browse issues page
Automatic Recognition of Table of Content Given their Stylistics in Farsi and Western Dissertation
Esmaeil Faramarzi *
Abstract:   (10548 Views)
In any type of document, whether book, magazine, dissertation or likes, the table of content expresses concisely its logical structure. By using table of contents, the document structure is easily reviewed and the desired topic could be readily accessed. The present paper presents for the first time a method for automated recognition of the table of content in dissertations written in Farsi, Arabic and any other western script. In this method the content pages are recognized given their patterns without employing OCR and merely using image processing techniques. The method can recognize the table of content pages regardless of the language or text justifications. Since it does not use OCR, it is independent of the document scan quality. The method was tested over a number of IRANDOC Farsi, Arabic and Western Dissertations. Recognition accuracy of 99.7 percent was achieved.
Keywords: Document image analysis, Page Layout analysis, Structural Analysis, Logical analysis, Document Image understanding, content page recognition, Image processing, OCR, Pattern Recognition
Full-Text [PDF 1240 kb]   (2336 Downloads)    
Type of Study: Research | Subject: Information Technology
Received: 2009/07/28
Add your comments about this article
Your username or Email:

Write the security code in the box >

XML   Persian Abstract   Print

Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Faramarzi E. Automatic Recognition of Table of Content Given their Stylistics in Farsi and Western Dissertation. Journal of Information Processing and Management. 2006; 21 (3) :1-34
URL: http://jipm.irandoc.ac.ir/article-1-93-en.html

Volume 21, Issue 3 (10-2006) Back to browse issues page
پژوهشنامه پردازش و مدیریت اطلاعات Journal of Information processing and Management
Persian site map - English site map - Created in 0.256 seconds with 818 queries by yektaweb 3619