Volume 37, Issue 3 (Spring 2022)                   ... 2022, 37(3): 895-918





Fakhrzadeh A, Rahnama M, Nasiri J A. Automatic Annotation of Images in Persian Scientific Documents Based on Text Analysis Methods. .... 2022; 37(3): 895-918
URL: http://jipm.irandoc.ac.ir/article-1-4681-en.html
Iranian Research Institute for Information Science and Technology (IranDoc);Tehran, Iran
Abstract:
This paper proposes a new method for annotating images in Persian scientific documents. Images in scientific documents contain valuable information; in many cases, analyzing the images is enough to understand the main idea and the important results of the document. Due to the explosive growth of image data, automatic image annotation has attracted extensive attention and has become a growing subject in the literature. Image annotation, in which descriptive tags are assigned to each image, is the first step in image retrieval methods.
Here, the text associated with an image is used to annotate it: the caption and the part of the document that contains the reference to the image. Noun phrases in the associated text are ranked by five different methods: (1) term frequency, (2) inverse document frequency, (3) term frequency–inverse document frequency, (4) cosine similarity between the word embeddings of noun phrases in the text and the caption, and (5) a combination of term frequency–inverse document frequency and cosine similarity. In every method, the image tags are the highest-ranked noun phrases. The suggested methods are evaluated on test data from the Iran scientific information database (Ganj), the main database of Persian scientific documents. The term frequency–inverse document frequency method gives the best results.
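The ranking step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact weighting formula, so the smoothed IDF used here is an assumption, and the function names (tf_idf_rank, cosine), the background corpus, and the English placeholder phrases are all hypothetical. Noun phrase extraction is assumed to have been done already.

```python
import math
from collections import Counter


def tf_idf_rank(phrases, corpus, top_k=3):
    """Rank the noun phrases of one associated text by TF-IDF.

    phrases: noun phrases extracted from the caption and referring text
    corpus:  list of phrase lists, one per document (hypothetical background set)
    """
    tf = Counter(phrases)
    n_docs = len(corpus)
    scores = {}
    for phrase, count in tf.items():
        # Document frequency: number of documents containing the phrase.
        df = sum(1 for doc in corpus if phrase in doc)
        # Smoothed IDF; the smoothing is an assumption, not taken from the paper.
        idf = math.log((1 + n_docs) / (1 + df)) + 1
        scores[phrase] = (count / len(phrases)) * idf
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda p: (-scores[p], p))[:top_k]


def cosine(u, v):
    """Cosine similarity between two embedding vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Placeholder data standing in for Persian noun phrases.
associated = ["neural network", "accuracy", "neural network", "training set"]
corpus = [associated, ["accuracy", "dataset"], ["accuracy", "training set"]]
tags = tf_idf_rank(associated, corpus, top_k=2)
print(tags)
```

In this toy input, "accuracy" occurs in every document and is therefore ranked below the rarer phrases. The cosine function shows how the embedding-based methods could score a phrase vector against a caption vector; how the two scores are combined in the fifth method is not specified in the abstract.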
Type of Study: Research | Subject: Big Data Analysis
Received: 2021/03/28 | Accepted: 2021/05/16 | Published: 2022/03/30



Rights and permissions
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

© 2022 CC BY-NC 4.0 | Iranian Journal of Information processing and Management
