Volume 34, Issue 3 (Spring 2019)                   ... 2019, 34(3): 1211-1234 | Back to browse issues page

XML Persian Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Rabiei M, MahdiHosseini-Motlagh S, Minaei Bidgoli B. Using One-Class SVM for Scientific Documents Classification Case study: Iranian Environmental Thesis . .... 2019; 34 (3) :1211-1234
URL: http://jipm.irandoc.ac.ir/article-1-4087-en.html
Iran University of Science and Technology
Abstract:   (622 Views)
The classification of research studies is important in order to identify and analyze the research supply and demand in various fields of science. In particular, the classification of environmental research is essential because of its importance in Iran and its interdisciplinary nature. This research proposes One-Class Classification (OCC) method to classify the research studies in this domain using Support Vector Machine (SVM) and consequently evaluates important parameters affecting the quality of this classification. The results show that the use of descriptive metadata has better performance than the content metadata in order to make a core data set to learn the model. Moreover, the use of the polynomial kernel and the binary weighing of words in the features vector matrix leads to better results than other states. In this paper a new weighing method has been proposed which is superior to the other methods especially in precision criterion. We call this weighing method as NG-TF, which can be used in term-document matrix to determine the indicator terms of scientific domains.
Full-Text [PDF 1156 kb]   (253 Downloads)    
Type of Study: Research | Subject: Information Technology
Received: 2018/10/16 | Accepted: 2018/12/30

Add your comments about this article : Your username or Email:
CAPTCHA

Send email to the article author


© 2019 All Rights Reserved | Iranian Journal of Information processing and Management

Designed & Developed by : Yektaweb