

Department of Knowledge and Information Science, University of Qom, Qom, Iran
Abstract:
Clustering, as a process for understanding the nature and structure of data, plays an important role in organizing data in many areas of science and technology. One of the simplest and most widely used clustering algorithms is K-means. The present study systematically reviews research on improving the K-means algorithm for data clustering. It examines studies published in this field between 2010 and 2020, and their role in organizing data, using a new strategy based on the shortcomings of the K-means algorithm. For this purpose, the research questions were framed around how much attention researchers have paid in recent years to eliminating each of these shortcomings. Using a search strategy for refining and extracting articles, 47 related sources were identified and examined. The findings showed that most of the studies sought to improve K-means by overcoming its sensitivity to the initial cluster centers. Of the 47 studies, 35 applied the improved K-means algorithm to non-textual data and 12 applied it to textual data. Finally, a review of six studies showed that the amount of data is directly related to the performance of the improved K-means algorithm. In other words, the algorithm must be modified so that it clusters efficiently and accurately when applied to different amounts of data.
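The sensitivity to initial cluster centers named in the abstract can be illustrated with a small sketch. The code below is not taken from any of the reviewed studies: the function names `kmeans` and `kmeanspp_init`, the toy one-dimensional data, and the two hand-picked initializations are all assumptions made for illustration. It runs plain Lloyd-style K-means from a poor and from a good set of initial centers, and also shows k-means++ style seeding as one widely known way to choose initial centers.

```python
import random

def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, centers, max_iter=100):
    """Plain Lloyd iterations from the given initial centers (illustrative helper)."""
    centers = [tuple(c) for c in centers]
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            idx = min(range(len(centers)), key=lambda i: sq_dist(p, centers[i]))
            clusters[idx].append(p)
        # Update step: move each center to its cluster mean
        # (an empty cluster keeps its previous center).
        new_centers = [
            tuple(sum(xs) / len(c) for xs in zip(*c)) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
        if new_centers == centers:
            break  # converged: no center moved
        centers = new_centers
    # Sum of squared errors of every point to its nearest final center.
    sse = sum(min(sq_dist(p, c) for c in centers) for p in points)
    return centers, sse

def kmeanspp_init(points, k, rng):
    """k-means++ style seeding: prefer points far from the centers chosen so far."""
    centers = [rng.choice(points)]
    while len(centers) < k:
        weights = [min(sq_dist(p, c) for c in centers) for p in points]
        centers.append(rng.choices(points, weights=weights, k=1)[0])
    return centers

# Toy data (an assumption): three groups on a line.
points = [(0,), (1,), (2,), (3,), (20,), (21,), (26,), (27,)]

# Poor start: two centers inside the first group, one for the other two groups.
_, bad_sse = kmeans(points, [(0,), (3,), (20,)])
# Good start: one center in each group.
_, good_sse = kmeans(points, [(0,), (20,), (27,)])
print(bad_sse, good_sse)  # prints "38.0 6.0"

# Seeded start; the outcome depends on the random draw.
seeded = kmeanspp_init(points, 3, random.Random(0))
_, seeded_sse = kmeans(points, seeded)
```

On this toy data the poor start converges to a local optimum with a much larger error than the good start, although both runs use the same algorithm and the same data. That gap is the kind of shortcoming that initialization-focused improvements aim to reduce.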
Type of Study: Review | Subject: Big Data Analysis
Received: 2021/01/11 | Accepted: 2021/05/26



Rights and permissions
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

© 2021 CC BY-NC 4.0 | Iranian Journal of Information Processing and Management
