Iranian Journal of Information Processing and Management

Iranian Journal of Information Processing and Management

Comparative Assessment of Data Quality Dimensions in Scientific Multimedia Indexing Process

Document Type : Exploring the Relationship between Data Quality and Business Process Management

Authors
1 Assistant Professor; Information Technology Department. Iranian Research Institute for Information Science and Technology (IranDoc), Tehran. Iran.
2 Information Technology Department. Iranian Research Institute for Information Science and Technology (IranDoc), Tehran. Iran.
3 Associate Professor :Information Technology Department. Iranian Research Institute for Information Science and Technology (IranDoc), Tehran. Iran.
Abstract
Organizing a large volume of scientific multimedia data requires the use of appropriate indexing methods as one of the processes of information organization. Appropriate methods and algorithms are those that lead to the improvement of various aspects of quality in the process of organizing and retrieving information. For this reason, the purpose of this research is to identify the most important dimensions of data quality in the field of scientific multimedia indexing. In order to achieve this goal, a comparison of different dimensions of data quality has been made based on different criteria and the most important dimensions have been identified using Shannon entropy weighting approach and TOPSIS group ranking method. Also, using the correlation matrix, the intensity and direction of the relationship and correlation between the different dimensions of data quality have been evaluated. Based on the results of the first part of the research, the best ranks (priorities) were related to the data quality dimensions of recall, precision, completeness, appropriate amount of data, accuracy, relevancy, concise 1, consistency, concise 2, interpretability, value-added and accessibility, respectively. The results obtained from the second part of the research showed that the data quality dimensions of interpretability and relevancy had the highest correlation with the most important dimensions, i.e. recall and precision. As one of the implications of this research, it is possible to consider the measurement and evaluation of scientific multimedia data indexing methods based on different aspects of data quality and their importance.
Keywords
Subjects

References
Abhilash, R. K., Anurag, C., Avinash, V., & Uma, D. (2021). Lecture video summarization using subtitles. In 2nd EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing: BDCC 2019 (pp. 83-92). Springer International Publishing. DOI: 10.1007/978-3-030-47560-4_7
Albahr, A., Che, D., & Albahar, M. (2021). A novel cluster-based approach for keyphrase extraction from MOOC video lectures. Knowledge and Information Systems, 63 (7), 1663-1686. DOI: 10.1007/s10115-021-01568-2
Aminpour, H. (2018). Descriptive statistics in psychology and educational sciences. Tehran: Payam Noor University.
Dakka, M. A., Nguyen, T. V., Hall, J. M. M., Diakiw, S. M., VerMilyea, M., Linke, R., ... & Perugini, D. (2021). Automated detection of poor-quality data: case studies in healthcare. Scientific Reports, 11 (1), 18005. DOI: 10.1038/s41598-021-97341-0
Davila, K., Xu, F., Setlur, S., & Govindaraju, V. (2021). Fcn-lecturenet: extractive summarization of whiteboard and chalkboard lecture videos. IEEE Access, 9, 104469-104484.   
DOI: 10.1109/ACCESS.2021.3099427
Du, J., & Zhou, L. (2012). Improving financial data quality using ontologies. Decision Support Systems, 54 (1), 76-86. DOI: 10.1016/j.dss.2012.04.016
Ershadi, M. J., & Azizi, A. (2019). Measurement and analysis of data quality metrics in the process of registering theses/dissertations of domestic graduates. Tehran: Iranian research institute for information science and technology.
Ershadi, M. J., Ershadi, M. M., Fakhrazadeh, A., & Ghazizadeh. S. (2022). Providing implementation solutions to improve the quality of theses/ dissertations information in the registration system based on data mining techniques. Tehran: Iranian research institute for information science and technology.
Furini, M., Mirri, S., & Montangero, M. (2018, January). Topic-based playlist to improve video lecture accessibility. In 2018 15th IEEE Annual Consumer Communications & Networking Conference (CCNC) (pp. 1-5). IEEE. DOI: 10.1109/CCNC.2018.8319246
Ghosh, K., Nangi, S. R., Kanchugantla, Y., Rayapati, P. G., Bhowmick, P. K., & Goyal, P. (2022). Augmenting video lectures: Identifying off-topic concepts and linking to relevant video lecture segments. International Journal of Artificial Intelligence in Education, 32 (2), 382-412. DOI: 10.1007/s40593-021-00257-z
Goyal, P., Behera, L., & McGinnity, T. M. (2012). A context-based word indexing model for document summarization. IEEE Transactions on Knowledge and Data Engineering, 25 (8), 1693-1705. DOI: 10.1109/TKDE.2012.114
Guo, T., Bai, X., Tian, X., Firmin, S., & Xia, F. (2022). Educational anomaly analytics: features, methods, and challenges. Frontiers in big Data, 4, 811840.       
DOI: 10.3389/fdata.2021.811840
Hassani, H., Ershadi, M. J., & Mohebi, A. (2022). LVTIA: A new method for keyphrase extraction from scientific video lectures. Information Processing & Management, 59 (2), 102802. DOI: 10.1016/j.ipm.2021.102802
Hassani, H., Mohebi, A., Ershadi, M. J., & Jalalimanesh, A. (2023). A novel data quality framework for assessment of scientific lecture video indexing. Library Hi Tech.              
DOI: 10.1108/LHT-02-2023-0074
Husain, M., & Meena, S. M. (2019, February). Multimodal fusion of speech and text using semi-supervised LDA for indexing lecture videos. In 2019 National Conference on Communications (NCC) (pp. 1-6). IEEE. DOI: 10.1109/NCC.2019.8732253
Hwang, C. L., & Yoon, K. (1981). Methods for multiple attribute decision making. Multiple attribute decision making: methods and applications a state-of-the-art survey, 58-191.
DOI: 10.1007/978-3-642-48318-9_3
Jiang, Z., Miao, C., & Li, X. (2017). Application of keyword extraction on MOOC resources. International Journal of Crowd Science, 1 (1), 48-70. DOI: 10.1108/IJCS-12-2016-0003
Lotfabadi, H. (1996). Assessment and measurement in educational sciences and psychology. Mashhad: Samt.
Martins, J., Gonçalves, R., & Branco, F. (2022). A bibliometric analysis and visualization of e-learning adoption using VOSviewer. Universal Access in the Information Society, 1-15. DOI: 10.1007/s10209-022-00953-0
Pandiaraja, P., Boopesh, K. B., Deepthi, T., Laksmi Priya, M., & Noodhana, R. (2022, February). An analysis of document summarization for educational data classification using NLP with machine learning techniques. In International Conference on Computing in Engineering & Technology (pp. 127-143). Singapore: Springer Nature Singapore.                         
DOI: 10.1007/978-981-19-2719-5_12
Parasuraman, A., Zeithaml, V. A., & Berry, L. L. (1988). Servqual: A multiple-item scale for measuring consumer perc. Journal of retailing, 64 (1), 12.
Priestley, M., O’donnell, F., & Simperl, E. (2023). A survey of data quality requirements that matter in ML development pipelines. ACM Journal of Data and Information Quality, 15 (2), 1-39. DOI: 10.1145/3592616
Rahman, M. R., Koka, R. S., Shah, S. K., Solorio, T., & Subhlok, J. (2024). Enhancing lecture video navigation with AI generated summaries. Education and Information Technologies, 29 (6), 7361-7384. DOI: 10.1007/s10639-023-11866-7
Sun, F., & Tian, X. (2022). Lecture video automatic summarization system based on DBNet and Kalman filtering. Mathematical Problems in Engineering, 2022 (1), 5303503.   
DOI: 10.1155/2022/5303503
Uppal, M. A., Ali, S., & Gulliver, S. R. (2018). Factors determining e-learning service quality. British journal of educational technology, 49 (3), 412-426. DOI: 10.1111/bjet.12552
Valverde, C., Marotta, A., Panach, J. I., & Vallespir, D. (2022). Towards a model and methodology for evaluating data quality in software engineering experiments. Information and Software Technology, 151, 107029. DOI: 10.1016/j.infsof.2022.107029
Yang, H., & Meinel, C. (2014). Content based lecture video retrieval using speech and video text information. IEEE transactions on learning technologies, 7 (2), 142-154. DOI: 10.1109/TLT.2014.2307305

  • Receive Date 24 January 2024
  • Revise Date 21 April 2024
  • Accept Date 27 April 2024