Iranian Journal of Information Processing and Management

Iranian Journal of Information Processing and Management

Identification of Data Quality Indicators for Data Governance with Metasynthesis and Fuzzy Delphi Approach

Document Type : Original Article

Authors
1 PhD Candidate in Knowledge and Information Science; Department of Knowledge and Cumminucation Sciences; Islamic Azad University; Science and Research Branch; Tehran, Iran
2 PhD in Knowledge and Information Science; Professor; Department of Knowledge and Cumminucation Sciences; Islamic Azad University; Science and Research Branch; Tehran, Iran;
3 PhD in Knowledge and Information Science; Associate Professor; Department of Knowledge and Information Science; Allameh Tabataba’i University; Tehran, Iran
4 PhD in Knowledge and Information Science; Associate Professor; Iranian Research Institute for Information Science and Technology (IranDoc); Tehran, Iran
Abstract
Data is considered as an asset in organizations and its quality is an important principle to achieve productivity. For optimal management these organizational assets need a kind of governance, so that with its help, the data align with the goals of our constructive leadership. The purpose of this research is to identify the dimensions of data quality for data governance. To achieve this goal, a two-stage qualitative approach was used. In the first stage of the meta-combination method by searching for the key and keywords of data quality, Data management, data governance and data quality management in Air&Dec databases, Science Direct, Google Scholar, Springer, IEEE and ACM were conducted between the years (1995-2022) and 268 related articles were identified and in a detailed study and evaluation, 62 approved articles were decided. By reviewing and studying these articles fully, 8 concepts were identified. After applying the opinions of professors and three data experts, finally 55 components were extracted for the research question. In the second stage, fuzzy Delphi method was used to get the opinion of experts. For this purpose, the items needed to design a fuzzy Delphi questionnaire were provided from the output of metacombination, and this process was continued until the experts’ opinions on the answers to the questions reached a consensus. 21 experts who had at least one research paper in the field of data quality were selected and finally 14 completed questionnaires were returned. In response to the questions of data quality dimensions for data governance, there are 46 indicators: completeness, timing, communication, accessibility, compliance with laws and standards, confidentiality, interpretability, redundancy (ability to add), reputation and reliability, ability traceability, value (value), simplicity, update, concept, regularity, linkability, referential integrity, uniqueness, purposefulness, auditability, accuracy, comparability, consistency, commonality, completeness, metadata compliance, acceptability, validity, conciseness, applicability (usability), believability, comprehensibility, reliability, reasonableness, consistency, retrievability, reproducibility, ability to display null values, appropriateness, clarity, added value, comprehensiveness, extensibility, patternability and variability were identified. More than 80% of the indicators obtained from the extracombination were accepted by the experts, so our system and data-driven businesses can use these indicators as a priority for measuring the quality of their data. 
Keywords
Subjects

فهرست منابع
اخو‎‎ان آملی، ر‎امین. 1375. نقش اطلاعات د‎ر‎ شناخت عو‎‎امل د‎اخلی، خار‎جی و‎‎ محیطی سیستم. ماهنامه تد‎بیر 66: 38-41.
ارشادی، محمدجواد، جلال‌الدین نصیری، و فرهاد شیرانی. 1396. طراحی مدل کیفیت فراداده در سامانه ثبت پایان‌نامه، رساله‌های دانش‌آموختگان داخل و خارج کشور. طرح پژوهشی. وزارت علوم، تحقیقات و فناوری. پژوهشگاه علوم و فناوری اطلاعات ایران.
اشتریان اصفهانی، آیناز، محمدجواد ارشادی، و امیر عزیزی. 1398. توسعه شاخص‌های کیفیت داده به‌منظور ارزیابی سامانه‌های اطلاعاتی تحقیقاتی: یک مطالعه موردی. فصلنامه علمی-پژوهشی مدیریت استاندارد و کیفیت 9 (بهار): 60-74.
خسروانجم، داود، علی‌اصغر انواری رستمی، رسول چاوشینی، مسعود احمدزاده. 1392. توسعه مدل‌های AHP فازی برای ارزیابی تأثیر قابلیت‌های IT و ابعاد کیفیت داده‌ها. فصلنامه مدیریت صنعتی دانشگاه آزاد اسلامی واحد سنندج 8 (25): 116-105.
خلیلی جعفرآباد، احمد. 1396. بررسی تغییرات حوزه کیفیت داده با استفاده از تحلیل کلمات کلیدی.
دو-فصلنامه علمی-پژوهشی مدیریت اطلاعات 2 (3): 121-138.
رحیمی، علی‌رضا . 1395. بررسی تحولات پژوهش‌های حوزه ارزیابی کیفیت داده‌ها و اطلاعات در نظام‌های اطلاعاتی از سال 2000 تا نیمه نخست سال 2015. پژوهشنامه پردازش و مدیریت اطلاعات
http://Jipm.irandoc.ac.ir (دسترسی در 08/4/1396).
ر‎د‎من، تو‎‎ماس سی. 1381. د‎اد‎ه چیست یا د‎اد‎ه‎ها‎ چه هستند‎؟ تر‎جمه محمد‎حسین د‎یانی. کتابد‎ار‎ی و‎‎ اطلاع‌ر‎سانی 5 (20): 81-110.
ر‎ضاییان، علی. 1374. سیستم‌های اطلاعاتی مد‎یر‎یت. فصلنامه تحو‎‎ل اد‎ار‎ی ساز‎‎‎مان امو‎‎ر‎ اد‎ار‎ی و‎‎ استخد‎امی کشو‎‎ر 10 و‎‎ 11: 16-26.
سهرابی، بابک، حمیدرضا یزدانی، محمدجواد ارشادی، و سوده دوروش. 1400. شناخت و تحلیل سیستمی متدولوژی‌های کیفیت داده و ارائه یک چارچوب جامع (با استفاده از روش فراترکیب). پژوهشنامه پردازش و مدیریت اطلاعات 36 (3): 737-766.
صالحی، احمد، محمد محمد اقدسی، توکتم خطیبی، و مجید شیخ محمدی. 1402. ارائه یک چارچوب مفهومی برای پیش‌پردازش و بهبود کیفیت نگاره‌های رویداد در فرایندکاوی. پژوهشنامه پردازش و مدیریت اطلاعات 38 (3): 945-979.
عبدالوند، ندا، و آتنا بوبه رژ. 1395. رویکردی سیستماتیک به چالش کیفیت داده در راهبرد مشتری‌محور در صنعت بانکداری. فصلنامه علمی-پژوهشی تحقیقات بازاریابی نوین 1 (6): 177-196.
غفاری، مسعود، عادل آذر، و اشکان شباک. 1397. نگاشت علّی مدیریت محصول آماری با رویکرد کیفیت داده. پژوهشنامه پردازش و مدیریت اطلاعات 3 (33): 1041-1064.
محجوب، عباس، و حمیدرضا سیمه‌ساز. 1394. کیفیت داده‌ها پیشنیاز مدیریت منابع سازمان. فصلنامه مدیریت استاندارد و کیفیت 5 (17): 53-62.
نورزاد، عبدالرحمان. 1390. بهبود کیفیت داده‌ها در کامل بودن داده با استفاده از قوانین وابستگی. پایان‌نامه کارشناسی ارشد علوم کامپیوتر- مهندسی نرم‌افزار. دانشگاه پیام نور مرکز تهران. دانشکده فنی و مهندسی.
References
Ardagna, Danilo, Cinzia Cappiello, Walter Samá, & Monica Vitali. 2018. Context-aware data quality assessment for big data. Future Generation Computer Systems 89: 548-562.
Batini, C., C. Cappiello, C. Francalanci, & A. Maurino. 2009. Methodologies for data quality assessment and improvement. ACM computing surveys (CSUR) 41 (3):16.‏
Batini, C. & M. Scannapiec. 2016. In book: Data and Information Quality Dimensions, Principles and Techniques (pp.403-419). Cham: Springer International Publishing.
Bharati, P., & A. Chaudhury. 2004. An empirical investigation of decision-making satisfaction in web-based decision support systems. Decision support systems 37 (2): 187-197.‏
Bizer, C., & R. Cyganiak. 2009. Quality-driven information filtering using the WIQA policy framework. Journal of Web Semantics 7 (1): 1-10.‏
Bonner, J. M. 2010. Customer interactivity and new product performance: Moderating effects of product newness and product embeddedness. Industrial marketing management 39 (3): 485-492.‏
Boritz, J. E. 2005. IS practitioners’ views on core concepts of information integrity. International Journal of Accounting Information Systems 6 (4): 260-279.‏
Bovee, M., R. P. Srivastava, & B. Mak. 2003. A conceptual framework and belief-function approach to assessing overall information quality. International journal of intelligent systems 18 (1): 51-74.‏
CDDQ (List of Conformed Dimensions of Data Quality. 2019. Retrieved from https://dimensionsofdataquality.com/alldimensions (accessed Mar 5, 2022)
Chaffey, D., & G. White. 2010. Business information management: improving performance using information systems. Canada; Pearson Education
Chen, C. C., & Y. D. Tseng. 2011. Quality evaluation of product reviews using an information quality framework. Decision Support Systems 50 (4): 755-768.‏
Chen, C. W. 2010. Impact of quality antecedents on taxpayer satisfaction with online tax-filing systems—An empirical study. Information & Management 47 (5-6): 308-315.‏
Chengalur-Smith, I. N., D. P. Ballou, & H. L. Pazer. 1999. The impact of data quality information on decision making: an exploratory analysis. IEEE Transactions on Knowledge and Data Engineering 11 (6): 853-864.‏
Chien, S. W., & S. M. Tsaur. 2007. Investigating the success of ERP systems: Case studies in three Taiwanese high-tech industries. Computers in industry 58 (8-9): 783-793.‏
Chung, W. 2006. Studying information seeking on the non-English Web: An experiment on a Spanish business Web portal. International Journal of Human-Computer Studies 64 (9): 811-829.‏
Dai, T., H. Hu, Y. Wan, Q. Chen, & Y. Wang. 2015. A data quality management and control framework and model for health decision support. In 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD) (pp. 1792-1796). Zhangjiajie, China.
DAMA (Data Management Association International) 2017. DAMA-DMBOK. Data Management Body of Knowledge. 2nd Edition. New Jersey: Technics Publications LLC.
DAMA (Data Management Association International) 2008. The DAMA dictionary of data management. Denver, Colorado: Technics publications.
DAMA-UK (Data Management Association International) 2013. The six primary dimensions for data quality assessment: Defining Data Quality Dimensions. https://www.sbctc.edu/resources/documents/colleges-staff/commissions-councils/dgc/data-quality-deminsions.pdf (accessed Oct. 8, 2020).
Earley, S. 2011. The DAMA Dictionary of Data Management. (2nd). NJ: Technics Publications LLC.
English, L. P. 1999. Improving Data Warehouse and Business Information Quality: Methods for Reducing Costs and Increasing Profits. Hoboken, NJ: Wiley.
Ehrlinger, L., & W. Wöß. 2022. A survey of data quality measurement and monitoring tools. Frontiers in Big Data 5: 850611.
Eurostat. 2015. ESS Handbook for Quality Reports. Brussels, Belgium: Eurostat.
Freitas, A., T. Knap, S. O’Riain, & E. Curry. 2011. W3P: Building an OPM based provenance model for the Web. Future Generation Computer Systems 27 (6): 766-774.‏
Freitas, A., T. Knap, S. O’Riain, & E. Curry. 2011. W3P: Building an OPM based provenance model for the Web. Future Generation Computer Systems 27 (6): 766-774.‏
Goel, S., & I. N. Chengalur-Smith. 2010. Metrics for characterizing the form of security policies. The Journal of Strategic Information Systems 19 (4): 281-295.‏
Haider, A., & S. H. Lee. 2012. Using six sigma for continuous improvement of asset management information quality. In Internatial Conference on Information Resources Management
)CONF-IRM(, Retrieved from https://aisel.aisnet.org/confirm2012/ (accessed Feb 25, 2022).
Hassenstein, M. J., & P. Vanella. 2022. Data quality—concepts and problems. Encyclopedia 2 (1): 498-510
Hazen, B. T., C. A. Boone, J. D. Ezell, & L. A. Jones-Farmer. 2014. Data quality for data science, predictive analytics, and big data in supply chain management: An introduction to the problem and suggestions for research and applications. International Journal of Production Economics 154: 72-80.‏
Hipp, J., U. Guntzer, & U. Grimmer. 2002. Data quality mining. In DMKD2001 Workshop on Research Issues in Data Mining and Knowledge Discovery DMKD2001.‏ Santa Barbara, CA, USA.
Hsieh, C. C., P. L. Kuo, S. C. Yang, & S. H. Lin. 2010. Assessing blog-user satisfaction using the expectation and disconfirmation approach. Computers in Human Behavior 26 (6): 1434-1444.‏
Hsu, Yu-Lung, Cheng-Haw Lee, & V.B. Kreng. 2010. The application of Fuzzy Delphi Method and Fuzzy AHP in lubricant regenerative technology selection. Expert Systems with Applications 37 (1): 419-425.
Ifinedo, P., B. Rapp, A. Ifinedo, & K. Sundberg. 2010. Relationships among ERP post-implementation success constructs: An analysis at the organizational level. Computers in Human Behavior 26 (5): 1136-1148
ISO 25012. (n.d.). Retrieved from https://iso25000.com/index.php/en/iso-25000-standards/iso-25012 (accessed Feb.20, 2022)
Jin, X. L., C. M. Cheung, M. K. Lee, & H. P. Chen. 2009. How to keep members using the information in a computer-supported social network. Computers in Human Behavior 25 (5): 1172-1181.‏
Jarke, M., R. Gallersdörfer, M. A. Jeusfeld, M. Staudt, & S. Eherer. 1995. ConceptBase—a deductive object base for meta data management. Journal of Intelligent Information Systems 4 (2): 167-192.‏
Kim, B., & I. Han. 2011. The role of utilitarian and hedonic values and their antecedents in a mobile data service environment. Expert Systems with Applications 38 (3): 2311-2318.‏
Kim, C., E. Oh, N. Shin, & M. Chae. 2009. An empirical investigation of factors affecting ubiquitous computing use and U-business value. International Journal of Information Management 29 (6): 436-448.
Kim, W., B. J. Choi, E. K. Hong, S. K. Kim, & D. Lee. 2003. A taxonomy of dirty data. Data mining and knowledge discovery 7 (1): 81-99.‏
Kim, Y. J., R. Kishore, & G. L. Sanders. 2005. From DQ to EQ: understanding data quality in the context of e-business systems. Communications of the ACM 48 (10): 75-81.‏
Kwon, O., N. Lee, & B. Shin. 2014. Data quality management, data usage experience and acquisition intention of big data analytics. International journal of information management 34 (3): 387-394.‏
Larburu, N., R. Bults, M. van Sinderen, & H. Hermens. 2015. Quality-of-data management for telemedicine systems. Procedia Computer Science 63: 451-458.‏
Lederer, A. L., D. J. Maupin, M. P. Sena, & Y. Zhuang. 2000. The technology acceptance model and the World Wide Web. Decision support systems 29 (3): 269-282.‏
Lee, H., J. Kim, & J. Kim. 2007. Determinants of success for application service provider: An empirical test in small businesses. International journal of human-computer studies 65 (9): 796-815.‏
‏Lee, K. C., & N. Chung. 2009. Understanding factors affecting trust in and satisfaction with mobile banking in Korea: A modified DeLone and McLean’s model perspective. Interacting with computers 21 (5-6): 385-392.‏
Lee, S. H., & A. Haider. 2011. A Framework for Information Quality Assessment Using Six Sigma Approach. Communications of the IBIMA. https://ibimapublishing.com/journals/communications-of-the-ibima (accessed Mar 6, 2022).
Lee, Y. W., D. M. Strong, B. K. Kahn, & R. Y. Wang. AIMQ: a methodology for information quality assessment. Information & management 40 (2): 133-146.‏
Li, S., S. S. Rao, T. S. Ragu-Nathan, & B. Ragu-Nathan. 2005. Development and validation of a measurement instrument for studying supply chain management practices. Journal of operations management 23 (6): 618-641.‏
Lin, A. 2006. The acceptance and use of a business-to-business information system. International Journal of Information Management 26 (5): 386-400.‏
Michnik, J., & M. C. Lo. 2009. The assessment of the information quality with the aid of multiple criteria analysis. European Journal of Operational Research 195 (3): 850-856.‏
Negash, S., T. Ryan, & M. Igbaria. 2003. Quality and effectiveness in web-based customer support systems. Information & management 40 (8): 757-768.‏
Naumann, F. 2002. Quality-driven query answering for integrated information systems (Vol. 2261). Berlin: Springer: Springer.‏
Park, J., J. Kim, & J. Koh. 2010. Determinants of continuous usage intention in web analytics services. Electronic Commerce Research and Applications 9 (1): 61-72.
Peer, E., D. Rothschild, A. Gordon, Z. Evernden, & E. Damer. 2022. Data quality of platforms and panels for online behavioral research. Behavior research methods. 54 (4): 1643-1662.
Petter, Stacie, William DeLone & Ephraim McLean. 2008. Measuring information systems success: models, dimensions, measures, and interrelationships. European Journal of Information Systems 17: 236–263.
Redman, T. C., & A. Blanton. 1996. Data quality for the information age. Norwood: Artech House.‏
Salaün, Y., & K. Flores. 2001. Information quality: Meeting the needs of the consumer. International Journal of Information Management 21 (1): 21-37.‏
Sandelowski, M., and J. Barroso. 2007. Handbook for synthesizing qualitative research. New York, NY: Springer.
Sebastian-Coleman, L. 2012. Measuring data quality for ongoing improvement: a data quality assessment framework. ‏Amsterdam: Morgan Kaufmann.
Serhani, M. A., H. T. El Kassabi, I. Taleb, & A. Nujum. 2016. An hybrid approach to quality evaluate across big data value chain. In 2016 IEEE International Congress on Big Data (Big Data Congress) (pp. 418-425). IEEE.
Song, J. 2007. Trust in health infomediaries. Decision support systems 43 (2): 390-407.‏
_____. and Fatemeh “Mariam” Zahedi. 2007. Trust in health infomediaries. Decision support systems 43 (2): 390-407.‏         
Stjepandić, J., & W. Korol. 2022. Data quality management for interoperability. DigiTwin: An Approach for Production Process Optimization in a Built Environment, 135-153. https://link.springer.com/chapter/10.1007/978-3-030-77539-1_7. (accessed Sept.25, 2021).
Sung, T. J., & M. You. 2007. A method for establishing an online design audit platform. Design Studies 28 (2): 195-211.‏
Syed, R., R. Eden, T. Makasi, I. Chukwudi, A. Mamudu, M. Kamalpour, ... & T. Myers. 2023. Digital Health Data Quality Issues: Systematic Review. Journal of Medical Internet Research 25: e42615.
Taleb, I., & M. A. Serhani. 2017. Big Data pre-processing: closing the data quality enforcement loop. In 2017 IEEE International Congress on Big Data (BigData Congress) (pp. 498-501). IEEE. Boston, USA.
Taleb, I., H. T. El Kassabi, M. A. Serhani, R. Dssouli, & C. Bouhaddioui. 2016, July. Big data quality: A quality dimensions evaluation. In 2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress (UIC/ATC/ScalCom/CBDCom/IoP/SmartWorld) (pp. 759-765). Toulouse, France.
Tayi, G. K., & D. P. Ballou. 1998. Examining data quality. Communications of the ACM 41 (2): 54-57.‏
Wagner, T., N. R. Lottig, M. L. Bartley, E. M. Hanks, E. M. Schliep, N. B. Wikle, ... & J. Zhou. 2020. Increasing accuracy of lake nutrient predictions in thousands of lakes by leveraging water clarity data. Limnology and Oceanography Letters 5 (2): 228-235.   
Wand, Y., & R. Y. Wang. 1996. Anchoring data quality dimensions in ontological foundations. Communications of the ACM 39 (11): 86-95.‏
Wang, R. Y. 1998. A product perspective on total data quality management. Communications of the ACM 41 (2): 58-66.‏
_____, & D. M. Strong. 1996. Beyond accuracy: What data quality means to data consumers. Journal of management information systems 12 (4): 5-33.‏
Wu, X., W. Zheng, X. Xia, & D. Lo. 2021. Data quality matters: A case study on data label correctness for security bug report prediction. IEEE Transactions on Software Engineering 48 (7): 2541-2556.
Yeganeh, N. K., S. Sadiq, M. A. & Sharaf. 2014. A framework for data quality aware query systems. Information Systems 46: 24-44.‏
Volume 40, Issue 3 - Serial Number 123
Spring 2025
Pages 987-1020

  • Receive Date 28 January 2024
  • Revise Date 17 September 2024
  • Accept Date 06 October 2024