Iranian Journal of Information Processing and Management

Iranian Journal of Information Processing and Management

Taxonomy of Customers Identification in Banking Industry Using Machine Learning: a Systematic Review with a Meta-Synthesis Approach

Document Type : Original Article

Authors
1 Department of Information Technology Management, Tarbiat Modares University
2 PhD in Management; Professor; Department of Business Administration; Faculty of Economics and Administrative Sciences; Vali-e-Asr University of Rafsanjan; Rafsanjan, Iran;
3 PhD in Systems Management; Professor; Department of Information Technology Management; Management and Economy School; Tarbiat Modares University; Tehran, Iran;
4 Professor, Department of Industrial Management, Tarbiat Modarres University, Tehran, Iran.
10.22034/jipm.2023.704230
Abstract
Nowadays, customers in any industry are not just looking for a product and expect to receive a personalized service based on their needs and create a different experience from the organization. From another point of view, the design of services according to the customer's needs will require a careful examination of the data related to the customer in different dimensions. Therefore, knowing the customer requires a systematic approach in order to identify the goals, influencing factors, algorithms and methods suitable for this field.
The upcoming research has analyzed the dimensions of customer recognition in the banking industry and its considerations with a data-oriented approach and the application of machine learning. The research method is applied according to the purpose and is meta-synthesis according to the collection of information. To select articles, 43 documents published in the period of 2016-2022 were identified as relevant and valid documents by searching in the reliable databases of Web of Science and Scopus, and further, with a meta-synthesis approach they were studied and coded.
The results of meta-synthesis led to the identification of three main categories: 1) customer identification objectives: understanding customer insight, identifying customer risk, organizational goals, determining customer lifetime value and product management, 2) customer identification factors: demographic, financial and behavioral and, 3) machine learning algorithms: Probabilistic, Neural Networks, Ensemble, Regularization, Regression, Bayesian, Decision Tree, Dimensionality Reduction, Instanced Based and Clustering. 
Based on the findings of the current research, according to the purpose of customer identification, available data and selected factors, basic and combined algorithms can be a way forward, but the important point is accurate data pre-processing. Another point is that no distinction has been made between real and legal customers, and most studies have focused on real customers, which can be attributed to the complexity of the chain of financial interactions of legal customers. Also, considering the banks' lack of emphasis on branches regarding the need to complete the information contained in the account opening forms or the lack of designing a suitable service to complete the information on electronic platforms, a more accurate understanding of the customer requires a review of these processes. It is worth mentioning that in none of the studies, customer recognition was done using only demographic factors, and depending on the purpose of the study, the factors were used in combination.
Keywords
Subjects

References:
Abbasimehr, H. & M. Shabani. 2021. A new methodology for customer behavior analysis using time series clustering A case study on a bank's customers. Kybernetes 50 (2): 221-242. https://doi.org/10.1108/k-09-2018-0506
AbdolvandAmir, N. & A. Albadvi. 2014. Customer Lifetime Value: Literature Scoping Map, and an Agenda for Future Research. international Journal of Management Perspective 1 (3):41-59.
Alam, T. M., K. Shaukat, I. A. Hameed, S. Luo, M. U. Sarwar, S. Shabbir, J. Li, & M. Khushi. 2020. An investigation of credit card default prediction in the imbalanced datasets [Article]. Ieee Access 8: 201173-201198. https://doi.org/10.1109/ACCESS.2020.3033784
Al-A'raj, M. & M. F. Abbod. 2016. Classifiers consensus system approach for credit scoring [Article]. Knowledge-Based Systems 104: 89-105. https://doi.org/10.1016/j.knosys.2016.04.013
_____ & M. Majdalawieh. 2021. Modelling customers credit card behaviour using bidirectional LSTM neural networks. Journal of Big Data 8 (1): Article 69. https://doi.org/10.1186/s40537-021-00461-7
Al-Qerem, A., G. Al-Naymat, M. Alhasan, & M. Al-Debei. 2020. Default prediction model: The significant role of data engineering in the quality of outcomes [Article]. International Arab Journal of Information Technology 17 (4 Special Issue): 635-644. https://doi.org/10.34028/iajit/17/4A/8
Al-Rubaiee, H., K. Alomar, R. Qiu, & D. Li. 2018. Tuning of Customer Relationship Management (CRM) via Customer Experience Management (CEM) using sentiment analysis on aspects level [Article]. International Journal of Advanced Computer Science and Applications 9 (5): 300-312. https://doi.org/10.14569/IJACSA.2018.090540
Arli, D. 2017. "Investigating consumer ethics: a segmentation study. Journal of Consumer Marketing, 34 (7): 636-645. https://doi.org/10.1108/JCM-08-2016-1908
Ashofteh, A., & J. M. Bravo. 2021. A conservative approach for online credit scoring* [Article]. Expert Systems with Applications, 176, 16, Article 114835. https://doi.org/10.1016/j.eswa.2021.114835
Atkins, S., S. Lewin, H. Smith, M. Engel, A. Fretheim, & J. Volmink. 2008. Conducting a meta-ethnography of qualitative literature: Lessons learnt. BMC Medical Research Methodology 8 (1) 21. https://doi.org/10.1186/1471-2288-8-21
Behare, N., S. Waghulkar, & S. A. Shah. 2018. A Theoretical Perspective on Customer Experience (CX) in Digital Business Strategy. [2018 ieee international conference on research in intelligent and computing in engineering (rice iii)]. 3rd IEEE International Conference on Research in Intelligent and Computing in Engineering (RICE), Univ Don Bosco, San Salvador, EL SALVADOR.
Bekamiri, H., M. Mehraeen, A. Pooya & H. Sharif. 2020. A Stochastic Approach for Valuing Customers in Banking Industry: A Case Study. Industrial Engineering and Management Systems, 19 (4), 744-757. https://doi.org/10.7232/iems.2020.19.4.744
Bharathi, S. V., D. Pramod, & R. Raman. 2022. An Ensemble Model for Predicting Retail Banking Churn in the Youth Segment of Customers. DATA, 7 (5), Article 61. https://doi.org/10.3390/data7050061
Borna,K., Sh. Hoseini & M. Aghaei. 2019. Customer satisfaction prediction with Michigan‑style learning classifier system. SN Applied Sciences. https://doi.org/https://doi.org/10.1007/s42452-019-1493-1
Calvo-Porral, C., & J. P. Levy-Mangin. 2020. An emotion-based segmentation of bank service customers. International Journal of Bank Marketing, 38 (7): 1441-1463. https://doi.org/10.1108/ijbm-05-2020-0285
Chen, T. H. 2020a. Do you know your customer? Bank risk assessment based on machine learning. Applied Soft Computing, 86, Article 105779. https://doi.org/10.1016/j.asoc.2019.105779
Chen, T. H. 2020b. Do you know your customer? Bank risk assessment based on machine learning [Article]. Applied Soft Computing Journal, 86, Article 105779. https://doi.org/10.1016/j.asoc.2019.105779
Cheng, L. C., C. C. Wu, & C. Y. Chen. 2019. Behavior Analysis of Customer Churn for a Customer Relationship System: An Empirical Case Study. Journal of Global Information Management 27 (1): 111-127. https://doi.org/10.4018/jgim.2019010106
Chopra, A., & P. Bhilare. 2018. Application of Ensemble Models in Credit Scoring Models [Article]. Business Perspectives and Research 6 (2): 129-141. https://doi.org/10.1177/2278533718765531
Çığşar, B., & D. Ünal. 2019. Comparison of Data Mining Classification Algorithms Determining the Default Risk. Scientific Programming, 2019, 8706505. https://doi.org/10.1155/2019/8706505
Coser, A., A. Aldea, M. M. Maer-Matei & L. Besir. 2020. Propensity to Churn in Banking: what Makes Customers Close the Relationship with a Bank? Economic Computation and Economic Cybernetics Studies and Research 54 (2): 77-94. https://doi.org/10.24818/18423264/54.2.20.05
Dawood, E. A., E. Elfakhrany & D. A. Maghraby. 2019. Improve Profiling Bank Customer's Behavior Using Machine Learning. Ieee Access 7: 109320-109327. https://doi.org/10.1109/access.2019.2934644
De Lima Lemos, R. A., T. C. Silva & B. M. Tabak. 2022. Propension to customer churn in a financial institution: a machine learning approach. Neural Computing and Applications 34 (14): 11751-11768. https://doi.org/10.1007/s00521-022-07067-x
Domingos, E., B. Ojeme & O. Daramola. 2021. Experimental Analysis of Hyperparameters for Deep Learning-Based Churn Prediction in the Banking Sector. Computation 9 (3): Article 34. https://doi.org/10.3390/computation9030034
Eltahir, A., T. Ahmed, A. Abdelmageed Mohammed, A. Hilal, & T, A. A. 2022. An Approach of Supervised and Unsupervised Machine Learning Model for E-CRM Bank’s Marketing. International Journal of Computer Science and Network Security 22 (4): 625-636.
Erwin, E., M. J. Brotherson & J. A. Summers. 2011. Understanding Qualitative Metasynthesis: Issues and Opportunities in Early Childhood Intervention Research. Journal of Early Intervention 33 (3). https://doi.org/10.1177/1053815111425493
Estrella-Ramon, A., M. Sanchez-Perez, G. Swinnen & K. VanHoof. 2017. A model to improve management of banking customers. Industrial Management & Data Systems 117 (2): 250-266. https://doi.org/10.1108/imds-03-2016-0107
Firman Pradana Rachman, Handri Santoso & A. Djajadi. 2021. Machine Learning Mini Batch K-means and Business Intelligence Utilization for Credit Card Customer Segmentation. International Journal of Advanced Computer Science and Applications 12 (10): 218-227.
Furio. Camillo & C. Liberati. 2018. Personal values and credit scoring: new insights in the financial prediction. Journal of the Operational Research Society 69 (12) https://doi.org/10.1080/01605682.2017.1417684
Garcia-Mendez, S., M. Fernandez-Gavilanes, J. Juncal-Martinez, F. J. Gonzalez-Castano & O. B. Seara. 2020. Identifying Banking Transaction Descriptions via Support Vector Machine Short-Text Classification Based on a Specialized Labelled Corpus. Ieee Access, 8: 61642-61655. https://doi.org/10.1109/access.2020.2983584
Gartner. 2017. The customer experience marketing leader's first 100 days. https://www.gartner.com/en/marketing/insights/articles/the-customer-experience-marketing-leaders-first-100-days
Hajipour, B. & M. Esfahani. 2019. Delta model application for developing customer lifetime value [Article]. Marketing Intelligence and Planning, 37 (3): 298-309. https://doi.org/10.1108/MIP-06-2018-0190
Huseynov, F. & S. Özkan Yıldırım. 2019. Online Consumer Typologies and Their Sho pping Behaviors in B2C E-Commerce Platforms. SAGE Open 9 (2): 215824401985463. https://doi.org/10.1177/2158244019854639
Ieva, M., & C. Ziliani. 2018. Mapping touchpoint exposure in retailing: Implications for developing an omnichannel customer experience [Article]. International Journal of Retail and Distribution Management 46 (3): 304-322. https://doi.org/10.1108/IJRDM-04-2017-0097
Jędrzejczyk, W. 2021. Managing Customer Relations and Value in Organizations with the Use of IT Tools: Customer Segmentation on the Market of Eco-Innovative Services. Procedia Computer Science 192: 2816-2825. https://doi.org/10.1016/j.procs.2021.09.052
Keramati, A., H. Ghaneei & S. M. Mirmohammadi. 2016. Developing a prediction model for customer churn from electronic banking services using data mining. Financial Innovation 2 (1), Article 10. https://doi.org/10.1186/s40854-016-0029-6
Komulainen, H., & S. Saraniemi. 2019. Customer centricity in mobile banking: a customer experience perspective [Article]. International Journal of Bank Marketing 37 (5): 1082-1102. https://doi.org/10.1108/ijbm-11-2017-0245
Kovacs, T., A. D. Ko & A. Asemi. 2021. Exploration of the investment patterns of potential retail banking customers using two-stage cluster analysis [Article]. Journal of Big Data 8 (1): 25, Article 141. https://doi.org/10.1186/s40537-021-00529-4
Ladyzynski, P., K. Zbikowski & P. Gawrysiak. 2019. Direct marketing campaigns in retail banking with the use of deep learning and random forests. Expert Systems with Applications 134: 28-35. https://doi.org/10.1016/j.eswa.2019.05.020
Li, K., J. Rollins & E. Yan. 2018. Web of Science use in published research and review papers 1997–2017: a selective, dynamic, cross-domain, content-based analysis. Scientometrics 115 (1): 1-20. https://doi.org/10.1007/s11192-017-2622-5
Livieris, I. E., N. Kiriakidou, A. Kanavos, V. Tampakas & P. Pintelas. 2018. On ensemble SSL algorithms for credit scoring problem [Article]. Informatics 5 (4), Article 40. https://doi.org/10.3390/informatics5040040
Long, H. V., L. H. Son, M. Khari, K. Arora, S. Chopra, R. Kumar, T. Le & S. W. Baik. 2019. A new approach for construction of geodemographic segmentation model and prediction analysis [Article]. Computational Intelligence and Neuroscience, 2019, Article 9252837. https://doi.org/10.1155/2019/9252837
Lu, Q., Z. Cui, Y. Chen & X. Chen. 2017. Extracting optimal actionable plans from additive tree models [Article]. Frontiers of Computer Science 11 (1): 160-173. https://doi.org/10.1007/s11704-016-5273-4
Mancisidor, R. A., M. Kampffmeyer, K. Aas, & R. Jenssen. 2021. Learning latent representations of bank customers with the Variational Autoencoder. Expert Systems with Applications, 164, Article 114020. https://doi.org/10.1016/j.eswa.2020.114020
Marinakos, G., & S. Daskalaki. 2017. Imbalanced customer classification for bank direct marketing [Article]. Journal of Marketing Analytics 5 (1): 14-30. https://doi.org/10.1057/s41270-017-0013-7
Melo, L., F. M. Nardini, C. Renso, R. Trani & J. A. Macedo. 2020. A novel approach to define the local region of dynamic selection techniques in imbalanced credit scoring problems [Article]. Expert Systems with Applications 152: 16, Article 113351. https://doi.org/10.1016/j.eswa.2020.113351
Mhlanga, D. 2021. Financial inclusion in emerging economies: The application of machine learning and artificial intelligence in credit risk assessment [Article]. International Journal of Financial Studies 9 (3) Article 39. https://doi.org/10.3390/ijfs9030039
Milosevic, M., N. Zivic & I. Andjelkovic. 2017. Early churn prediction with personalized targeting in mobile social games [Article]. Expert Systems with Applications, 83, 326-332. https://doi.org/10.1016/j.eswa.2017.04.056
Møller, A. M., & P. S. Myles. 2016. What makes a good systematic review and meta-analysis? BJA: British Journal of Anaesthesia 117 (4): 428-430. https://doi.org/10.1093/bja/aew264
Mosavi, A. B., & A. Afsar. 2018. Customer Value Analysis in Banks Using Data Mining and Fuzzy Analytic Hierarchy Processes. International Journal of Information Technology & Decision Making 17 (3): 819-840. https://doi.org/10.1142/s0219622018500104
Motevali, M. M., A. M. Shanghooshabad, R. Z. Aram & H. Keshavarz. 2019. WHO: A New Evolutionary Algorithm Bio-Inspired by Wildebeests with a Case Study on Bank Customer Segmentation. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE 33 (5), Article 1959017. https://doi.org/10.1142/S0218001419590171
Müller, J. M., B. Pommeranz, J. Weisser & K. I. Voigt. 2018. Digital, Social Media, and Mobile Marketing in industrial buying: Still in need of customer segmentation? Empirical evidence from Poland and Germany. Industrial Marketing Management 73: 70-83. https://doi.org/https://doi.org/10.1016/j.indmarman.2018.01.033
Pandey, M. K., M. Mittal & K. Subbiah. 2021. Optimal balancing & efficient feature ranking approach to minimize credit risk [Article]. International Journal of Information Management Data Insights, 1 (2), Article 100037. https://doi.org/10.1016/j.jjimei.2021.100037
Pawełoszek, I. 2021)/. Customer segmentation based on activity monitoring applications for the recommendation system. Procedia Computer Science, 192: 4751-4761. https://doi.org/https://doi.org/10.1016/j.procs.2021.09.253
Plawiak, P., M. Abdar, J. Plawiak, V. Makarenkov & U. R. Acharya. 2020. DGHNL: A new deep genetic hierarchical network of learners for prediction of credit scoring [Article]. Information Sciences, 516: 401-418. https://doi.org/10.1016/j.ins.2019.12.045
Razavi, R., A. Gharipour, M. Fleury & I. J. Akpan. 2019. A practical feature-engineering framework for electricity theft detection in smart grids. Applied Energy, 238: 481-494. https://doi.org/https://doi.org/10.1016/j.apenergy.2019.01.076
Safarkhani, F., & S. Moro. 2021. Improving the Accuracy of Predicting Bank Depositor's Behavior Using a Decision Tree. APPLIED SCIENCES-BASEL (19), Article 9016. https://doi.org/10.3390/app11199016
Sandelowski, M., & J. Barroso. 2006. Handbook for synthesizing qualitative research. New york: Springer Publishing Company.
Seidlova, R., J. Poživil & J. Seidl. 2019. Marketing and business intelligence with help of ant colony algorithm [Article]. Journal of Strategic Marketing 27 (5): 451-463. https://doi.org/10.1080/0965254X.2018.1430058
Sheikha, A., T. Ghanbarpourb & D. Gholamiangonabadib. 2019. A Case Study of Fintech Industry: A Two-Stage Clustering Analysis for Customer Segmentation in the B2B Setting. Journal of Business-to-Business Marketing 26 (2). https://doi.org/10.1080/1051712X.2019.1603420
Shirazi, F., & M. Mohammadi. 2019. A big data analytics model for customer churn prediction in the retiree segment. International Journal of Information Management 48: 238-253. https://doi.org/10.1016/j.ijinfomgt.2018.10.005
Sivasankar, E., C. Selvi & S. Mahalakshmi. 2020. Rough set-based feature selection for credit risk prediction using weight-adjusted boosting ensemble method [Article]. Soft Computing 24 (6): 3975-3988. https://doi.org/10.1007/s00500-019-04167-0
Torrens, M. & A. Tabakovic. 2022. A Banking Platform to Leverage Data Driven Marketing with Machine Learning. Entrop 24 (3). https://doi.org/10.3390/e24030347
Urkup, C., B. Bozkaya & F. S. Salman. 2018. Customer mobility signatures and financial indicators as predictors in product recommendation [Article]. Plos One 13 (7): 18, Article e0201197. https://doi.org/10.1371/journal.pone.0201197
World economic Forum. 2017. Shaping the Future of Retail for Consumer Industries. https://www3.weforum.org/
Wu, J., L. Shi, W. P. Lin, S. B. Tsai, Y. Li, L. Yang & G. Xu. 2020. An Empirical Study on Customer Segmentation by Purchase Behaviors Using a RFM Model and <i>K</i>-Means Algorithm. Mathematical Problems in Engineering 2020: 8884227. https://doi.org/10.1155/2020/8884227
Xi, Song, Matthew Tingchi Liu, Qianying Liu, & Ben NiuBen Niu. 2021. Hydrological cycling optimization‐based multiobjective feature‐selection method for customer segmentation. International Journal of Intelligent Systems 36 (16). https://doi.org/10.1002/int.22381
Yanik, S., & A. Elmorsy. 2019. SOM approach for clustering customers using credit card transactions. International Journal of Intelligent Computing and Cybernetics 12 (3): 372-388. https://doi.org/10.1108/ijicc-11-2018-0157

  • Receive Date 18 October 2022
  • Revise Date 27 February 2023
  • Accept Date 30 October 2022