Iranian Journal of Information Processing and Management

Iranian Journal of Information Processing and Management

Next-Gen Machine Learning Models: Pushing the Boundaries of AI

Document Type : Original Article

Authors
1 Al-Turath University, Baghdad 10013, Iraq,
2 Al-Mansour University College, Baghdad 10067, Iraq
3 Osh State University, Osh City 723500, Kyrgyzstan
4 Al-Rafidain University College Baghdad 10064, Iraq
5 Madenat Alelem University College, Baghdad 10006, Iraq
Abstract
ABSTRACT
Background: Machine learning (ML) has developed significantly over the years, changing several industries through the use of automation and Big Data. By building better next-generation machine learning models, AI’s future has the potential of improving on existing problematic methods such as scalability, interpretability, and generalization.
Objective: This article examines about how new generation of ML models are developed and used to explain about the capabilities of AI in different fields. In particular, it is focused on changes in structural models, certain methods of training them, and the application of brand-new technologies as quantum computing.
Methods: A review of the state of the art and several case studies were carried out with regard to the latest work being done on different types of ML algorithms such as transformer models, reinforcement learning, and Neural Architecture Search. Moreover, the given models were tested in experiments concerning the applicability of these models in tasks including image recognition, natural language processing, and in autonomous systems.
Results: The next-gen models, thereby outperformed the traditional models in terms of accuracy, computational speed, and flexibility. The identified benefits were decreased training time, better interpretability, and better performance with multi-modal and cross-domain tasks.
Conclusion: These new generation of ML models are the game changers in AI development solving previous challenges while providing opportunities across numerous sectors. In this vein, further research in this field is needed to achieve AI’s solving of problems.
Keywords

References

Alnuaemy, L. M. (2023). Peculiarities of using neuro-linguistic programming for the rehabilitation of servicemen who were in armed conflicts.  Development of Transport Management and Management Methods, 3 (84), 40-55. https://doi.org/:10.31375/2226-1915-2023-3-40-55
Antunes, R. S., Costa, C. A. d., Küderle, A., Yari, I. A., and Eskofier, B. (2022). Federated Learning for Healthcare: Systematic Review and Architecture Proposal.  ACM Trans. Intell. Syst. Technol., 13 (4). https://doi.org/:10.1145/3501813
Aouedi, O., Sacco, A., Piamrat, K., and Marchetto, G. (2023). Handling Privacy-Sensitive Medical Data With Federated Learning: Challenges and Future Directions.  IEEE Journal of Biomedical and Health Informatics, 27 (2), 790-803. https://doi.org/:10.1109/JBHI.2022.3185673
Chitty-Venkata, K. T., Emani, M., Vishwanath, V., and Somani, A. K. (2022). Neural Architecture Search for Transformers: A Survey.  IEEE Access, 10, 108374-108412. https://doi.org/:10.1109/ACCESS.2022.3212767
Coyle, B. (2022). Machine learning applications for noisy intermediate-scale quantum computers.  arXiv preprint. 2205.09414. https://doi.org/:10.48550/arXiv.2205.09414
Fjellström, C. (2022). Long Short-Term Memory Neural Network for Financial Time Series. IEEE International Conference on Big Data (Big Data), 17-20 Dec. https://doi.org/:10.1109/BigData55660.2022.10020784.
Habib, A., Karmakar, C., and Yearwood, J. (2023). Interpretability and Optimisation of Convolutional Neural Networks Based on Sinc-Convolution.  IEEE Journal of Biomedical and Health Informatics, 27 (4), 1758-1769. https://doi.org/:10.1109/JBHI.2022.3185290
Hashim, N., Mohsim, A., Rafeeq, R., and Pyliavskyi, V. (2020). Color correction in image transmission with multimedia path.  ARPN Journal of Engineering and Applied Sciences, 15 (10), 1183-1188. https://www.arpnjournals.org/jeas/research_papers/rp_2020/jeas_0520_8215.pdf
Hodson, T. O. (2022). Root-mean-square error (RMSE) or mean absolute error (MAE): when to use them or not.  Geosci. Model Dev., 15 (14), 5481-5487. https://doi.org/:10.5194/gmd-15-5481-2022
Innan, N., Khan, M. A.-Z., and Bennai, M. (2023). Financial fraud detection: A comparative study of quantum machine learning models.  International Journal of Quantum Information, 22 (02), 2350044. https://doi.org/:10.1142/S0219749923500442
Jawad, A. M., Qasim, N. H., and Pyliavskyi, V. (2022). Comparison of Metamerism Estimates in Video Paths using CAM's Models. IEEE 9th International Conference on Problems of Infocommunications, Science and Technology (PIC S&T), 10-12 Oct. https://doi.org/:10.1109/PICST57299.2022.10238685.
Langenberger, B., Schulte, T., and Groene, O. (2023). The application of machine learning to predict high-cost patients: A performance-comparison of different models using healthcare claims data.  PLOS ONE, 18 (1), e0279540. https://doi.org/:10.1371/journal.pone.0279540
Lazcano, A., Herrera, P. J., and Monge, M. (2023). A Combined Model Based on Recurrent Neural Networks and Graph Convolutional Networks for Financial Time Series Forecasting. Mathematics, 11 (1). https://doi.org/:10.3390/math11010224.
Li, Y., Wang, J., Dai, X., Wang, L., Yeh, C. C. M., Zheng, Y., Zhang, W., et al. (2023). How Does Attention Work in Vision Transformers? A Visual Analytics Attempt.  IEEE Transactions on Visualization and Computer Graphics, 29 (6), 2888-2900. https://doi.org/:10.1109/TVCG.2023.3261935
Mei, X., Liu, Z., Robson, P. M., Marinelli, B., Huang, M., Doshi, A., Jacobi, A., et al. (2022). RadImageNet: An Open Radiologic Deep Learning Research Dataset for Effective Transfer Learning.  Radiology: Artificial Intelligence, 4 (5), e210315. https://doi.org/:10.1148/ryai.210315
Mijwil, M. M. (2022). Has the Future Started? The Current Growth of Artificial Intelligence, Machine Learning, and Deep Learning.  Iraqi Journal for Computer Science and Mathematics, 3 (1), 115-123. https://doi.org/:10.52866/ijcsm.2022.01.01.013
Mushtaq, A.-S., Ali Ihsan, A.-A., and Qasim, N. (2015). 2D-DWT vs. FFT OFDM Systems in fading AWGN channels.  Radioelectronics and Communications Systems, 58 (5), 228-233. https://doi.org/:10.3103/S0735272715050052
Myrzashova, R., Alsamhi, S. H., Shvetsov, A. V., Hawbani, A., and Wei, X. (2023). Blockchain Meets Federated Learning in Healthcare: A Systematic Review With Challenges and Opportunities. IEEE Internet of Things Journal, 10 (16), 14418-14437. https://doi.org/:10.1109/JIOT.2023.3263598
Naresh, V. S., and Thamarai, M. (2023). Privacy-preserving data mining and machine learning in healthcare: Applications, challenges, and solutions.  WIREs Data Mining and Knowledge Discovery, 13 (2), e1490. https://doi.org/:10.1002/widm.1490
Pagano, T. P., Loureiro, R. B., Lisboa, F. V. N., Peixoto, R. M., Guimarães, G. A. S., Cruz, G. O. R., Araujo, M. M., et al. (2023). Bias and Unfairness in Machine Learning Models: A Systematic Review on Datasets, Tools, Fairness Metrics, and Identification and Mitigation Methods. Big Data and Cognitive Computing, 7 (1), 15. https://doi.org/:10.3390/bdcc7010015
Qasim, N., and Fatah, O. (2022). The role of cyber security in military wars.  V International Scientific and Practical Conference: Problems of cyber security of information and telecommunication systems" (PCSITS). October 27 - 28, Kyiv, Ukraine.
Qasim, N. H., Vyshniakov, V., Khlaponin, Y., and Poltorak, V. (2021). Concept in information security technologies development in e-voting systems.  International Research Journal of Modernization in Engineering Technology and Science (IRJMETS), 3 (9), 40-54.
Reena, M. J. (2023). Preprocessing Big Data using Partitioning Method for Efficient Analysis. IEEE International Conference on Contemporary Computing and Communications (InC4), 21-22 April. https://doi.org/:10.1109/InC457730.2023.10262924.
Sandamirskaya, Y., Kaboli, M., Conradt, J., and Celikel, T. (2022). Neuromorphic computing hardware and neural architectures for robotics.  Science Robotics, 7 (67), eabl8419. https://doi.org/:10.1126/scirobotics.abl8419
Shamshad, F., Khan, S., Zamir, S. W., Khan, M. H., Hayat, M., Khan, F. S., and Fu, H. (2023). Transformers in medical imaging: A survey.  Medical Image Analysis, 88, 102802. https://doi.org/:10.1016/j.media.2023.102802
Staroverov, A., Gorodetsky, A. S., Krishtopik, A. S., Izmesteva, U. A., Yudin, D. A., Kovalev, A. K., and Panov, A. I. (2023). Fine-Tuning Multimodal Transformer Models for Generating Actions in Virtual and Real Environments. IEEE Access, 11, 130548-130559. https://doi.org/:10.1109/ACCESS.2023.3334791
Tuli, S., and Jha, N. K. (2023). TransCODE: Co-Design of Transformers and Accelerators for Efficient Training and Inference.  IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42 (12), 4817-4830. https://doi.org/:10.1109/TCAD.2023.3283443
Vuong, V.-Q., Cevallos, C., Hourahine, B., Aradi, B., Jakowski, J., Irle, S., and Camacho, C. (2023). Accelerating the density-functional tight-binding method using graphical processing units.  The Journal of Chemical Physics, 158 (8), 084802. https://doi.org/:10.1063/5.0130797
Xu, P., Zhu, X., and Clifton, D. A. (2023). Multimodal Learning With Transformers: A Survey.  IEEE Transactions on Pattern Analysis and Machine Intelligence, 45 (10), 12113-12132. https://doi.org/:10.1109/TPAMI.2023.3275156