ارائه چارچوبی جامع از ویژگی‌های مؤثر در تشخیص اخبار جعلی: یک مرور نظام‌مند

مظاهری, الهام; مهرآیین, محمد; کاظمی, مصطفی; غیاثی شیرازی, کمال‌الدین

doi:10.22034/jipm.2023.709085

ارائه چارچوبی جامع از ویژگی‌های مؤثر در تشخیص اخبار جعلی: یک مرور نظام‌مند

نوع مقاله : مقاله پژوهشی

نویسندگان

الهام مظاهری ¹

محمد مهرآیین ²

مصطفی کاظمی ³

کمال‌الدین غیاثی شیرازی ⁴

¹ دانشجوی دکتری مدیریت فناوری اطلاعات، گروه مدیریت، دانشکده علوم اداری و اقتصادی، دانشگاه فردوسی مشهد، مشهد، ایران

² دکتری سیستم های اطلاعاتی مدیریت، استاد؛ گروه مدیریت؛ دانشکده علوم اداری و اقتصادی؛ دانشگاه فردوسی مشهد؛ مشهد، ایران

³ استاد؛ گروه مدیریت؛ دانشکده علوم اداری و اقتصادی؛ دانشگاه فردوسی مشهد؛ مشهد، ایران

⁴ استادیار؛ گروه مهندسی کامپیوتر؛ دانشکده مهندسی؛ دانشگاه فردوسی مشهد؛ مشهد، ایران

10.22034/jipm.2023.709085

چکیده

در سال‌های اخیر، با توسعه سریع و افزایش محبوبیت رسانه‌های اجتماعی، شاهد رشد گسترده‌ای در حجم و تنوع اخبار جعلی بوده‌ایم. این پدیده اثرات عمیقی بر روی افراد و جامعه دارد. راستی‌آزمایی روشی است که به‌طوری گسترده برای مقابله با اثرات منفی انتشار اخبار جعلی استفاده می‌شود. اما این روش هنگام تحلیل حجم عظیم اطلاعات کارآمد نیست. بنابراین از مدل‌های پیشرفته یادگیری ماشین و رویکردهای مبتنی ‌بر ویژگی برای شناسایی خودکار اخبار جعلی استفاده می‌شود. در همان حال، تعداد زیاد مدل‌ها و ناهمگونی ویژگی‌های مورد استفاده در ادبیات، اغلب محدودیت‌هایی برای محققانی که سعی در بهبود عملکرد مدل دارند، ایجاد می‌کند. به ‌همین دلیل، در مطالعه حاضر با روش مرور نظام‌مند، چارچوبی جامع از ویژگی‌هایی که در تشخیص اخبار جعلی استفاده می‌شوند، ارائه شده ‌است. به‌منظور انجام این مرور نظام‌مند با استفاده از راهنمای ارائه‌شده توسط «اوکولی» و «اسچابرام»، کلیه مطالعات انجام‌گرفته در حوزه اخبار جعلی با استفاده از کلیدواژه‏های مرتبط، از پایگاه داده‎های ScienceDirect، Springer، Emerald، IEEE، ACM، Wiley، Sage، JSTOR، Taylor و WOS استخراج شد و سرانجام، تعداد 72 مقاله مرتبط مورد بررسی قرار گرفت. در نتیجه تحلیل مقالات مرتبط، ویژگی‌ها در دو طبقه اصلی محتوای خبر و زمینه خبر قرار گرفتند. محتوای خبر شامل ویژگی‌های زبانی و معنایی، ویژگی‌های بصری و ویژگی‌های مبتنی‌ بر سبک هستند. زمینه خبر نیز شامل ویژگی‌های مبتنی ‌بر کاربر، پست و شبکه است. نتایج به‌دست‌آمده نشان داد که پرکاربردترین ویژگی‌ها در تشخیص اخبار جعلی ویژگی‌های مبتنی ‌بر پروفایل کاربر، ویژگی‌های سبک آماری، الگوی نوشتاری و خوانایی هستند. با توجه به تنوع بالای ویژگی‌های موجود پیشنهاد می‌شود ارزیابی گسترده‌ای از ویژگی‌ها، مدل‌ها و همچنین عملکرد ‌آنها در مجموعه داده‌های متعدد انجام شود و از این طریق عملکرد مدل‌ها و مجموعه ویژگی‌های مختلف مقایسه شود تا بهترین ترکیب ویژگی در شرایط مختلف مشخص گردد.

کلیدواژه‌ها

اخبار جعلی

ویژگی

ویژگی‌های مبتنی ‌بر محتوا

ویژگی‌های مبتنی ‌بر زمینه اجتماعی

مرور نظام‌مند

موضوعات

دیگر زمینه‌ها، موضوع‌ها، و فناوری‌های نوین در پردازش و مدیریت اطلاعات و دانش علمی و فناورانه

عنوان مقاله English

Presenting a Comprehensive Framework of Effective Features in Fake News Detection: a Systematic Review

نویسندگان English

Elham Mazaheri ¹

Mohammad Mehraeen ²

Mostafa Kazemi ³

Kamaledin Ghiasi Shirazi ⁴

¹ PhD Candidate in Information Technology Management; Department of Management; Faculty of Economics and Administrative Sciences; Ferdowsi University of Mashhad; Mashhad, Iran

² PhD in Management Information Systems; Professor; Department of Management; Faculty of Economics and Administrative Sciences; Ferdowsi University of Mashhad; Mashhad, Iran

³ Professor; Department of Management, Faculty of Economics and Administrative Sciences, Ferdowsi University of Mashhad, Mashhad, Iran

⁴ Assistant Professor; Department of Computer Engineering; Faculty of Engineering; Ferdowsi University of Mashhad; Mashhad, Iran

چکیده English

Over recent years, with the rapid development and increasing popularity of social media, we have seen a massive growth in the volume and variety of fake news. This phenomenon has profound effects on individuals and society. Verification is a widely used method to counter the negative effects of fake news. But this method is not efficient when analyzing huge amount of data. Therefore, advanced machine learning models and feature-based approaches are used to automatically identify fake news. At the same time, the large number of models and the heterogeneity of features used in the literature often create limitations for researchers trying to improve model performance. For this reason, in the present study a comprehensive framework of the features used in the detection of fake news is presented with a systematic review method. In order to carry out this systematic review, using the guide provided by Okoli and Schabram, all studies conducted in the field of fake news using related keywords were taken from ScienceDirect, Springer, Emerald, IEEE, ACM, Wiley, Sage databases, JSTOR, Taylor and WOS and finally 72 related articles were analyzed. As a result of the analysis of related articles, the features were placed in two main categories of news content and news context. News content includes linguistic and semantic features, visual features and style-based features. The news context also includes features based on user, post and network. The obtained results showed that the most used features in detecting fake news are features based on user profile, features of statistical stylistic, writing pattern and readability. Due to the high variety of available features, it is suggested that a wide evaluation of features, models and their performance in multiple data sets should be done and in this way the performance of different models and feature sets should be compared in order to find the best combination of features in different conditions to be determined.

کلیدواژه‌ها English

Fake News

Feature

Content-Based Features

Social Context-Based Features

Systematic Review

Aïmeur, Esma, Sabrine Amri, and Gilles Brassard. 2023. Fake news, disinformation and misinformation in social media: a review. Social Network Analysis and Mining, 13 (1): 30.

Aljabri, Malak, Dorieh M Alomari, and Menna Aboulnour. 2022. Fake News Detection Using Machine Learning Models. In 2022 14th International Conference on Computational Intelligence and Communication Networks (CICN), 473-77. IEEE.

Anoop, K., P. Manjary Gangan, P. Deepak, and VL Lajish. 2019. Leveraging heterogeneous data for fake news detection, Linking and mining heterogeneous and multi-view data. ? 229-64.

Bondielli, Alessandro, and Francesco Marcelloni. 2019. A survey on fake news and rumour detection techniques. Information Sciences 497: 38-55.

Capuano, Nicola, Giuseppe Fenza, Vincenzo Loia, and Francesco David Nota. 2023. Content Based Fake News Detection with machine and deep learning: a systematic review. Neurocomputing 530 (1): 91-103.

Choudhary, Anshika, and Anuja Arora. 2021. Linguistic feature based learning model for fake news detection and classification, Expert Systems with Applications, 169: 114171.

Cohen, Jacob. 1960. A coefficient of agreement for nominal scales. Educational and psychological measurement 20: 37-46.

Cooke, Nicole A. 2017. Posttruth, truthiness, and alternative facts: Information behavior and critical information consumption for a new age. The library quarterly 87: 211-21.

Fake news. https://en.wikipedia.org/wiki/Fake_news. (Accessed May 16, 2018)

Ferrara, Emilio, and Zeyao Yang. 2015. Quantifying the effect of sentiment on information diffusion in social media. PeerJ Computer Science 1: e26 https://doi.org/10.7717/peerj-cs.26.

Figueira, Álvaro, and Luciana Oliveira. 2017. The current state of fake news: challenges and opportunities. Procedia Computer Science 121: 817-25.

Gupta, Aditi, Hemank Lamba, Ponnurangam Kumaraguru, and Anupam Joshi. 2013. Faking sandy: characterizing and identifying fake images on twitter during hurricane sandy. In Proceedings of the 22nd international conference on World Wide Web, 729-736.

Janze, Christian, and Marten Risius. 2017. Automatic detection of fake news on social media platforms. PACIS 2017 Proceedings, 261.

Jin, Zhiwei, Juan Cao, Yongdong Zhang, Jianshe Zhou, and Qi Tian. 2016. Novel visual and statistical image features for microblogs news verification. IEEE Transactions on Multimedia 19: 598-608.

Kondamudi, Medeswara Rao, Somya Ranjan Sahoo, Lokesh Chouhan, and Nandakishor Yadav. 2023. A comprehensive survey of fake news in social networks: Attributes, features, and detection approaches, Journal of King Saud University-Computer and Information Sciences 35: 101571.

Okoli, Chitu, and Kira Schabram. 2015. A guide to conducting a systematic literature review of information systems research. Sprouts: Working Papers on Information Systems, 10 (26). http://sprouts.aisnet.org/10-26.

Rapoza, Kenneth. 2017. Can fake news impact the stock market. Forbes News 11.

Shu, Kai, Amy Sliva, Suhang Wang, Jiliang Tang, and Huan Liu. 2017. Fake news detection on social media: A data mining perspective. ACM SIGKDD Explorations Newsletter 19: 22-36.

Shu, K, and H Liu. 2019. Detecting fake news on social media. Synthesis Lectures on Data Mining and Knowledge Discovery 11 (3): 1-129.

Shu, Kai, Deepak Mahudeswaran, and Huan Liu. 2019. FakeNewsTracker: a tool for fake news collection, detection, and visualization. Computational and Mathematical Organization Theory 25: 60-71.

Stieglitz, Stefan, and Linh Dang-Xuan. 2013. Emotions and information diffusion in social media—sentiment of microblogs and sharing behavior. Journal of Management Information Systems 29: 217-48.

Takayasu, Misako, Kazuya Sato, Yukie Sano, Kenta Yamada, Wataru Miura, and Hideki Takayasu. 2015. Rumor diffusion and convergence during the 3.11 earthquake: a Twitter case study, PLoS one, 10: e0121443.

Vlachos, Andreas, and Sebastian Riedel. 2014. Fact checking: Task definition and dataset construction. In Proceedings of the ACL 2014 workshop on language technologies and computational social science, 18-22. Maryland, US.A

Vosoughi, Soroush, Deb Roy, and Sinan Aral. 2018. The spread of true and false news online. scienc, 359: 1146-51.

Zannettou, Savvas, Michael Sirivianos, Jeremy Blackburn, and Nicolas Kourtellis. 2019. The web of false information: Rumors, fake news, hoaxes, clickbait, and various other shenanigans. Journal of Data and Information Quality (JDIQ) 11: 1-37.