Open Access Open Access  Restricted Access Subscription Access

Detection of Fake Accounts on Social Media using Machine Learning Techniques

Srusti Agre, Shivani Reddy, Srushti Yerole, Pallavi ., Dr. Dayanand J

Abstract


The rise of fictitious social media accounts has proven detrimental to user privacy, information authenticity, and public dialogue. Fake accounts, created and managed by bots or bad actors, are easily used for spreading false information, influencing and deceiving people, or committing other scams. This paper focuses on the use of machine learning approaches on the problem of detecting fake accounts on social media. We examine posting behaviours, language use, followed and following user identity ratios, and several metrics of network connectivity as both content and behaviour features. Ensemble learning, deep learning, and graph-based techniques like Graph Neural Networks (GNNs) are employed alongside supervised techniques like Support Vector Machines (SVM), Random Forests, and Logistic Regression. Ensemble and graph-based model results proved to be more robust and accurate than traditional classifiers. This research illustrates the need for multi-criteria and sophisticated learning approaches, which greatly facilitates scalable detection of fake accounts in uncontrollable social media settings.

 


Full Text:

PDF

References


Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297. https://doi.org/10.1007/BF00994018.

Breiman, L. (2001). Random Forests. Machine Learning, 45(1),5–32.https://doi.org/10.1023/A:1010933404324.

Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press. http://www.deeplearningbook.org/.

Pedregosa, F., Varoquaux, G., Gramfort, A., et al. (2011). Scikit- learn: Machine Learning in Python. Journal of Machine Learning Research, 12,2825–2830. https://jmlr.csail.mit.edu/papers/v12/pedregosa11a.html.

Chollet, F. (2015). Keras: Deep Learning library for Theano and TensorFlow.https://keras.io/.

Abadi, M., et al. (2016). TensorFlow: A system for large- scale machine learning. 12th USENIX Symposium on Operating Systems Design and Implementation.https://www.usenix.org/system/files/confer ence/osdi16/osdi16- abadi.pdf.

Ng, A. Y. (2004). Feature selection, L1 vs. L2 regularization, and rotational invariance. Proceedings of the 21st International Conference on Machine Learning (ICML).

Zhang, J., Luo, L., & Zhang, C. (2016). Detecting Fake Accounts in Online Social Networks. In 2016 IEEE International Conference on Big Data (Big Data). https://doi.org/10.1109/BigData.2016.7840755.

Scikit-learn documentation. (n.d.). https://scikit- learn.org/stable/TensorFlow documentation. (n.d.).


Refbacks

  • There are currently no refbacks.