Shikha Mundra, Jaiwanth Reddy, Ankit Mundra, Namita Mittal, Ankit Vidyarthi, Deepak Gupta

An Automated Data-driven Machine Intelligence Framework for Mining Knowledge To Classify Fake News Using NLP

  • General Computer Science

The rapid spread of fake news has become a serious concern over the internet. In recent years, social media platforms are widely used for news consumption. These platforms are excellent for their low-cost accessibility and rapid dissemination of news. Contrariwise, it encourages the rapid propagation of ’fake news,’ or low-quality news containing intentionally misleading content. The quick dissemination of fake news has the potential to have devastating consequences for individuals and society as a whole. Therefore, to overcome this problem, this paper proposed an artificial intelligence framework that incorporates ensembles of deep learning features for the classification of fake news. Deep learning approaches such as Multilayer Perceptron (MLP), Convolutional Neural Networks (CNN), and Bidirectional Long Short Term Memory (BILSTM) have been used to extract local and sequential features. To obtain relevant features at the word level, these approaches are initialized using pretrained GLOVE word embedding, which results in, three base learners as GLOVE+MLP, GLOVE+CNN, and GLOVE+BiLSTM. Moreover, to extract features at the sentence level, Bidirectional Encoder Representations from Transformers (BERT) are adopted, which results in, three more base learners as BERT+MLP, BERT+CNN, BERT+BiLSTM. In total, six models are employed as base learners. Later, predictions from the best of these models are ensembled and performance is computed using ensembling techniques. Overall, we have investigated nine ensembling techniques, including weighted voting, bagging, boosting, stacked ensembles like SVC, and logistic regression. The performance is computed using four publicly available datasets regarding the macro average f1-score. We observed that soft weighted voting-based ensemble outperformed other models on three datasets achieving an f1-score of 92.99% (McIntyre), 95.22% (Kaggle), and 78.3% (Gossipcop).

Need a simple solution for managing your BibTeX entries? Explore CiteDrive!

  • Web-based, modern reference management
  • Collaborate and share with fellow researchers
  • Integration with Overleaf
  • Comprehensive BibTeX/BibLaTeX support
  • Save articles and websites directly from your browser
  • Search for new articles from a database of tens of millions of references
Try out CiteDrive

More from our Archive