False News Detection from Text on Social Media Using K-Nearest Neighbors Bayesian Approach
Author(s):
Ravi Prakash, Shambu Kumar Singh
Keywords:
False News/Information Detection, K-Nearest Neighbours, Bayesian, Word2Vector, Term Frequency-Inverse Document Frequency.
Abstract
The pervasive spread of false news on social media threatens public discourse and societal stability. This study investigates methods for detecting false news in social media text, taking into account the characteristics and challenges particular to these platforms. A diverse dataset is compiled from social media sources including Twitter, Facebook, and Reddit, and consists of both verified true news and identified false news. The preprocessing pipeline is tailored to the noisy, informal nature of social media text and includes tokenization, slang normalization, hashtag processing, and the handling of emojis and special characters. Linguistic features are extracted using term frequency-inverse document frequency (TF-IDF), word embeddings, and contextual embeddings (e.g., BERT, RoBERTa). Network-based features, such as user interactions, repost patterns, and user credibility scores, are added to enrich the feature set. The proposed detection framework uses the K-Nearest Neighbors (KNN) algorithm to identify patterns and similarities in the feature space, and integrates a Bayesian approach to provide probabilistic assessments of news veracity. This hybrid method aims to combine the strength of KNN in capturing local data structure with the probabilistic reasoning of Bayesian models. Performance is evaluated using standard metrics: accuracy, precision, recall, and F1-score. A comparative analysis against traditional machine learning models and deep learning approaches is conducted to validate the effectiveness of the KNN-Bayesian hybrid model.
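As a rough illustration of the pipeline described above, the following minimal sketch combines a TF-IDF based KNN vote with a multinomial naive Bayes posterior. It is not the authors' implementation: the abstract does not state how the two models are combined, so the weighted average in `hybrid_prob_false`, the parameters `k` and `weight`, all function names, and the toy posts in `TEXTS` are assumptions made for illustration. Slang and emoji handling, contextual embeddings, and network-based features are omitted.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase, drop URLs, and split into word tokens ('#tag' becomes 'tag')."""
    text = re.sub(r"https?://\S+", " ", text.lower())
    return re.findall(r"[a-z0-9']+", text)

def fit_idf(docs):
    """Smoothed inverse document frequency for each term in the training docs."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}

def tfidf(doc, idf):
    """L2-normalised sparse TF-IDF vector; terms unseen in training are dropped."""
    tf = Counter(t for t in doc if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {t: v / norm for t, v in vec.items()}

def cosine(a, b):
    """Cosine similarity of two L2-normalised sparse vectors."""
    return sum(v * b.get(t, 0.0) for t, v in a.items())

def knn_prob_false(vec, train_vecs, labels, k):
    """Fraction of the k most similar training posts labelled false (1)."""
    order = sorted(range(len(train_vecs)),
                   key=lambda i: cosine(vec, train_vecs[i]), reverse=True)
    return sum(labels[i] for i in order[:k]) / k

def fit_nb(docs, labels):
    """Per-class term counts, vocabulary, and class priors for naive Bayes."""
    counts = {0: Counter(), 1: Counter()}
    for doc, y in zip(docs, labels):
        counts[y].update(doc)
    vocab = set(counts[0]) | set(counts[1])
    priors = {y: labels.count(y) / len(labels) for y in (0, 1)}
    return counts, vocab, priors

def nb_prob_false(doc, model):
    """Posterior P(false | doc) under multinomial naive Bayes, add-one smoothing."""
    counts, vocab, priors = model
    log_p = {}
    for y in (0, 1):
        total = sum(counts[y].values()) + len(vocab)
        log_p[y] = math.log(priors[y]) + sum(
            math.log((counts[y][t] + 1) / total) for t in doc if t in vocab)
    top = max(log_p.values())
    e = {y: math.exp(lp - top) for y, lp in log_p.items()}
    return e[1] / (e[0] + e[1])

def fit(texts, labels):
    """Build everything both models need from labelled posts (1 = false news)."""
    docs = [tokenize(t) for t in texts]
    labels = list(labels)
    idf = fit_idf(docs)
    return {"idf": idf, "vecs": [tfidf(d, idf) for d in docs],
            "labels": labels, "nb": fit_nb(docs, labels)}

def hybrid_prob_false(text, state, k=3, weight=0.5):
    """ASSUMPTION: combine the two estimates by a simple weighted average."""
    doc = tokenize(text)
    p_knn = knn_prob_false(tfidf(doc, state["idf"]), state["vecs"],
                           state["labels"], k)
    return weight * p_knn + (1 - weight) * nb_prob_false(doc, state["nb"])

def precision_recall_f1(y_true, y_pred):
    """Precision, recall, and F1 for the positive (false news) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# Hypothetical toy posts, invented for this sketch (1 = false, 0 = true).
TEXTS = [
    "Miracle cure doctors hate! Share before they delete this #shocking",
    "SHOCKING secret: celebrity clone exposed, share now",
    "You won't believe this miracle trick, share share share",
    "City council approves budget for road repairs",
    "Central bank holds interest rates steady this quarter",
    "University publishes annual report on student enrollment",
]
LABELS = [1, 1, 1, 0, 0, 0]

state = fit(TEXTS, LABELS)
for post in ["shocking miracle cure, share now",
             "council approves annual budget report"]:
    print(post, "->", round(hybrid_prob_false(post, state), 3))
```

Averaging the two probabilities is only one plausible reading of "hybrid". Other possibilities include feeding the KNN vote to the Bayesian model as a feature, or placing a Bayesian prior on the neighbour labels. On a realistic dataset, `k` and `weight` would need to be tuned on held-out data.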
Experimental results demonstrate that the KNN-Bayesian approach achieves competitive performance, with significant improvements in detection accuracy and robustness compared to baseline models. The incorporation of metadata and contextual information further enhances the model's ability to discern false news from authentic content. Challenges such as the dynamic nature of false news, the need for scalable real-time detection, an
Article Details
Unique Paper ID: 165791

Publication Volume & Issue: Volume 11, Issue 1

Page(s): 1611 - 1620

About Us

IJIRT.org opens doors in research by providing high-quality, open-access research articles.

Send any query related to your research to editor@ijirt.org