Detection of Hoax Spread in The Whatsapp Group with Lexicon Based and Naive Bayes Classification
Jordan Andrean1, Suharjito2

1Jordan Andrean*, Magister Teknik Informatika, BINUS University , Jakarta, Indonesia.
2Suharjito, Magister Teknik Informatika, BINUS University , Jakarta, Indonesia.

Manuscript received on March 05, 2020. | Revised Manuscript received on March 16, 2020. | Manuscript published on April 30, 2020. | PP: 506-511 | Volume-9 Issue-4, April 2020. | Retrieval Number: C6587029320/2020©BEIESP | DOI: 10.35940/ijeat.C6587.049420
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (

Abstract: Spreading hoax through WhatsApp social media can lead to different beliefs and can cause disputes for those affected. This paper proposes a hybrid model for finding hoaxes in the WhatsApp group using a combination of knowledge-based and machine learning approaches. This Hybrid model combines two methods namely Lexicon based and Naive Bayes Classifier which will be applied to the WhatsApp monitoring application. This research focuses on two main aspects namely word weighting using the lexicon based method and data classification using the Naive Bayes Classifier and Decision tree-j48 methods. The dataset used is conversation data that is crossed from the WhatsApp group. Based on the experiments that have been carried out, it is obtained the results of classification using Naive Bayes classifier of 86.670% data conversation not indicated hoaxes and 13.330% indicated hoaxes. The average value of the percentage of truth obtained more than 75%. The average value of the classification performance evaluation results in a precision value of 0.771, a recall value of 0.754, an F-measure value of 0.773. 
Keywords: WhatsApp monitoring, Hoax, Hybrid Approach, Lexicon based, Naive Bayes Classifier.