Intrusion Detection System on KDD’99 Dataset with Imbalanced Classes
Anupam Agrawal

Anupam Agrawal*, Department of Computer Science, Maulana Azad National Institute of Technology (MANIT), Bhopal (M.P) India.
Manuscript received on October 26, 2021. | Revised Manuscript received on November 01, 2021. | Manuscript published on December 30, 2021. | PP: 35-38 | Volume-11 Issue-2, December 2021. | Retrieval Number: 100.1/ijeat.B32381211221 | DOI: 10.35940/ijeat.B3238.1211221
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: The paper describes a method of intrusion detection that keeps check of it with help of machine learning algorithms. The experiments have been conducted over KDD’99 cup dataset, which is an imbalanced dataset, cause of which recall of some classes coming drastically low as there were not enough instances of it in there. For Preprocessing of dataset One Hot Encoding and Label Encoding to make it machine readable. The dimensionality of dataset has been reduced using Principal Component Analysis and classification of dataset into classes viz. attack and normal is done by Naïve Bayes Classifier. Due to imbalanced nature, shift of focus was on recall and overall recall and compared with other models which have achieved great accuracy. Based on the results, using a self optimizing loop, model has achieved better geometric mean accuracy.
Keywords: Intrusion detection system, One Hot Encoding, Imbalanced classification, Geometric Mean Accuracy.