Journal of Software, Vol 6, No 12 (2011), 2350-2360, Dec 2011
doi:10.4304/jsw.6.12.2350-2360

An Efficient Hybrid Clustering-PSO Algorithm for Anomaly Intrusion Detection

Hongying Zheng, Meiju Hou, Yu Wang

Abstract


Generally speaking, in anomaly intrusion detection, modeling the normal behavior of activities performed by a user or a program is an important issue. Currently most machine-learning algorithms which are widely used to establish user’s normal behaviors need labeled data for training first, so they are computational expensive and sometimes misled by artificial data. This study proposes a PSO-based optimized clustering method IDCPSO for modeling the normal patterns of a user’s activities which combines an unsupervised clustering algorithm with the PSO technique, PSO algorithm is used to optimize the clustering results and obtain the optimal detection result. IDCPSO needs unlabeled data for training and automatically establishes clusters so as to detect intruders by labeling normal and abnormal groups. The famous KDD Cup 1999 dataset is used to evaluate the proposed system. In addition, we compare the performance of PSO optimization process with GA.


Keywords


PSO; Unsupervised Clustering; Anomaly Intrusion Detection; Optimization

References


[1] Yuebin Bai, Hidetsune Kobayashi. Intrusion detection systems:technology and development.Proceedings of the17 th International Conference on Advanced Information Networking and Applications (AINA’03).

[2] L. Portnoy,E. Eskin,S. Stolfo. Intrusion detection with unlabeled data using clustering. In:Proceedings of ACM CSS Workshop on Data Mining Applied to Security (DMSA-2001) Philadelphia. 2001:5-8.

[3] Nam Hun Park, Sang Hyun Oh, Won Suk Lee. Anomaly intrusion detection by clustering transactional audit streams in a host computer. Information Sciences 180 (2010) 2375–2389.
http://dx.doi.org/10.1016/j.ins.2010.03.001

[4]Shi-Jinn Horng, Ming-Yang Su, Yuan-Hsin Chen, Tzong-Wann Kao, Rong-Jian Chen, Jui-Lin Lai,Citra Dwi Perkasa. A novel intrusion detection system based on hierarchical clustering and support vector machines. Expert Systems with Applications 38 (2011) 306–313.

[5] Haidong Yang, Chunsheng Li, Jue Hu. RFID intrusion detection with possibilistic fuzzy c-Means clustering.Journal of Computational Information Systems, v 6, n 8, p 2623-2632, August 2010.

[6] Liang Hu, Nurbol, Xiaobo Liu, Kuo Zhao. A time stamped clustering method for intrusion detection. Journal of Information and Computational Science, v 7, n 2, p 399-406, February 2010.

[7]Panda, Mrutyunjaya,Patra, Manas Ranjan. A hybrid clustering approach for network intrusion detection using cobweb and FFT. Journal of Intelligent Systems, v 18, n 3, p 229-245, 2009.

[8] Pavel Laskov, Patrick D¨ussel, Christin Sch¨afer and Konrad Rieck. Learning intrusion detection:supervised or unsupervised?. 12489 Berlin, Germany.

[9]Y. G. Liu, K. F. Chen, X. F. Liao, Wei Zhang. A genetic clustering method for intrusion detection. Pattern Recognition. 37 (2004):927–942.
http://dx.doi.org/10.1016/j.patcog.2003.09.011

[10]Y. H. Liao, V. R. Vemuri. Use of K-nearest neighbor classifier for intrusion detection. Computers Security 2002;21:439–448.
http://dx.doi.org/10.1016/S0167-4048(02)00514-X

[11]C. F. Tsai, C. Y. Lin. A triangle area based nearest neighbors approach to intrusion detection. Pattern Recognition, In Press, Corrected Proof, Available online 3 June 2009.

[12] W. H. Chen, S. H. Hsu, H. P. Shen. Application of SVM and ANN for intrusion detection. Computers & Operations Research, Volume 32, Issue 10, October 2005: 2617-2634.
http://dx.doi.org/10.1016/j.cor.2004.03.019

[13] WE1 Yu-xin, WU Mu-qing. KFDA and clustering based Detection. The journal of china universities of posts and telecommunications. Volume 15, Issue 1, March 2008, pages: 123-128.

[14] Yang Yi, Jiansheng Wu, Wei Xu. Incremental SVM based on reserved set for network intrusion detection. Expert Systems with Applications, 38 (2011):7698-7707.
http://dx.doi.org/10.1016/j.eswa.2010.12.141

[15] Yao, Yu, Yang, Wei; Gao, Fu-Xiang; Yu, Ge. Anomaly intrusion detection approach using hybrid MLP/CNN neural network. In proceedings - ISDA 2006: Sixth International Conference on Intelligent Systems Design and Applications, 2006, v2, pages:1095-1102.
http://dx.doi.org/10.1109/ISDA.2006.253765

[16] Xiaobai Li. A scalable decision tree system and its application in pattern recognition and intrusion detection. Decision Support Systems 41 (2005):112–130.
http://dx.doi.org/10.1016/j.dss.2004.06.016

[17] M. Amini, R. Jalili, H. R. Shahriari. RT-UNNID: A practical solution to real-time network-based intrusion detection using unsupervised neural networks. Computers & Security, Volume 25, Issue 6, September 2006:459-468.
http://dx.doi.org/10.1016/j.cose.2006.05.003

[18] Günes Kayacik, H. Nur Zincir-Heywood, A.; Heywood, Malcolm I. On the Capability of an SOM based Intrusion Detection System. Proceedings of the International Joint Conference on Neural Networks, v 3, pages:1808-1813, 2003.
http://dx.doi.org/10.1109/IJCNN.2003.1223682

[19] H.G. Kayacik, A.N. Zincir-Heywood, M.I. Heywood, A hierarchical SOM-based intrusion detection system, Engineering Applications of Artificial Intelligence 20 (4) (2007) 439–451.
http://dx.doi.org/10.1016/j.engappai.2006.09.005

[20] X. B. Tan, H. S. Xi. Hidden semi-Markov model for anomaly detection. Applied Mathematics and Computation, Volume 205, Issue 2, 15 November 2008:562-567.
http://dx.doi.org/10.1016/j.amc.2008.05.028

[21] S. H. Oh, W. S. Lee. An anomaly intrusion detection method by clustering normal user behavior. Computers & Security. 2003.22(7): 596-612.

[22] K. Leung, et al. Unsupervised Anomaly Detection in Network Intrusion Detection Using Clusters. Proceedings of the Twenty-eighth Australasian conference on Computer Science, 2005:333-342.

[23] L. Khan, M. Awad, B. Thuraisingham, A new intrusion detection system using support vector machines and hierarchical clustering, The VLDB Journal 16 (2007): 507–521.
http://dx.doi.org/10.1007/s00778-006-0002-5

[24] X. Cheng, P. C. Yong, L. S. Meng. Design of multiple-level hybrid classifier for intrusion detection system using Bayesian clustering and decision trees. Pattern Recognition Letters. Volume 29, Issue 7 (May 2008) :918-924.

[25] Hongying Zheng, Meiju Hou, Yu Wang. Application of Particle Swarm Optimization to Clustering for Intrusion Detection. The proceeding of 3rd international symposium on parallel architectures, algorithms and programming. Dalian, China,18-20 december 2010,Pages:221-228.

[26]D. E. Brown. C. L. Huntley. A Practical Application of Simulated Annealing to Clustering. Pattern Recognition,1992, 25(4):401-412.
http://dx.doi.org/10.1016/0031-3203(92)90088-Z

[27]R. Eberhart, J. Kennedy. A new optimizer using particle swarm theory. In: Proceedings of the sixth international symposium on micromachine and human science. Nagoya. 1995:39-43.

[28]J. Kennedy, R. Eberhart. Particle swarm optimization. In: Proceedings IEEE international conference on neural networks. Perth. 1995.1942-1948.

[29]Y.S. Jiang, J.X. Wang, H. Z. Yang. Attribute Discretization for Decision System Based on Binary Particle Swarm Optimization. Control Engineering of China:2008 V0l.15, No.4:360-363.

[30]C. J. Liao, C. T. Tseng, P. Luarn. A discrete version of particle swarm optimization for flowshop scheduling problems. Computers & Operations Research, Volume 34, Issue 10, October 2007:3099-3111.
http://dx.doi.org/10.1016/j.cor.2005.11.017

[31]M. Maitra, A. Chatterjee. A hybrid cooperative–comprehensive learning based PSO algorithm for image segmentation using multilevel thresholding. Expert Systems with Applications, Volume 34, Issue 2, February 2008:1341-1350.
http://dx.doi.org/10.1016/j.eswa.2007.01.002

[32]H. Pan, L. Wang, B. Liu. Particle swarm optimization for function optimization in noisy environment. Applied Mathematics and Computation, Volume 181, Issue 2, 15 October 2006: 908-919.
http://dx.doi.org/10.1016/j.amc.2006.01.066

[33]T. K. Rasmussen, T. Krink. Improved Hidden Markov Model training for multiple sequence alignment by a particle swarm optimization—evolutionary algorithm hybrid. Biosystems, Volume 72, Issues 1-2, November 2003:5-17.

[34]P. Y. Yin, S. S. Yu, P. P. Wang, Y. T. Wang. A hybrid particle swarm optimization algorithm for optimal task assignment in distributed systems. Computer Standards & Interfaces 28 (2006): 441–450.
http://dx.doi.org/10.1016/j.csi.2005.03.005

[35]Lincoln Labs, KDD-cup data set. http://kdd.ics.uci.edu/databases/kddcup99.html.

[36]Louis Gosselin, Maxime Tye-Gingras, François Mathieu-Potvin. Review of utilization of genetic algorithms in heat transfer problems. International Journal of Heat and Mass Transfer, Volume 52, Issues 9-10, April 2009, Pages 2169-2188.

[37] N.F. Wang, K. Tai. Target matching problems and an adaptive constraint strategy for multiobjective design optimization using genetic algorithms. Computers & Structures, Volume 88, Issues 19-20, October 2010, Pages 1064-1076.

[38]Xiao-Ping Zeng, Yong-Ming Li, Jian Qin. A dynamic chain-like agent genetic algorithm for global numerical optimization and feature selection. Neurocomputing, Volume 72, Issues 4-6, January 2009, Pages 1214-1228.


Full Text: PDF


Journal of Software (JSW, ISSN 1796-217X)

Copyright @ 2006-2012 by ACADEMY PUBLISHER – All rights reserved.