Journal of Advances in Information Technology, Vol 2, No 4 (2011), 199-203, Nov 2011
doi:10.4304/jait.2.4.199-203

Building Machine Learning Based Senti-word Lexicon for Sentiment Analysis

Alaa Hamouda, Mahmoud Marei, Mohamed Rohaim

Abstract


Sentiment analysis involves classifying opinions in text into categories like "positive" or "negative". One of approaches used to make sentiment classification is using sentiment lexicon. This paper aims to build a sentiment lexicon which is domain independent. We propose a Machine Learning Based Senti-word Lexicon (MLBSL) based on the Amazon data set which contains reviews from different domains. Our proposed MLBSL yields an improvement over previous published manual and automatic-built lexicons like SentiWordNet. We also provide an improvement in calculation method used in reviews sentiment analysis.



Keywords


Sentiment Analysis, Sentiment Lexicon, Machine Learning

References


B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up? Sentiment Classification using Machine Learning Techniques. Proceedings of EMNLP, 2002.

T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis. Proceedings of HLT/EMNLP, Vancouver, Canada, 2005.

F. Salvetti, S. Lewis and C. Reichenbach. Automatic Opinion Polarity Classification of Movie Reviews. Colorado Research in Linguistics. Volume 17, Issue 1 (June 2004). Boulder: University of Colorado.

A. Kennedy and D. Inkpen. Sentiment Classification of Movie Reviews Using Contextual Valence Shifters. Computational Intelligence, Vol. 22, 110–125, 2006.
http://dx.doi.org/10.1111/j.1467-8640.2006.00277.x

B. Ohana and B. Tierney. Sentiment Classification of Reviews Using SentiWordNet. 9th. IT&T Conference, Dublin Institute of Technology, Dublin, Ireland, 22nd.-23rd. October, 2009.

H. Saggionα and A. Funk. Interpreting SentiWordNet for Opinion Classification. Proceedings of the Seventh conference on International Language Resources and Evaluation LREC10, 2010. A. Funk, Y. Li and H. Saggi, K. Bontchevaon and C. Leibold. Opinion Analysis for Business Intelligence Applications. Proceedings of the first international workshop on Ontology-supported business intelligence, 2008.

W. Du and S. Tan. Building Domain-oriented Sentiment Lexicon by Improved, Information Bottleneck. ACM 978-1-60558-512-3/09/11, 2009

N. Slonim and N. Tishby. Agglomerative information bottleneck. NIPS 1999. G. Qiu, B. Liu, J. Bu and C. Chen. Expanding Domain Sentiment Lexicon through Double Propagation. IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence, 2009. A. Esuli and F. Sebastiani, SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining. Proceedings from International Conference on Language Resources and Evaluation (LREC), Genoa, 2006. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL’02), 2002. A. Hamouda and M. Rohaim. Reviews Classification Using SentiWordNet Lexicon. World Congress on Computer Science and Information Technology, January, 2011.


Full Text: PDF


Journal of Advances in Information Technology (JAIT, ISSN 1798-2340)

Copyright @ 2006-2012 by ACADEMY PUBLISHER – All rights reserved.