Research Article Open Access

Integrating a Lexicon Based Approach and K Nearest Neighbour for Malay Sentiment Analysis

Ahmed Alsaffar1 and Nazlia Omar1
  • 1 FTSM University Kebangsaan Malaysia, Malaysia

Abstract

Sentiment analysis or opinion mining refers to the automatic extraction of sentiments from a natural language text. Although many studies focusing on sentiment analysis have been conducted, there remains a limited amount of studies that focus on sentiment analysis in the Malay language. In this article, a new approach for automatic sentiment analysis of Malay movie reviews is proposed, implemented and evaluated. In contrast to most studies that focus on supervised or unsupervised machine learning approaches, this research aims to propose a new model for Malay sentiment analysis based on a combination of both approaches. We used sentiment lexicons in the new model to generate a new set of features to train a k-Nearest Neighbour (k-NN) classifier. We further illustrated that our hybrid method outperforms the state of-the-art unigram baseline.

Journal of Computer Science
Volume 11 No. 4, 2015, 639-644

DOI: https://doi.org/10.3844/jcssp.2015.639.644

Submitted On: 6 May 2015 Published On: 7 July 2015

How to Cite: Alsaffar, A. & Omar, N. (2015). Integrating a Lexicon Based Approach and K Nearest Neighbour for Malay Sentiment Analysis. Journal of Computer Science, 11(4), 639-644. https://doi.org/10.3844/jcssp.2015.639.644

  • 3,773 Views
  • 2,587 Downloads
  • 11 Citations

Download

Keywords

  • Malay Sentiment Analysis
  • Feature Extraction
  • Machine Learning
  • Combinations Techniques