Simple Rules Malay Stemmer

Syed Abdullah , Engku Fadzli Hasan and Ahmad Saany, Syarilla Iryani and Hassan , Hasni and Mohd Satar , Siti Dhalila (2012) Simple Rules Malay Stemmer. In: The International Conference on Informatics & Applications (ICIA2012), 3 - 5 June 2012, Kuala Terengganu.

[img] Text
Restricted to Registered users only

Download (160Kb)
Official URL:


Stemming is a morphological analysis that tries to associate variants of the same term with a common root form. It is important to improve recall and precision in IR systems. Malay word stemming is considered complicated compared with other languages because of its unique morphological structure. Many research in Malay stemming relies heavily on dictionary which needs higher processing cost and offers lower coverage. This paper presents a stemming approach called UniSZA stemmer which attempts to reduce dictionary dependencies and lower the processing cost by proposing 7 simple rules. Experimental results show that the approach produces higher compression ratio and processing speed compared to RAO and RFO methods

Item Type: Conference or Workshop Item (Paper)
Keywords: Information retrieval, Malay stemming
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
T Technology > T Technology (General)
Faculty / Institute: Faculty of Informatics & Computing
Depositing User: Syarilla Iryani Ahmad Saany
Date Deposited: 19 May 2014 06:26
Last Modified: 19 Aug 2015 04:04

Actions (login required)

View Item View Item