A Word Stemming Algorithm for Hausa Language

Muazzam, Muazzam and Rozaimee, Azilawati and Wan Isa, Wan Malini (2015) A Word Stemming Algorithm for Hausa Language. IOSR Journal of Computer Engineering, 17 (3). pp. 25-31. ISSN 2278-0661

Official URL: http://www.iosrjournals.org/IOSR-JCE.html

Abstract

Hausa, a highly inflected language, needs an effective stemming approach for efficient information retrieval (IR). However, studies on stemming for the language are limited or unavailable. Stemming is the systematic reduction of a word to its base or root form. It is a crucial step in natural language processing (NLP) applications such as text summarization and machine translation. This study therefore presents an automatic word stemming system for the Hausa language, with a view to contributing to electronic text processing and to NLP in general. The proposed method is a modification of Porter’s algorithm to fit Hausa morphological rules. The system achieved an accuracy of 73.8% when run on 2573 words extracted from four different articles from the Hausa Leadership newspaper. If improved over time (by handling more exceptional cases in future work), it would inspire the development
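The abstract describes a rule-based stemmer in the style of Porter's algorithm, evaluated by accuracy over a word list. A minimal sketch of that general idea follows. It is not the paper's implementation: the suffix rules, the minimum stem length, the exception list, and the function names `stem` and `accuracy` are all hypothetical placeholders chosen only for illustration, and the sample tokens are toy inputs rather than data from the study.

```python
from typing import Iterable, Sequence, Tuple

# Hypothetical placeholder rules, NOT the rule set from the paper.
# Each rule is (suffix, replacement); rules are tried in the order listed,
# so longer suffixes should come first.
PLACEHOLDER_SUFFIX_RULES: Sequence[Tuple[str, str]] = (
    ("una", ""),
    ("ai", ""),
)

# Assumed guard against over-stemming very short words.
MIN_STEM_LENGTH = 2


def stem(
    word: str,
    rules: Sequence[Tuple[str, str]] = PLACEHOLDER_SUFFIX_RULES,
    exceptions: Iterable[str] = (),
) -> str:
    """Apply the first matching suffix rule that leaves a long enough stem."""
    word = word.lower()
    if word in set(exceptions):
        # Exceptional cases are returned unchanged.
        return word
    for suffix, replacement in rules:
        if word.endswith(suffix):
            candidate = word[: len(word) - len(suffix)] + replacement
            if len(candidate) >= MIN_STEM_LENGTH:
                return candidate
    # No rule applied: the word is treated as already being a stem.
    return word


def accuracy(pairs: Iterable[Tuple[str, str]]) -> float:
    """Percentage of (word, expected_stem) pairs that stem() gets right."""
    pairs = list(pairs)
    if not pairs:
        return 0.0
    correct = sum(1 for word, expected in pairs if stem(word) == expected)
    return 100.0 * correct / len(pairs)


if __name__ == "__main__":
    print(stem("dakuna"))
    print(accuracy([("dakuna", "dak"), ("kai", "k")]))
```

Under the assumption that accuracy is measured per word in this way, the reported 73.8% on 2573 words would correspond to roughly 1,899 correctly stemmed words. The exception list reflects the abstract's remark that handling more exceptional cases is left to future work.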

Item Type: Article
Keywords: Hausa language, Information retrieval, Natural language processing, Stemming.
Subjects: T Technology > T Technology (General)
Faculty / Institute: Faculty of Informatics & Computing
Depositing User: WAN MALINI WAN ISA
Date Deposited: 04 Oct 2015 09:20
Last Modified: 04 Oct 2015 09:20
URI: http://erep.unisza.edu.my/id/eprint/3768
