Applying Rough Set Theory for DNA Sequence Database Classification and Reduction

Abd Rahman, Mohd Nordin (2010) Applying Rough Set Theory for DNA Sequence Database Classification and Reduction. International Journal of Biomedical Soft Computing and Human Sciences, 16 (2). pp. 115-124. ISSN 1345-1529

[img] Text
BSCHV16N02_PS14.pdf - Published Version
Restricted to Registered users only

Download (175Kb)


Data classification is a vital task in large scale data mining application. DNA sequences are the basis of life and they encode all the necessary information needed to reproduce life. The size of public DNA sequence databases are growing doubling every year. This situation makes automatic classification and reduction of DNA sequences has become important for effective sequence similarity search problem. A challenge in DNA sequence similarity search is that the sequence record structure does not have any attribute that can be used for implementing classification process. In this paper, by means of filtering process an automaton based exact string matching is employed to generate a special attribute used for DNA sequence database classification and reduction. Rough sets theory provides an indiscernibility relation technique which can be used to classify and reduct the database based on some definition of ‘equivalence’. The generated attribute is used for constructing indiscernibility relation among sequences. With computational implementation, the experiments are executed to investigate the effectiveness of rough sets theory on generating DNA sequence database classification and reduction. Moreover, the experiments will demonstrate that theDNA sequence similarity search performance is significantly improved by using this approach.

Item Type: Article
Keywords: DNA sequence, filtering, rough sets theory, database classification, optimal alignment
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Faculty / Institute: Faculty of Informatics & Computing
Depositing User: Dr Mohd Nordin Abdul Rahman
Date Deposited: 26 May 2014 07:42
Last Modified: 26 May 2014 07:42

Actions (login required)

View Item View Item