RNA family classification using the conditional random fields model
Sitthichoke Subpaiboonkit[a], Chinae Thammarongtham[b] and Jeerayut Chaijaruwanich*[a,b,d]
* Corresponding author; e-mail address: jeerayut@science.cmu.ac.th
Volume: Vol.39 No.1 (JANUARY 2012)
Research Article
DOI:
Received: 21 June 2011, Revised: -, Accepted: 6 October 2011, Published: -
Citation: Subpaiboonkit S., Thammarongtham C. and Chaijaruwanich J., RNA family classification using the conditional random fields model, Chiang Mai Journal of Science, 2012; 39(1): 1-7.
Abstract
RNA family classification is a necessary step in characterizing sequenced genomes. An RNA family is defined by member sequences that perform the same function in different species. Such functions are strongly related to RNA secondary structure rather than to primary sequence, so RNA sequences alone are not sufficient to classify RNA families. Here, we focus on computational RNA family classification, using primary sequences together with RNA secondary structures as the selected features and conditional random fields (CRFs) as the classification method. On the RNA data sets tested, the model achieves best F-scores between 98.77% and 99.32% for different RNA families.
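As an illustration of the idea in the abstract, the following minimal sketch shows one way primary sequence and secondary structure could be combined into per-position features for a sequence model, and how an F-score can be computed from prediction counts. The details are assumptions for illustration, not the feature set or evaluation code of the paper: the structure is assumed to be given in dot-bracket notation, the feature names (`base`, `struct`, `partner_base`) and the function names are hypothetical, and the F-score is assumed to be the balanced F1 measure.

```python
from typing import Dict, List


def pair_table(structure: str) -> List[int]:
    """Map each position to its base-pair partner index (-1 if unpaired).

    Assumes dot-bracket notation: '(' and ')' mark paired bases, and any
    other symbol (typically '.') marks an unpaired base.
    """
    partners = [-1] * len(structure)
    stack: List[int] = []
    for i, symbol in enumerate(structure):
        if symbol == "(":
            stack.append(i)
        elif symbol == ")":
            if not stack:
                raise ValueError(f"unbalanced ')' at position {i}")
            j = stack.pop()
            partners[i], partners[j] = j, i
    if stack:
        raise ValueError(f"unbalanced '(' at position {stack[-1]}")
    return partners


def position_features(sequence: str, structure: str) -> List[Dict[str, str]]:
    """Build one feature dict per nucleotide (hypothetical feature set)."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    partners = pair_table(structure)
    features = []
    for i, base in enumerate(sequence):
        j = partners[i]
        features.append({
            "base": base,                                   # primary sequence
            "struct": structure[i],                         # paired or unpaired
            "partner_base": sequence[j] if j >= 0 else "-", # paired nucleotide
        })
    return features


def f_score(tp: int, fp: int, fn: int) -> float:
    """Balanced F-score (F1) from true/false positive and false negative counts."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

For a toy hairpin such as `position_features("GCAUGC", "((..))")`, each position carries its own base, its structural state, and the base it pairs with. A list of feature dicts of this shape is a common input format for linear-chain CRF toolkits, though the model configuration used in the paper is not described in the abstract.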