Improved Pre-miRNAs Identification Through Mutual Information of Pre-miRNA Sequences and Structures

Fu, Xiangzheng and Zhu, Wen and Cai, Lijun and Liao, Bo and Peng, Lihong and Chen, Yifan and Yang, Jialiang (2019) Improved Pre-miRNAs Identification Through Mutual Information of Pre-miRNA Sequences and Structures. Frontiers in Genetics, 10. ISSN 1664-8021

[thumbnail of pubmed-zip/versions/1/package-entries/fgene-10-00119/fgene-10-00119.pdf] Text
pubmed-zip/versions/1/package-entries/fgene-10-00119/fgene-10-00119.pdf - Published Version

Download (2MB)

Abstract

Playing critical roles as post-transcriptional regulators, microRNAs (miRNAs) are a family of short non-coding RNAs that are derived from longer transcripts called precursor miRNAs (pre-miRNAs). Experimental methods to identify pre-miRNAs are expensive and time-consuming, which presents the need for computational alternatives. In recent years, the accuracy of computational methods to predict pre-miRNAs has been increasing significantly. However, there are still several drawbacks. First, these methods usually only consider base frequencies or sequence information while ignoring the information between bases. Second, feature extraction methods based on secondary structures usually only consider the global characteristics while ignoring the mutual influence of the local structures. Third, methods integrating high-dimensional feature information is computationally inefficient. In this study, we have proposed a novel mutual information-based feature representation algorithm for pre-miRNA sequences and secondary structures, which is capable of catching the interactions between sequence bases and local features of the RNA secondary structure. In addition, the feature space is smaller than that of most popular methods, which makes our method computationally more efficient than the competitors. Finally, we applied these features to train a support vector machine model to predict pre-miRNAs and compared the results with other popular predictors. As a result, our method outperforms others based on both 5-fold cross-validation and the Jackknife test.

Item Type: Article
Subjects: STM Library Press > Medical Science
Depositing User: Unnamed user with email support@stmlibrarypress.com
Date Deposited: 10 Feb 2023 08:57
Last Modified: 01 Aug 2024 07:05
URI: http://journal.scienceopenlibraries.com/id/eprint/446

Actions (login required)

View Item
View Item