Classifying low confidence miRNA as true and false miRNA
MetadataVis full innførsel
Micro RNAs (miRNAs) are a group of apx. 22 nucleotide (nt) non-coding RNA sequences, playing an important role in gene regulation. The set of known miRNA for humans are grouped as high confidence (HC) miRNA, and low confidence (LC) miRNA. The HC miRNA are considered good, but the set of LC miRNA are may have several false positives. My SVM classifier for classifying LC miRNA shows separate between good and bad LC miRNA. It finds several LC miRNAs which are likely to be true miRNA, and several other LC miRNA which are very likely not to be miRNA. I have also classified candidate miRNA, which does not get better results than the well known mirDeep2. I have also made two new features that are used in the classification, which both separate High confidence and non-hairpin structures, but I cannot prove that using them gives any better results.