Hello, I am currently conducting an analysis to identify lncRNA in a human cell line. I have used STAR-cufflinks-cuffmerge pipeline and successfully identified novel lncRNAs.
What I want to ask is: how do I identify known lncRNAs, not novel ones? In one paper, I've seen that they used blastn to identify known lncRNAs in sheep samples by blasting against a database called ALDB.(A domestic animal long noncoding RNA database) Can the same method be applied to human cell lines or mouse cell lines to identify known lncRNAs? Or are there other methods to find known lncRNAs?
Thanks!
Known lncRNAs (such as Xist and Malat1) already exist in the standard human genome reference's GTF annotation.
If you just align to the human genome reference with STAR, you'll already have the expression levels of those lncRNAs.
Is there something else you're asking about? Edit: Ah, seems like you want to identify it from the output of cufflinks-cuffmerge. The answer below should give you that.