Dear Biostars,
I have searched my transcripts (the longest isoform of each gene from RNA-seq data) with MISA to report any potential SSRs. In total, 93022 sequences contain an SSR.
Q: How can I find out how many of these sequences/transcripts contain an ORF?
Thanks
NOTE:
I have also used TransDecoder to predict ORFs for all of my transcripts, but I cannot check all 93022 IDs against the TransDecoder results manually.
Can I collect the IDs of the SSR-containing transcripts in a text file and look up their counterparts in the Trinity.fasta.transdecoder.pep file using Linux command-line tools such as grep -F -f?
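For illustration, here is a minimal Python sketch of the lookup I have in mind. It rests on two assumptions that I have not verified for every TransDecoder version: the first word of each header in the .pep file is the transcript ID followed by a suffix like ".p1", and the ID file holds one transcript ID per line. The function names and the file name ssr_ids.txt are hypothetical. Comparing whole IDs this way also avoids a pitfall of plain substring matching, where an ID ending in "_i1" would also hit "_i10".

```python
import re


def transcripts_with_orf(ssr_ids, pep_lines):
    """Return the SSR transcript IDs that have at least one predicted ORF.

    ssr_ids   -- iterable of lines, one transcript ID per line
    pep_lines -- iterable of lines from the TransDecoder .pep FASTA file
    """
    wanted = {line.strip() for line in ssr_ids if line.strip()}
    found = set()
    for line in pep_lines:
        if not line.startswith(">"):
            continue  # sequence line, not a header
        fields = line[1:].split()
        if not fields:
            continue  # empty header, nothing to compare
        # Assumption: the ORF ID is "<transcript ID>.p<number>"
        transcript_id = re.sub(r"\.p\d+$", "", fields[0])
        if transcript_id in wanted:
            found.add(transcript_id)
    return found


def count_from_files(ids_path, pep_path):
    """Count SSR-containing transcripts with an ORF, reading both files."""
    with open(ids_path) as ids, open(pep_path) as pep:
        return len(transcripts_with_orf(ids, pep))
```

Calling count_from_files("ssr_ids.txt", "Trinity.fasta.transdecoder.pep") would then give the number of SSR-containing transcripts with at least one predicted ORF. A transcript with several ORFs is counted only once.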