Entering edit mode
11.4 years ago
Cara
•
0
Hi, I am new to bioinformatics. I have .fa files from two data sources about lncRNA transcripts. I only want to keep those lncRNA transcripts with overlap on the same strand between these two sources to avoid redundancy. How can I do this? What software should I use to run the .fa files?
you can use EMBOSS-skip-redundant tool to remove redundant reads. Alternatively you can remove redundant sequeces using awk (I generally use awk for these purposes; it's quite fast).