Entering edit mode
4.4 years ago
the_cowa
▴
40
I have a fasta file which contains coding sequences as like this
>Seq1
ATGAATGACAAAATTAA
GTTATATTTAAA
TCCTTAATAACTTACCAGAAGAGACTAATGA
CATCTCATGCCATGAAG
I need to find Reverse Complement of each line and concatenate as like this
>Seq1
TTAATTTTGTCATTCATTTTAAATATAACTCATTAGTCTCTTCTGGTAAGTTATTAAGGACTTCATGGCATGAGATG
Normally "cat fasta |tr ACGTacgt TGCAtgca | rev" will Reverse Complement but here I need to find for each CDS
http://emboss.sourceforge.net/apps/cvs/emboss/apps/transeq.html
How is this relevant for the original question @Pierre? OP is asking for reverse-complementing individual chunks of sequence on each line and then catting them together? It seems to be an odd/very specific request.
I understand the question as :" I want to find all the ORFs in the reverse complement."
I see. You included that as final solution assuming this is done first.