Hi guys, How do i extract a sequences from a fasta file by taking the start and end position from a gene predicted file: here the example the file with the orf statistics is my predicted file and for example the start position for the first orf is 65 and the end is 213. and the fasta file i'm going to search those position is the other one
my predicted file looks like this
>Seq1 [organism=S.burgodofry...
orf00001 65 213 +1 2.93
orf00002 799 2328 +1 7.09
orf00003 2331 3437 +3 6.09
orf00004 3457 4044 +1 6.15
>Seq2 [organism=S.burgodofry...
orf00001 55 317 +1 2.17
orf00002 206 610 +2 5.28
orf00003 747 2408 +3 4.85
and my fasta sequence sequence look like this:
>Seq1 [organism=S.burgodofry]...
ACTGTAGATGACATGACCAGTACGATACAGAT...
....
........
>Seq2 [organism=.....]
ATGTCGTGACTAGTACGATCAGATCAGAT
.........................
..............
...
You don't say which fields in your gene prediction file correspond to start and end. And that isn't a Fasta file, my friend. What have you tried so far?
my bad, the file with the orf statistics are is my predicted file and for example the start position for the first orf is 65 and the end is 213. and the fasta file i'm going to search those position is the other one