Hi, I have a genome file in multi-FASTA format:
>chr1
ATATATACCGA
>chr2
TGGCGTATTAT
...
and I would like to extract specific coordinate ranges from specific sequences. The coordinates are stored in a three-column table (sequence name, start, end):
chr1 2 4
chr2 4 6
chr2 3 6
...
Ideally, the output would collect the extracted sequences in a single file, with the chromosome and coordinates in each header:
>chr1_2_4
TAT
>chr2_4_6
CGT
...
Could anyone help me with this?
Thanks a lot in advance, Florianino
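One possible way to do this in plain Python is sketched below. It is not a definitive solution and rests on a few assumptions: the coordinates are taken as 1-based and inclusive (which is what the example implies, since chr1 2 4 gives TAT), the table is whitespace-separated, and the whole genome fits in memory. The function names (parse_fasta, extract_regions, main) and the file names in the usage line are made up for illustration.

```python
import sys


def parse_fasta(lines):
    """Return {sequence name: sequence} from the lines of a multi-FASTA file."""
    seqs, name, chunks = {}, None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                seqs[name] = "".join(chunks)
            # Keep only the first word of the header as the sequence name.
            header = line[1:].split()
            name, chunks = (header[0] if header else ""), []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        seqs[name] = "".join(chunks)
    return seqs


def extract_regions(seqs, rows):
    """Yield (header, subsequence) for each 'name start end' row.

    Assumption: start and end are 1-based and inclusive.
    """
    for row in rows:
        fields = row.split()
        if len(fields) < 3:
            continue  # skip blank or incomplete rows
        name, start, end = fields[0], int(fields[1]), int(fields[2])
        # Convert the 1-based inclusive range to a Python slice.
        yield f"{name}_{start}_{end}", seqs[name][start - 1:end]


def main(fasta_path, coords_path, out_path):
    with open(fasta_path) as handle:
        seqs = parse_fasta(handle)
    with open(coords_path) as coords, open(out_path, "w") as out:
        for header, sub in extract_regions(seqs, coords):
            out.write(f">{header}\n{sub}\n")


if __name__ == "__main__":
    if len(sys.argv) == 4:
        main(*sys.argv[1:4])
```

It could be run as, for example, python extract_regions.py genome.fa coords.txt regions.fa (all three file names are placeholders). A sequence name in the table that is missing from the FASTA file raises a KeyError, which is probably what you want so that typos do not pass silently. For large genomes, an indexed tool such as samtools faidx, which accepts regions written like chr1:2-4, is likely a better fit.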
Hey @Aleksandr, what should the FASTA headers look like? Exactly what column 1 of 'a001.coords' says, right?