Entering edit mode
5.9 years ago
kari_vo3
•
0
Hello, i want to change the fasta header of this input file :
>M04631:312:000000000-C6V6K:1:2107:11495:1734 1:N:0:ACTGAGCG+TTATGCGA
>M04631:312:000000000-C6V6K:1:2107:13059:1785 1:N:0:ACTGAGCG+TTATGCGA
In this fasta header:
>adjH001
>adjH002
>adjH....
>adjH099
>adjH100
what script do I have to use??
Looks like you have MiSeq data, please provide the actual context about what you're doing. It should include the words "MiSeq" and "fastq". It's highly unlikely that your actual question will end up actually renaming fasta entries.
You need to explain your naming convention further. There’s no information in those headers to generate what you’re asking for.
If you just want all the sequences named
adjH
followed by some number, how high should these numbers go? How many sequences do you have? Since you want them 0-padded this matters.Are the names arbitrary in order?
i have 300000 reads and then i'll renamed them to a metagenomic analysis :
That's not clear. We still don't know where the
adjH001
identifiers are based on.You need to tell us how:
>adjH001
pairs with>M04631:312:000000000-C6V6K:1:2107:11495:1734 1:N:0:ACTGAGCG+TTATGCGA
or any other sequence. Is it purely based on the order in which they occur?
You may be interested in SEDA (https://www.sing-group.org/seda/), a user-friendly GUI-based tool with a powerful option to rename FASTA headers (https://www.sing-group.org/seda/manual/operations.html#rename-header).