Entering edit mode
2.8 years ago
Confused_human
▴
30
Hi everyone,
I have a dataset of fasta file which looks like this:
>13_seq2344_ATCGACGGAACTGA
>1342_seq2134_AGCTGTGGCAT
>130_SEQ2289_TCGAATCGAGGAAC
I want to remove the line which contains "13" only
so my output should look like:
>1342_seq2134_AGCTGTGGCAT
>130_SEQ2289_TCGAATCGAGGAAC
I am trying grep -w
, grep -o
, grep -E
all these are not working for me.
Do suggest any command that works.
Thank you
or
you also have to think about removing sequences as well. Try this:
seqkit -w 0 grep -vrip "^13_" input.fa
. Awk or sed remove only the matching the line, not the following sequences. It is difficult if sequences are in multlines.