I want to extract seuqneces that partially match the name list. my fasta file:
>4_chromosome_RagTag|c(3245459..3246846)|DNA|DNA-1_Dmon
CTACTTAAATCACTTGATGGCGATCGAATTCAATATAGGTCTATCGATTCTGTTACT
>4_chromosome_RagTag|(3012739..3012917)|DNA|DNA-2_Dmon
CATATATCAAATATTCACTGCAGGGTCTTTGCAGACAATTTTGAAAAAATGCTCACAA
>4_chromosome_RagTag|c(1350081..1350297)|DNA|DNA-2_Dmon
AAAATAAGTATTATTAACAAATCAATTTTTATATTTATTTTTATTATTTTAAATTTA
my namelist:
12471413
3245459
1350081
I am trying this code:
seqkit grep --pattern "^[^]/|/(\D )(\w+) " --pattern-file coordinates file.fasta
so I am trying to tell it to ignore the first part and find the match numbers in the second part, but it doesn't work, how should I fix it?
Using
filterbyname.sh
from BBMap suite: