Hi,
I have a protein fasta file
and a gene list file
. I want to retrieve sequences of all genes in gene list file at one time to save time. I am using this command line:
for i in `cat gene_list.txt` ; do grep -A1 "$i" protein.fasta ; done
However, it only gives sequence for the last gene in the gene_list.txt file. I want to retrieve sequences of all the gene. Thank you for the help!
This can't be a genome file since you would not be able to get individual gene sequences from it.
Best option is to use
faSomeRecords
utility. See: C: How do I extract Fasta Sequences based on a list of IDs?By genome I meant protein. Sorry for the confusion. I have edited it in the question.