11.8 years ago
redspider19800915
Help needed for a simple question:
I have a local database containing millions of sequences in FASTA format, as follows:
>leaf_1
AAGACCATTCGAGCTTATCTCTTC
>leaf_2
ATGGAGAAGGAAATGAAGAGCAGT
>leaf_3
TGGCTGTAAGTCATACCTGTCA
>leaf_4
CGCGGAGTAGATCAGTTTGGTA
>leaf_5
AGTAACGGCTTTACAAGAATCAAA
......
I now have a query list of selected sequences of interest, and I need to extract their sequences from the above database. For example, I need the leaf_2, leaf_4 and leaf_5 sequences to be retrieved and output in tab-delimited format, as follows:
>leaf_2 ATGGAGAAGGAAATGAAGAGCAGT
>leaf_4 CGCGGAGTAGATCAGTTTGGTA
>leaf_5 AGTAACGGCTTTACAAGAATCAAA
Could anyone provide a Perl script for this? Thanks a lot!
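For illustration, here is a minimal sketch of the lookup described above, written in Python rather than the requested Perl. The function names `read_fasta` and `extract` are hypothetical, and the sketch assumes that the sequence ID is the first whitespace-delimited token after `>` and that the query list fits in memory as a set. The database is streamed record by record, so it never has to be loaded whole.

```python
import io


def read_fasta(handle):
    """Yield (seq_id, sequence) pairs from a FASTA-formatted file handle.

    Assumption: the ID is the first whitespace-delimited token after '>'.
    Sequences split over several lines are joined into one string.
    """
    seq_id, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if seq_id is not None:
                yield seq_id, "".join(chunks)
            parts = line[1:].split()
            seq_id, chunks = (parts[0] if parts else ""), []
        elif seq_id is not None:
            chunks.append(line)
    if seq_id is not None:
        yield seq_id, "".join(chunks)


def extract(fasta_handle, wanted_ids):
    """Yield '>id<TAB>sequence' lines for every record whose ID is wanted."""
    wanted = set(wanted_ids)  # set membership keeps each lookup cheap
    for seq_id, seq in read_fasta(fasta_handle):
        if seq_id in wanted:
            yield f">{seq_id}\t{seq}"


if __name__ == "__main__":
    # Small in-memory stand-in for the real database file.
    database = io.StringIO(
        ">leaf_1\nAAGACCATTCGAGCTTATCTCTTC\n"
        ">leaf_2\nATGGAGAAGGAAATGAAGAGCAGT\n"
        ">leaf_3\nTGGCTGTAAGTCATACCTGTCA\n"
        ">leaf_4\nCGCGGAGTAGATCAGTTTGGTA\n"
        ">leaf_5\nAGTAACGGCTTTACAAGAATCAAA\n"
    )
    for row in extract(database, ["leaf_2", "leaf_4", "leaf_5"]):
        print(row)
```

With real files, the `io.StringIO` stand-in would be replaced by an opened database file, and the ID list would be read from the query file, one ID per line. Note that the output follows the order of the database, not the order of the query list.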
http://whathaveyoutried.com/
I think this should be read by anyone on Biostars before asking a question, as it will help them to try first and ask second.