Question

How would I go about picking out certain files in a FASTA file and putting them in a new, separate file?

0

Entering edit mode

3.4 years ago

crisstyl ▴ 10

Hello,

I'm an undergraduate who has been working with quite a few FASTA files for the past few months, and something that has been irking me is my inability to quickly take out a select few sequences from a 'master' fasta and put them into their own dataset.

I have a 'master' fasta file that has ~200 sequences in it, each labelled with their respective name like >NZ_123456. I've been constructing smaller datasets out of the plasmids in the larger one by copying and pasting the sequences I need into a new file, and that has gotten tedious. Would there be a way for me to list out the sequences I need and have their sequences copied from that dataset and put together into a new dataset? I imagine there would be a way to use either the terminal or python to do this, but I am a novice in the field so I am curious about suggestions. Thank you and have a good day!

MSA fasta dataset multiple_sequence_alignment • 1.0k views

ADD COMMENT • link 3.4 years ago by crisstyl ▴ 10

0

Entering edit mode

Please search the forum for similar posts (or google "subset fasta by id". This question has been answered multiple times.

ADD REPLY • link 3.4 years ago by Ram 44k

0

Entering edit mode

Ah, good to know. Thanks! I'll take down my post.

ADD REPLY • link 3.4 years ago by crisstyl ▴ 10

0

Entering edit mode

Please do not delete posts that have received feedback (GenoMax has answered your question already). Instead, accept GenoMax's answer to mark the post as resolved.

ADD REPLY • link 3.4 years ago by Ram 44k

score 0 · Answer 1 · 2021-08-02

0

Entering edit mode

3.4 years ago

GenoMax 148k

See: How do I extract Fasta Sequences based on a list of IDs?

ADD COMMENT • link 3.4 years ago by GenoMax 148k

0

Entering edit mode

Thank you! That should give me ample resources to solve this. Resolved.

ADD REPLY • link 3.4 years ago by crisstyl ▴ 10