I have a huge fasta file of around 20 GB size. I also have some sequence IDS from the same fasta file in text format. Now, I want to retrieve those sequences which don't have those particular ids in the text file.
How shall I proceed? I use Ubuntu 12. I am a novice and have very little knowledge of bash, shell or perl. Any Linux or Samtools or Bioperl command will be helpful.
Thanks.
and also faSomeRecords
Thanks for the commands.
I am a beginner in this field. Can you please tell me what does each component of your command does?
Thanks
Execute each command on your own, then you will understand very easily what each command is doing.